Turkish Intelligence That Fits Anywhere.
Anadolu-350M is a daily-use, WebGPU-oriented SLM built to outperform 1B parameters on utility tasks. Designed for complex Turkish reasoning, rewriting, and autonomous tool execution natively.
Built for Pure Utility & Efficiency
An Anadolu-350M model does not chase generic benchmarks. It is strictly optimized for professional Turkish text generation, reliable JSON outputs, and ultra-high efficiency on edge devices.
Turkish Hybrid Tokenizer
Designed with dictionary-driven root segmentation and affix-aware fallback. Unites morphological segmentation with continued BPE to drastically reduce tokens per character, improving context length and reducing inference cost.
Hybrid Linear Attention + Residuals (HLAR)
Uses a 3:1 ratio of efficient KDA linear blocks to global causal attention blocks. Greatly reduces KV cache memory footprint, leading to explosive decode speeds and robust 8K context handling.
Tool Routing & Web Search
Not a frozen knowledge store, but a routing engine. Distilled specifically to output valid structured schemas like web_search(), extract_page_text(), and structured_extract() to ground its answers reliably.
Product-First Benchmark Approach
Trained to outperform 0.8B models specifically on Turkish linguistic quality and utility tasks. Evaluated using CETVEL and TurkBench.
CETVEL (Turkish Core Eval)
Tests morphology, sentiment, factual Turkish reasoning.
IFEval (Instruction Following)
Tests structured formatting, concise output, JSON extraction.
Designed for Deployment Reality
Quantizes perfectly to 4-bit and int8. Run the model entirely on the edge via WebGPU with sub-second TTFT.