Beta
Use case · Writing

Cheapest writing LLMs

Best-value models for general writing · blogs, marketing, drafts, summaries. Ranked by input token price with quality score.

Models30
Cheapest$0.01
ScopeLLM · Multimodal
What this page is
This page ranks general-purpose LLMs (text and multimodal chat models) for writing workloads. For bulk writing, value per dollar is what matters: the quality gap between the cheapest and the most expensive model is much smaller than the price gap. Our "top picks" use a quality-to-price ratio, not just raw benchmarks. For specialized coding or reasoning writing, see the dedicated pages.

General-purpose LLMs, cheapest first.

#ModelIn $/1MOut $/1MType
1liquid logoLFM2-2.6B$0.01$0.02OSS
2liquid logoLFM2-8B-A1B$0.01$0.02OSS
3ibm-granite logoGranite 4.0 Micro$0.02$0.11OSS
4Google DeepMind logoGemma 3n 4B$0.02$0.04OSS
5Meta logoLlama 3.1 8B Instruct$0.02$0.05OSS
6Mistral AI logoMistral Nemo$0.02$0.04OSS
7Meta logoLlama 3.2 1B Instruct$0.03$0.20OSS
8Google DeepMind logoGemma 2 9B$0.03$0.09OSS
9OpenAI logogpt-oss-20b$0.03$0.14OSS
10liquid logoLFM2-24B-A2B$0.03$0.12OSS
11Meta logoLlama 3 8B Instruct$0.03$0.04OSS
12Alibaba Qwen logoQwen2.5 Coder 7B Instruct$0.03$0.09OSS
13Alibaba Qwen logoQwen-Turbo$0.03$0.13OSS
14Amazon logoNova Micro 1.0$0.04$0.14Closed
15Cohere logoCommand R7B (12-2024)$0.04$0.15Closed
16OpenAI logogpt-oss-120b$0.04$0.19OSS
17Google DeepMind logoGemma 3 12B$0.04$0.13OSS
18Google DeepMind logoGemma 3 4B$0.04$0.08OSS
19sao10k logoLlama 3 8B Lunaris$0.04$0.05OSS
20NVIDIA logoNemotron Nano 9B V2$0.04$0.16OSS
21Alibaba Qwen logoQwen2.5 7B Instruct$0.04$0.10OSS
22arcee-ai logoTrinity Mini$0.04$0.15OSS
23OpenAI logoGPT-5 Nano$0.05$0.40Closed
24Mistral AI logoMistral Small 3$0.05$0.08OSS
25NVIDIA logoNemotron 3 Nano 30B A3B$0.05$0.20OSS
26allenai logoOlmo 2 32B Instruct$0.05$0.20OSS
27Alibaba Qwen logoQwen3 8B$0.05$0.40OSS
28Alibaba Qwen logoQwen3.5-9B$0.05$0.15OSS
29Meta logoLlama 3.2 3B Instruct$0.05$0.34OSS
30z-ai logoGLM 4.7 Flash$0.06$0.40OSS
Cheapest
LFM2-2.6B
$0.01/M
$ per 1M input tokens
Why the gap

The premium on writing models is almost entirely brand voice and tone. The quality gap on factual accuracy, grammar, and structure is within noise for most real-world writing jobs.

Most expensive
GLM 4.7 Flash
$0.06/M
$ per 1M input tokens
Not in 2026. Models like Gemini Flash, Claude Haiku, GPT-4.1 mini, and DeepSeek V3 produce polished prose at a fraction of premium prices. For routine drafts, newsletters, and social posts, there is almost no quality gap.