Beta
Budget · Under $20/M

LLMs under $20 per million tokens

Every model priced below $20 per 1M input tokens. Ranked by benchmark score within budget. The "I care about quality" tier.

Models40
Top quality81.9
Max price$20.00/M
What this page is
This page ranks every model with input priced below $20 per million tokens, highest quality first. The under-$20 tier includes near-frontier models like Claude Sonnet and Gemini 2.5 Pro along with strong reasoning models. Above this tier you enter Opus and GPT-5 territory. Use this page when you care about quality but still want to keep bills sane.

Input under $20/M, highest benchmark score first.

#ModelIn $/1MOut $/1MType
1OpenAI logoGPT-5 Chat$1.25$10.00Closed
2Alibaba Qwen logoQwen3.5 397B A17B$0.39$2.34OSS
3DeepSeek logoDeepSeek V3.2 Speciale$0.40$1.20OSS
4Google DeepMind logoGemini 2.5 Pro Preview 05-06$1.25$10.00Closed
5stepfun logoStep 3.5 Flash$0.10$0.30OSS
6xiaomi logoMiMo-V2-Flash$0.09$0.29OSS
7OpenAI logoGPT-5.1-Codex-Max$1.25$10.00Closed
8OpenAI logoo4 Mini High$1.10$4.40Closed
9Alibaba Qwen logoQwen3.6 Plus$0.33$1.95OSS
10OpenAI logoGPT-5.2-Codex$1.75$14.00Closed
11z-ai logoGLM 5.1$0.95$3.15OSS
12writer logoPalmyra X5$0.60$6.00Closed
13xAI logoGrok 3 Beta$3.00$15.00Closed
14minimax logoMiniMax M2$0.26$1.00OSS
15z-ai logoGLM 4.5$0.60$2.20OSS
16OpenAI logogpt-oss-120b (free)$0.00$0.00OSS
17OpenAI logoGPT-5.1-Codex$1.25$10.00Closed
18OpenAI logogpt-oss-20b$0.03$0.14OSS
19OpenAI logogpt-oss-20b (free)$0.00$0.00OSS
20Alibaba Qwen logoQwen3 30B A3B Thinking 2507$0.08$0.40OSS
21OpenAI logoGPT-4 Turbo (older v1106)$10.00$30.00Closed
22xAI logoGrok 3 Mini Beta$0.30$0.50Closed
23Google DeepMind logoGemini 2.0 Flash Lite$0.07$0.30Closed
24minimax logoMiniMax M2.7$0.30$1.20OSS
25Google DeepMind logoGemma 4 31B$0.13$0.38OSS
26Alibaba Qwen logoQwen3 Next 80B A3B Thinking$0.10$0.78OSS
27Google DeepMind logoGemini 3.1 Pro Preview$2.00$12.00Closed
28OpenAI logoGPT-5.1-Codex-Mini$0.25$2.00Closed
29OpenAI logoo3 Mini High$1.10$4.40Closed
30meituan logoLongCat Flash Chat$0.20$0.80OSS
31Google DeepMind logoGemini 2.5 Flash Lite$0.10$0.40Closed
32DeepSeek logoDeepSeek V3$0.32$0.89OSS
33OpenAI logoGPT-5.4$2.50$15.00Closed
34Alibaba Qwen logoQwen3 Max$0.78$3.90OSS
35Alibaba Qwen logoQwen3 32B$0.08$0.24OSS
36xiaomi logoMiMo-V2-Pro$1.00$3.00Closed
37DeepSeek logoR1 0528$0.50$2.15OSS
38Mistral AI logoMixtral 8x7B Instruct$0.54$0.54OSS
39z-ai logoGLM 5$0.72$2.30OSS
40Anthropic logoClaude Opus 4.6$5.00$25.00Closed
Cheapest
GPT-5 Chat
$1.25/M
$ per 1M input tokens
Why the gap

At the top of this tier, pricing approaches frontier. The delta to true frontier (Opus, GPT-5) is 2x to 5x for roughly 5 to 15 percent quality gain on most tasks.

Most expensive
GPT-4 Turbo (older v1106)
$10.00/M
$ per 1M input tokens
Because most workloads don't need this tier. $20/M catches Claude Sonnet, Gemini 2.5 Pro Ultra tiers, some specialized reasoning models. For a 10K-token prompt, that's $0.20 just for input. At scale, it adds up fast.