
LLMs under $20 per million tokens

Every model priced below $20 per 1M input tokens. Ranked by benchmark score within budget. The "I care about quality" tier.

Models: 40
Top quality: 85.0
Max price: $20.00/M
What this page is
This page ranks every model with input priced below $20 per million tokens, highest quality first. The under-$20 tier includes near-frontier models like Claude Sonnet and Gemini 2.5 Pro along with strong reasoning models. Above this tier you enter Opus and GPT-5 territory. Use this page when you care about quality but still want to keep bills sane.

Input under $20/M, highest benchmark score first.

| # | Provider | Model | In $/1M | Out $/1M | Type |
|---|---|---|---|---|---|
| 1 | OpenAI | GPT-5.5 | $5.00 | $30.00 | Closed |
| 2 | OpenAI | GPT-5 Chat | $1.25 | $10.00 | Closed |
| 3 | Alibaba Qwen | Qwen3.5 397B A17B | $0.39 | $2.34 | OSS |
| 4 | DeepSeek | DeepSeek V3.2 Speciale | $0.40 | $1.20 | OSS |
| 5 | Google DeepMind | Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | Closed |
| 6 | stepfun | Step 3.5 Flash | $0.10 | $0.30 | OSS |
| 7 | xiaomi | MiMo-V2-Flash | $0.09 | $0.29 | OSS |
| 8 | OpenAI | GPT-5.1-Codex-Max | $1.25 | $10.00 | Closed |
| 9 | OpenAI | o4 Mini High | $1.10 | $4.40 | Closed |
| 10 | Alibaba Qwen | Qwen3.6 Plus | $0.33 | $1.95 | OSS |
| 11 | OpenAI | GPT-5.2-Codex | $1.75 | $14.00 | Closed |
| 12 | z-ai | GLM 5.1 | $1.05 | $3.50 | OSS |
| 13 | writer | Palmyra X5 | $0.60 | $6.00 | Closed |
| 14 | xAI | Grok 3 Beta | $3.00 | $15.00 | Closed |
| 15 | minimax | MiniMax M2 | $0.26 | $1.00 | OSS |
| 16 | z-ai | GLM 4.5 | $0.60 | $2.20 | OSS |
| 17 | OpenAI | gpt-oss-120b (free) | $0.00 | $0.00 | OSS |
| 18 | OpenAI | GPT-5.1-Codex | $1.25 | $10.00 | Closed |
| 19 | OpenAI | gpt-oss-20b | $0.03 | $0.14 | OSS |
| 20 | OpenAI | gpt-oss-20b (free) | $0.00 | $0.00 | OSS |
| 21 | Alibaba Qwen | Qwen3 30B A3B Thinking 2507 | $0.08 | $0.40 | OSS |
| 22 | OpenAI | GPT-4 Turbo (older v1106) | $10.00 | $30.00 | Closed |
| 23 | xAI | Grok 3 Mini Beta | $0.30 | $0.50 | Closed |
| 24 | Google DeepMind | Gemini 2.0 Flash Lite | $0.07 | $0.30 | Closed |
| 25 | minimax | MiniMax M2.7 | $0.30 | $1.20 | OSS |
| 26 | Google DeepMind | Gemma 4 31B | $0.13 | $0.38 | OSS |
| 27 | Alibaba Qwen | Qwen3 Next 80B A3B Thinking | $0.10 | $0.78 | OSS |
| 28 | Google DeepMind | Gemini 3.1 Pro Preview | $2.00 | $12.00 | Closed |
| 29 | OpenAI | GPT-5.1-Codex-Mini | $0.25 | $2.00 | Closed |
| 30 | OpenAI | o3 Mini High | $1.10 | $4.40 | Closed |
| 31 | meituan | LongCat Flash Chat | $0.20 | $0.80 | OSS |
| 32 | Google DeepMind | Gemini 2.5 Flash Lite | $0.10 | $0.40 | Closed |
| 33 | DeepSeek | DeepSeek V3 | $0.32 | $0.89 | OSS |
| 34 | OpenAI | GPT-5.4 | $2.50 | $15.00 | Closed |
| 35 | Alibaba Qwen | Qwen3 Max | $0.78 | $3.90 | OSS |
| 36 | Alibaba Qwen | Qwen3 32B | $0.08 | $0.24 | OSS |
| 37 | xiaomi | MiMo-V2-Pro | $1.00 | $3.00 | Closed |
| 38 | DeepSeek | R1 0528 | $0.50 | $2.15 | OSS |
| 39 | Mistral AI | Mixtral 8x7B Instruct | $0.54 | $0.54 | OSS |
| 40 | z-ai | GLM 5 | $0.60 | $1.92 | OSS |
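
The table lists input and output prices separately, so what a request actually costs depends on its mix of input and output tokens. A minimal sketch of that calculation, assuming a hypothetical workload of 10,000 input tokens and 1,000 output tokens per request. The prices are copied from four rows of the table above; the name `request_cost` and the workload size are illustrative assumptions, not something this page defines.

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "GPT-5.5": (5.00, 30.00),
    "GPT-5 Chat": (1.25, 10.00),
    "Qwen3.5 397B A17B": (0.39, 2.34),
    "DeepSeek V3.2 Speciale": (0.40, 1.20),
}

def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request for the given token counts."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000

# Hypothetical workload: 10K input tokens and 1K output tokens per request.
# Print the models from cheapest to most expensive for that workload.
for name in sorted(PRICES, key=lambda m: request_cost(m, 10_000, 1_000)):
    print(f"{name}: ${request_cost(name, 10_000, 1_000):.4f}")
```

Because output is priced several times higher than input on most rows, output-heavy workloads can reorder models relative to a ranking by input price alone.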
Cheapest: gpt-oss-120b (free) and gpt-oss-20b (free), $0.00 per 1M input tokens
Why the gap

At the top of this tier, pricing approaches frontier levels. Stepping up to true frontier models (Opus, GPT-5) costs 2x to 5x more for roughly a 5 to 15 percent quality gain on most tasks.

Most expensive: GPT-4 Turbo (older v1106), $10.00 per 1M input tokens
Most workloads don't need this tier. A $20/M ceiling catches Claude Sonnet, the Gemini 2.5 Pro and Ultra tiers, and some specialized reasoning models. For a 10K-token prompt priced at $20/M, that is $0.20 for input alone. At scale, it adds up fast.
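
The input-cost arithmetic above is the token count divided by one million, multiplied by the per-million price. A small sketch of that calculation and of how it scales with volume. The figure of 100,000 requests per day is a hypothetical chosen for illustration, not a number from this page, and `input_cost` is an illustrative name.

```python
def input_cost(tokens, price_per_million):
    """USD cost of sending `tokens` input tokens at a per-1M-token price."""
    return tokens / 1_000_000 * price_per_million

# The 10K-token prompt at the $20/M ceiling of this tier.
per_request = input_cost(10_000, 20.00)

# Hypothetical volume, assumed for illustration: 100,000 requests per day.
per_day = per_request * 100_000

print(f"per request: ${per_request:.2f}")
print(f"per day:     ${per_day:,.2f}")
```

The same function applies to any row of the table by swapping in that model's input price.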