Beta
Budget · Under $1/M

LLMs under $1 per million tokens

Every model priced below $1 per 1M input tokens. Ranked by benchmark score within budget.

Models40
Top quality78.4
Max price$1.00/M
What this page is
This page lists every model with input priced below $1 per million tokens, ranked by benchmark quality. This is the volume tier · most production traffic for chat, classification, extraction, and bulk content runs on sub-$1 models. The quality variance within this budget is huge, so the ranking matters.

Input under $1/M, highest benchmark score first.

#ModelIn $/1MOut $/1MType
1Alibaba Qwen logoQwen3.5 397B A17B$0.39$2.34OSS
2DeepSeek logoDeepSeek V3.2 Speciale$0.40$1.20OSS
3stepfun logoStep 3.5 Flash$0.10$0.30OSS
4xiaomi logoMiMo-V2-Flash$0.09$0.29OSS
5Alibaba Qwen logoQwen3.6 Plus$0.33$1.95OSS
6z-ai logoGLM 5.1$0.95$3.15OSS
7writer logoPalmyra X5$0.60$6.00Closed
8minimax logoMiniMax M2$0.26$1.00OSS
9z-ai logoGLM 4.5$0.60$2.20OSS
10OpenAI logogpt-oss-120b (free)$0.00$0.00OSS
11OpenAI logogpt-oss-20b$0.03$0.14OSS
12OpenAI logogpt-oss-20b (free)$0.00$0.00OSS
13Alibaba Qwen logoQwen3 30B A3B Thinking 2507$0.08$0.40OSS
14xAI logoGrok 3 Mini Beta$0.30$0.50Closed
15Google DeepMind logoGemini 2.0 Flash Lite$0.07$0.30Closed
16minimax logoMiniMax M2.7$0.30$1.20OSS
17Google DeepMind logoGemma 4 31B$0.13$0.38OSS
18Alibaba Qwen logoQwen3 Next 80B A3B Thinking$0.10$0.78OSS
19OpenAI logoGPT-5.1-Codex-Mini$0.25$2.00Closed
20meituan logoLongCat Flash Chat$0.20$0.80OSS
21Google DeepMind logoGemini 2.5 Flash Lite$0.10$0.40Closed
22DeepSeek logoDeepSeek V3$0.32$0.89OSS
23Alibaba Qwen logoQwen3 Max$0.78$3.90OSS
24Alibaba Qwen logoQwen3 32B$0.08$0.24OSS
25DeepSeek logoR1 0528$0.50$2.15OSS
26Mistral AI logoMixtral 8x7B Instruct$0.54$0.54OSS
27z-ai logoGLM 5$0.72$2.30OSS
28baidu logoERNIE 4.5 21B A3B Thinking$0.07$0.28OSS
29Alibaba Qwen logoQwen3 8B$0.05$0.40OSS
30Alibaba Qwen logoQwen3 235B A22B$0.46$1.82OSS
31moonshotai logoKimi K2 0711$0.57$2.30OSS
32OpenAI logoGPT-5 Mini$0.25$2.00Closed
33Alibaba Qwen logoQwen3 235B A22B Thinking 2507$0.15$1.50OSS
34Mistral AI logoMistral Small 3.1 24B$0.35$0.56OSS
35Alibaba Qwen logoQwen3 30B A3B Instruct 2507$0.09$0.30OSS
36DeepSeek logoDeepSeek V3 0324$0.20$0.77OSS
37minimax logoMiniMax M2.5$0.12$0.99OSS
38Alibaba Qwen logoQwen3 Next 80B A3B Instruct$0.09$1.10OSS
39moonshotai logoKimi K2 Thinking$0.60$2.50OSS
40DeepSeek logoDeepSeek V3.2 Exp$0.27$0.41OSS
Cheapest
Qwen3.5 397B A17B
$0.39/M
$ per 1M input tokens
Why the gap

At the sub-$1 tier, the "most expensive" is still cheap. The tradeoff is raw benchmark quality vs provider reliability. Open-source models dominate the cheap end.

Most expensive
GLM 5.1
$0.95/M
$ per 1M input tokens
Most open-source models (DeepSeek V3, Qwen3.5, GLM-4.6, Llama 3.3) plus small proprietary models (Gemini Flash, GPT-4o-mini, Claude Haiku) on various providers. The cheap end clusters around $0.05 to $0.30 per million.