Beta
Use case · Reasoning

Cheapest reasoning LLMs

The cheapest models that hold up on GPQA, AIME, MATH, MMLU, HLE. Ranked by price per 1M input tokens.

Models30
Cheapest$0.00
ScopeGPQA · AIME · MATH
What this page is
This page ranks every model with credible reasoning scores (GPQA, AIME, MATH, MMLU, HLE, DROP, BBH) by input price. Reasoning models burn a lot of thinking tokens, so the headline input price is only part of the bill. The cheap end is dominated by open-source reasoners like DeepSeek, Qwen3, and GLM. Premium o-series and Claude Opus sit at the top of the price scale. Pair with our cost calculator to model real workloads.

Models with credible reasoning scores, cheapest first.

#ModelIn $/1MOut $/1MType
1Google DeepMind logoGemma 3 27B (free)$0.00$0.00OSS
2OpenAI logogpt-oss-120b (free)$0.00$0.00OSS
3OpenAI logogpt-oss-20b (free)$0.00$0.00OSS
4Meta logoLlama 3.2 3B Instruct (free)$0.00$0.00OSS
5Meta logoLlama 3.3 70B Instruct (free)$0.00$0.00OSS
6Meta logoLlama 3.1 8B Instruct$0.02$0.05OSS
7Mistral AI logoMistral Nemo$0.02$0.04OSS
8Meta logoLlama 3.2 1B Instruct$0.03$0.20OSS
9Google DeepMind logoGemma 2 9B$0.03$0.09OSS
10OpenAI logogpt-oss-20b$0.03$0.14OSS
11Meta logoLlama 3 8B Instruct$0.03$0.04OSS
12Alibaba Qwen logoQwen2.5 Coder 7B Instruct$0.03$0.09OSS
13OpenAI logogpt-oss-120b$0.04$0.19OSS
14Alibaba Qwen logoQwen2.5 7B Instruct$0.04$0.10OSS
15OpenAI logoGPT-5 Nano$0.05$0.40Closed
16Alibaba Qwen logoQwen3 8B$0.05$0.40OSS
17Meta logoLlama 3.2 3B Instruct$0.05$0.34OSS
18Microsoft logoPhi 4$0.07$0.14OSS
19baidu logoERNIE 4.5 21B A3B Thinking$0.07$0.28OSS
20Alibaba Qwen logoQwen3 235B A22B Instruct 2507$0.07$0.10OSS
21Google DeepMind logoGemini 2.0 Flash Lite$0.07$0.30Closed
22Google DeepMind logoGemma 3 27B$0.08$0.16OSS
23Meta logoLlama 4 Scout$0.08$0.30OSS
24Alibaba Qwen logoQwen3 30B A3B Thinking 2507$0.08$0.40OSS
25Alibaba Qwen logoQwen3 32B$0.08$0.24OSS
26xiaomi logoMiMo-V2-Flash$0.09$0.29OSS
27Alibaba Qwen logoQwen3 30B A3B Instruct 2507$0.09$0.30OSS
28Alibaba Qwen logoQwen3 Next 80B A3B Instruct$0.09$1.10OSS
29Alibaba Qwen logoQwen3 Next 80B A3B Thinking$0.10$0.78OSS
30Google DeepMind logoGemini 2.0 Flash$0.10$0.40Closed
Cheapest
Gemma 3 27B (free)
$0.00/M
$ per 1M input tokens
Why the gap

Premium reasoners pay for longer thinking budgets, better tool use, and vendor reliability. For many tasks, Gemma 3 27B (free) closes 70 to 90 percent of the GPQA gap at a fraction of the cost.

Most expensive
Gemini 2.0 Flash
$0.10/M
$ per 1M input tokens
Models with explicit reasoning scores on GPQA Diamond, AIME 2024/2025, MATH-500, MMLU-Pro, HLE, DROP, BBH, or ARC-AGI. Reasoning models typically use extended chain-of-thought and burn more tokens on hard problems.