Beta
Use case · Agents

Cheapest agent LLMs

The cheapest LLMs that score well on agentic benchmarks. Ranked by price per 1M input tokens.

Models30
Cheapest$0.00
ScopeSWE · tau · WebArena
What this page is
This page ranks every LLM with meaningful agentic scores (SWE-bench, tau-bench, WebArena, GAIA, AgentBench) by input price. Agent workloads spike token consumption because of looping, tool schemas, and retries. Choosing a cheaper model for routine tool calls while reserving frontier models for planning can cut agent bills by 50 to 90 percent.

Models with agent benchmark scores, cheapest first.

#ModelIn $/1MOut $/1MType
1Google DeepMind logoGemma 4 26B A4B (free)$0.00$0.00OSS
2Google DeepMind logoGemma 4 31B (free)$0.00$0.00OSS
3OpenAI logogpt-oss-120b (free)$0.00$0.00OSS
4OpenAI logogpt-oss-20b (free)$0.00$0.00OSS
5liquid logoLFM2.5-1.2B-Instruct (free)$0.00$0.00OSS
6liquid logoLFM2.5-1.2B-Thinking (free)$0.00$0.00OSS
7Alibaba Qwen logoQwen3 Coder 480B A35B (free)$0.00$0.00OSS
8Alibaba Qwen logoQwen3 Next 80B A3B Instruct (free)$0.00$0.00OSS
9ibm-granite logoGranite 4.0 Micro$0.02$0.11OSS
10liquid logoLFM2-24B-A2B$0.03$0.12OSS
11OpenAI logogpt-oss-120b$0.04$0.19OSS
12OpenAI logoGPT-5 Nano$0.05$0.40Closed
13Alibaba Qwen logoQwen3.5-9B$0.05$0.15OSS
14Alibaba Qwen logoQwen3 235B A22B Instruct 2507$0.07$0.10OSS
15Meta logoLlama 4 Scout$0.08$0.30OSS
16xiaomi logoMiMo-V2-Flash$0.09$0.29OSS
17Alibaba Qwen logoQwen3 Next 80B A3B Instruct$0.09$1.10OSS
18Alibaba Qwen logoQwen3 Next 80B A3B Thinking$0.10$0.78OSS
19Google DeepMind logoGemini 2.0 Flash$0.10$0.40Closed
20Google DeepMind logoGemini 2.5 Flash Lite$0.10$0.40Closed
21NVIDIA logoNemotron 3 Super$0.10$0.50OSS
22stepfun logoStep 3.5 Flash$0.10$0.30OSS
23minimax logoMiniMax M2.5$0.12$0.99OSS
24Alibaba Qwen logoQwen2.5 72B Instruct$0.12$0.39OSS
25Google DeepMind logoGemma 4 31B$0.13$0.38OSS
26Alibaba Qwen logoQwen3 235B A22B Thinking 2507$0.15$1.50OSS
27Meta logoLlama 4 Maverick$0.15$0.60OSS
28Mistral AI logoMistral Small 4$0.15$0.60OSS
29Alibaba Qwen logoQwen3 Coder Next$0.15$0.80OSS
30upstage logoSolar Pro 3$0.15$0.60Closed
Cheapest
Gemma 4 26B A4B (free)
$0.00/M
$ per 1M input tokens
Why the gap

Premium agent models pay for more reliable tool use, fewer hallucinated tool names, and longer-horizon planning. For simple tool loops, the cheap end works fine.

Most expensive
Llama 4 Maverick
$0.15/M
$ per 1M input tokens
Agents loop. A single task may produce 10 to 100 LLM calls with heavy context re-use. That amplifies input costs (because the system prompt and tool schema repeat) and multiplies output costs for every thinking step.