Compare · ModelsLive · 2 picked · head to head
Llama 4 Maverick vs gpt-oss-120b (free)
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
gpt-oss-120b (free) wins on 4/4 benchmarks
gpt-oss-120b (free) wins 4 of 4 shared benchmarks. Leads in speed · coding.
Category leads
speed·gpt-oss-120b (free)coding·gpt-oss-120b (free)
Hype vs Reality
Attention vs performance
Llama 4 Maverick
#193 by perf·no signal
gpt-oss-120b (free)
#20 by perf·no signal
Best value
Llama 4 Maverick
Llama 4 Maverick
74.7 pts/$
$0.38/M
gpt-oss-120b (free)
—
$0.00/M
Vendor risk
Who is behind the model
Meta AI
$1.50T·Tier 1
OpenAI
$840.0B·Tier 1
Head to head
4 benchmarks · 2 models
Llama 4 Maverickgpt-oss-120b (free)
Artificial Analysis · Agentic Index
gpt-oss-120b (free) leads by +30.6
Llama 4 Maverick
7.2
gpt-oss-120b (free)
37.9
Artificial Analysis · Coding Index
gpt-oss-120b (free) leads by +13.0
Llama 4 Maverick
15.6
gpt-oss-120b (free)
28.6
Artificial Analysis · Quality Index
gpt-oss-120b (free) leads by +14.9
Llama 4 Maverick
18.4
gpt-oss-120b (free)
33.3
Aider polyglot
gpt-oss-120b (free) leads by +26.2
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
Llama 4 Maverick
15.6
gpt-oss-120b (free)
41.8
Full benchmark table
| Benchmark | Llama 4 Maverick | gpt-oss-120b (free) |
|---|---|---|
Artificial Analysis · Agentic Index | 7.2 | 37.9 |
Artificial Analysis · Coding Index | 15.6 | 28.6 |
Artificial Analysis · Quality Index | 18.4 | 33.3 |
Aider polyglot Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework. | 15.6 | 41.8 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.15 | $0.60 | 1.0M tokens (~524 books) | $2.62 | |
| $0.00 | $0.00 | 131K tokens (~66 books) | — |
People also compared