Compare · ModelsLive · 2 picked · head to head
o3 vs gpt-oss-120b (free)
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
o3 wins on 3/4 benchmarks
o3 wins 3 of 4 shared benchmarks. Leads in speed · coding.
Category leads
speed·o3coding·o3
Hype vs Reality
Attention vs performance
o3
#67 by perf·no signal
gpt-oss-120b (free)
#20 by perf·no signal
Vendor risk
Who is behind the model
OpenAI
$840.0B·Tier 1
OpenAI
$840.0B·Tier 1
Head to head
4 benchmarks · 2 models
o3gpt-oss-120b (free)
Artificial Analysis · Agentic Index
gpt-oss-120b (free) leads by +1.8
o3
36.1
gpt-oss-120b (free)
37.9
Artificial Analysis · Coding Index
o3 leads by +9.8
o3
38.4
gpt-oss-120b (free)
28.6
Artificial Analysis · Quality Index
o3 leads by +5.1
o3
38.4
gpt-oss-120b (free)
33.3
Aider polyglot
o3 leads by +39.5
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
o3
81.3
gpt-oss-120b (free)
41.8
Full benchmark table
| Benchmark | o3 | gpt-oss-120b (free) |
|---|---|---|
Artificial Analysis · Agentic Index | 36.1 | 37.9 |
Artificial Analysis · Coding Index | 38.4 | 28.6 |
Artificial Analysis · Quality Index | 38.4 | 33.3 |
Aider polyglot Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework. | 81.3 | 41.8 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $2.00 | $8.00 | 200K tokens (~100 books) | $35.00 | |
| $0.00 | $0.00 | 131K tokens (~66 books) | — |