Beta
Compare · ModelsLive · 2 picked · head to head

Gemini 2.5 Pro vs gpt-oss-120b (free)

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Gemini 2.5 Pro wins 6 of 10 shared benchmarks. Leads in speed · coding · knowledge.

Category leads
speed·Gemini 2.5 Procoding·Gemini 2.5 Promath·gpt-oss-120b (free)knowledge·Gemini 2.5 Prolanguage·gpt-oss-120b (free)
Hype vs Reality
Gemini 2.5 Pro
#59 by perf·no signal
QUIET
gpt-oss-120b (free)
#20 by perf·no signal
QUIET
Best value
Gemini 2.5 Pro
10.0 pts/$
$5.63/M
gpt-oss-120b (free)
$0.00/M
Vendor risk
Google DeepMind logo
Google DeepMind
$4.00T·Tier 1
Low risk
OpenAI logo
OpenAI
$840.0B·Tier 1
Medium risk
Head to head
Gemini 2.5 Progpt-oss-120b (free)
Artificial Analysis · Agentic Index
gpt-oss-120b (free) leads by +5.2
Gemini 2.5 Pro
32.7
gpt-oss-120b (free)
37.9
Artificial Analysis · Coding Index
Gemini 2.5 Pro leads by +3.3
Gemini 2.5 Pro
31.9
gpt-oss-120b (free)
28.6
Artificial Analysis · Quality Index
Gemini 2.5 Pro leads by +1.4
Gemini 2.5 Pro
34.6
gpt-oss-120b (free)
33.3
Aider polyglot
Gemini 2.5 Pro leads by +41.3
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
Gemini 2.5 Pro
83.1
gpt-oss-120b (free)
41.8
OpenCompass · AIME2025
gpt-oss-120b (free) leads by +4.7
Gemini 2.5 Pro
88.7
gpt-oss-120b (free)
93.4
OpenCompass · GPQA-Diamond
Gemini 2.5 Pro leads by +5.8
Gemini 2.5 Pro
84.7
gpt-oss-120b (free)
78.9
OpenCompass · HLE
Gemini 2.5 Pro leads by +2.8
Gemini 2.5 Pro
21.1
gpt-oss-120b (free)
18.3
OpenCompass · IFEval
gpt-oss-120b (free) leads by +0.2
Gemini 2.5 Pro
90.0
gpt-oss-120b (free)
90.2
OpenCompass · LiveCodeBenchV6
gpt-oss-120b (free) leads by +7.1
Gemini 2.5 Pro
71.3
gpt-oss-120b (free)
78.4
OpenCompass · MMLU-Pro
Gemini 2.5 Pro leads by +6.1
Gemini 2.5 Pro
85.8
gpt-oss-120b (free)
79.7
Full benchmark table
BenchmarkGemini 2.5 Progpt-oss-120b (free)
Artificial Analysis · Agentic Index
32.737.9
Artificial Analysis · Coding Index
31.928.6
Artificial Analysis · Quality Index
34.633.3
Aider polyglot
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
83.141.8
OpenCompass · AIME2025
88.793.4
OpenCompass · GPQA-Diamond
84.778.9
OpenCompass · HLE
21.118.3
OpenCompass · IFEval
90.090.2
OpenCompass · LiveCodeBenchV6
71.378.4
OpenCompass · MMLU-Pro
85.879.7
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
Google DeepMind logoGemini 2.5 Pro$1.25$10.001.0M tokens (~524 books)$34.38
OpenAI logogpt-oss-120b (free)$0.00$0.00131K tokens (~66 books)