Compare · ModelsLive · 2 picked · head to head
gpt-oss-20b (free) vs Kimi K2 Thinking
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Kimi K2 Thinking wins on 6/6 benchmarks
Kimi K2 Thinking wins 6 of 6 shared benchmarks. Leads in math · knowledge · language.
Category leads
math·Kimi K2 Thinkingknowledge·Kimi K2 Thinkinglanguage·Kimi K2 Thinkingcoding·Kimi K2 Thinking
Hype vs Reality
Attention vs performance
gpt-oss-20b (free)
#27 by perf·no signal
Kimi K2 Thinking
#79 by perf·no signal
Best value
Kimi K2 Thinking
gpt-oss-20b (free)
—
$0.00/M
Kimi K2 Thinking
34.4 pts/$
$1.55/M
Vendor risk
Who is behind the model
OpenAI
$840.0B·Tier 1
moonshotai
private · undisclosed
Head to head
6 benchmarks · 2 models
gpt-oss-20b (free)Kimi K2 Thinking
OpenCompass · AIME2025
Kimi K2 Thinking leads by +6.2
gpt-oss-20b (free)
87.9
Kimi K2 Thinking
94.1
OpenCompass · GPQA-Diamond
Kimi K2 Thinking leads by +13.8
gpt-oss-20b (free)
68.9
Kimi K2 Thinking
82.7
OpenCompass · HLE
Kimi K2 Thinking leads by +9.7
gpt-oss-20b (free)
11.6
Kimi K2 Thinking
21.3
OpenCompass · IFEval
Kimi K2 Thinking leads by +3.5
gpt-oss-20b (free)
88.9
Kimi K2 Thinking
92.4
OpenCompass · LiveCodeBenchV6
Kimi K2 Thinking leads by +8.7
gpt-oss-20b (free)
68.4
Kimi K2 Thinking
77.1
OpenCompass · MMLU-Pro
Kimi K2 Thinking leads by +11.5
gpt-oss-20b (free)
72.8
Kimi K2 Thinking
84.3
Full benchmark table
| Benchmark | gpt-oss-20b (free) | Kimi K2 Thinking |
|---|---|---|
OpenCompass · AIME2025 | 87.9 | 94.1 |
OpenCompass · GPQA-Diamond | 68.9 | 82.7 |
OpenCompass · HLE | 11.6 | 21.3 |
OpenCompass · IFEval | 88.9 | 92.4 |
OpenCompass · LiveCodeBenchV6 | 68.4 | 77.1 |
OpenCompass · MMLU-Pro | 72.8 | 84.3 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.00 | $0.00 | 131K tokens (~66 books) | — | |
| $0.60 | $2.50 | 262K tokens (~131 books) | $10.75 |
People also compared