Compare · ModelsLive · 2 picked · head to head
DeepSeek V3.2 Speciale vs gpt-oss-120b (free)
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
DeepSeek V3.2 Speciale wins on 7/9 benchmarks
DeepSeek V3.2 Speciale wins 7 of 9 shared benchmarks. Leads in math · knowledge · language.
Category leads
speed·gpt-oss-120b (free)math·DeepSeek V3.2 Specialeknowledge·DeepSeek V3.2 Specialelanguage·DeepSeek V3.2 Specialecoding·DeepSeek V3.2 Speciale
Hype vs Reality
Attention vs performance
DeepSeek V3.2 Speciale
#4 by perf·#5 by attention
gpt-oss-120b (free)
#20 by perf·no signal
Best value
DeepSeek V3.2 Speciale
DeepSeek V3.2 Speciale
97.8 pts/$
$0.80/M
gpt-oss-120b (free)
—
$0.00/M
Vendor risk
Mixed exposure
One or more vendors flagged
DeepSeek
$3.4B·Tier 1
OpenAI
$840.0B·Tier 1
Head to head
9 benchmarks · 2 models
DeepSeek V3.2 Specialegpt-oss-120b (free)
Artificial Analysis · Agentic Index
gpt-oss-120b (free) leads by +37.9
DeepSeek V3.2 Speciale
0.0
gpt-oss-120b (free)
37.9
Artificial Analysis · Coding Index
DeepSeek V3.2 Speciale leads by +9.3
DeepSeek V3.2 Speciale
37.9
gpt-oss-120b (free)
28.6
Artificial Analysis · Quality Index
gpt-oss-120b (free) leads by +3.8
DeepSeek V3.2 Speciale
29.4
gpt-oss-120b (free)
33.3
OpenCompass · AIME2025
DeepSeek V3.2 Speciale leads by +2.6
DeepSeek V3.2 Speciale
96.0
gpt-oss-120b (free)
93.4
OpenCompass · GPQA-Diamond
DeepSeek V3.2 Speciale leads by +7.8
DeepSeek V3.2 Speciale
86.7
gpt-oss-120b (free)
78.9
OpenCompass · HLE
DeepSeek V3.2 Speciale leads by +10.3
DeepSeek V3.2 Speciale
28.6
gpt-oss-120b (free)
18.3
OpenCompass · IFEval
DeepSeek V3.2 Speciale leads by +1.5
DeepSeek V3.2 Speciale
91.7
gpt-oss-120b (free)
90.2
OpenCompass · LiveCodeBenchV6
DeepSeek V3.2 Speciale leads by +2.5
DeepSeek V3.2 Speciale
80.9
gpt-oss-120b (free)
78.4
OpenCompass · MMLU-Pro
DeepSeek V3.2 Speciale leads by +5.8
DeepSeek V3.2 Speciale
85.5
gpt-oss-120b (free)
79.7
Full benchmark table
| Benchmark | DeepSeek V3.2 Speciale | gpt-oss-120b (free) |
|---|---|---|
Artificial Analysis · Agentic Index | 0.0 | 37.9 |
Artificial Analysis · Coding Index | 37.9 | 28.6 |
Artificial Analysis · Quality Index | 29.4 | 33.3 |
OpenCompass · AIME2025 | 96.0 | 93.4 |
OpenCompass · GPQA-Diamond | 86.7 | 78.9 |
OpenCompass · HLE | 28.6 | 18.3 |
OpenCompass · IFEval | 91.7 | 90.2 |
OpenCompass · LiveCodeBenchV6 | 80.9 | 78.4 |
OpenCompass · MMLU-Pro | 85.5 | 79.7 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.40 | $1.20 | 164K tokens (~82 books) | $6.00 | |
| $0.00 | $0.00 | 131K tokens (~66 books) | — |
People also compared
DeepSeek V3.2 Speciale vs GPT-5 ChatClaude Mythos Preview vs DeepSeek V3.2 SpecialeDeepSeek V3.2 Speciale vs Qwen3.5 397B A17BClaude Instant vs DeepSeek V3.2 SpecialeDeepSeek V3.2 Speciale vs Step 3.5 FlashDeepSeek-V2 (MoE-236B, May 2024) vs DeepSeek V3.2 SpecialeDeepSeek V3.2 Speciale vs MiMo-V2-FlashGPT-5 Chat vs gpt-oss-120b (free)