gpt-oss-120b (free) vs Qwen3.5 397B A17B
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Qwen3.5 397B A17B wins on 8/9 benchmarks
Qwen3.5 397B A17B wins 8 of 9 shared benchmarks and leads in speed, knowledge, language, and coding. gpt-oss-120b (free) leads only in math.
Category leads
speed · Qwen3.5 397B A17B
math · gpt-oss-120b (free)
knowledge · Qwen3.5 397B A17B
language · Qwen3.5 397B A17B
coding · Qwen3.5 397B A17B
Hype vs Reality
Attention vs performance
gpt-oss-120b (free) · #20 by perf · no signal
Qwen3.5 397B A17B · #3 by perf · no signal
Best value
Qwen3.5 397B A17B
gpt-oss-120b (free) · — · $0.00/M
Qwen3.5 397B A17B · 57.4 pts/$ · $1.36/M
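The page does not say how pts/$ is derived. As a rough sketch, one reading that happens to reproduce the figure is the mean of the six OpenCompass scores divided by a blended price, taken as the simple average of the input and output prices. Both the formula and the variable names below are assumptions, not the site's documented method.

```python
# Scores and prices are copied from this page; the formula is an assumption.
opencompass_scores = [92.3, 88.4, 27.5, 91.5, 83.0, 87.6]  # Qwen3.5 397B A17B
input_price, output_price = 0.39, 2.34  # $ per 1M tokens

# Assumed blended price: plain average of input and output prices.
blended_price = (input_price + output_price) / 2

# Assumed value metric: mean benchmark score per blended dollar.
mean_score = sum(opencompass_scores) / len(opencompass_scores)
points_per_dollar = mean_score / blended_price

print(f"blended ${blended_price:.3f}/M, {points_per_dollar:.1f} pts/$")
```

Under this reading the blended price is about $1.365/M, close to the $1.36/M shown, and the ratio is undefined for a $0.00 model, which is consistent with the dash shown for gpt-oss-120b (free).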
Vendor risk
Who is behind the model
OpenAI · $840.0B · Tier 1
Alibaba (Qwen) · $293.0B · Tier 1
Head to head
9 benchmarks · 2 models
gpt-oss-120b (free) · Qwen3.5 397B A17B
Artificial Analysis · Agentic Index · Qwen3.5 397B A17B leads by +18.0
gpt-oss-120b (free) 37.9 · Qwen3.5 397B A17B 55.8
Artificial Analysis · Coding Index · Qwen3.5 397B A17B leads by +12.7
gpt-oss-120b (free) 28.6 · Qwen3.5 397B A17B 41.3
Artificial Analysis · Quality Index · Qwen3.5 397B A17B leads by +11.8
gpt-oss-120b (free) 33.3 · Qwen3.5 397B A17B 45.0
OpenCompass · AIME2025 · gpt-oss-120b (free) leads by +1.1
gpt-oss-120b (free) 93.4 · Qwen3.5 397B A17B 92.3
OpenCompass · GPQA-Diamond · Qwen3.5 397B A17B leads by +9.5
gpt-oss-120b (free) 78.9 · Qwen3.5 397B A17B 88.4
OpenCompass · HLE · Qwen3.5 397B A17B leads by +9.2
gpt-oss-120b (free) 18.3 · Qwen3.5 397B A17B 27.5
OpenCompass · IFEval · Qwen3.5 397B A17B leads by +1.3
gpt-oss-120b (free) 90.2 · Qwen3.5 397B A17B 91.5
OpenCompass · LiveCodeBenchV6 · Qwen3.5 397B A17B leads by +4.6
gpt-oss-120b (free) 78.4 · Qwen3.5 397B A17B 83.0
OpenCompass · MMLU-Pro · Qwen3.5 397B A17B leads by +7.9
gpt-oss-120b (free) 79.7 · Qwen3.5 397B A17B 87.6
Full benchmark table
| Benchmark | gpt-oss-120b (free) | Qwen3.5 397B A17B |
|---|---|---|
| Artificial Analysis · Agentic Index | 37.9 | 55.8 |
| Artificial Analysis · Coding Index | 28.6 | 41.3 |
| Artificial Analysis · Quality Index | 33.3 | 45.0 |
| OpenCompass · AIME2025 | 93.4 | 92.3 |
| OpenCompass · GPQA-Diamond | 78.9 | 88.4 |
| OpenCompass · HLE | 18.3 | 27.5 |
| OpenCompass · IFEval | 90.2 | 91.5 |
| OpenCompass · LiveCodeBenchV6 | 78.4 | 83.0 |
| OpenCompass · MMLU-Pro | 79.7 | 87.6 |
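The win count and per-benchmark leads can be recomputed from the table. The sketch below assumes that higher is better on every benchmark and that a win is simply the higher displayed score. The dictionary layout and names are illustrative, not part of the site.

```python
# Displayed scores from the table: (gpt-oss-120b (free), Qwen3.5 397B A17B).
scores = {
    "Artificial Analysis · Agentic Index": (37.9, 55.8),
    "Artificial Analysis · Coding Index": (28.6, 41.3),
    "Artificial Analysis · Quality Index": (33.3, 45.0),
    "OpenCompass · AIME2025": (93.4, 92.3),
    "OpenCompass · GPQA-Diamond": (78.9, 88.4),
    "OpenCompass · HLE": (18.3, 27.5),
    "OpenCompass · IFEval": (90.2, 91.5),
    "OpenCompass · LiveCodeBenchV6": (78.4, 83.0),
    "OpenCompass · MMLU-Pro": (79.7, 87.6),
}

GPT, QWEN = "gpt-oss-120b (free)", "Qwen3.5 397B A17B"
wins = {GPT: 0, QWEN: 0}
leads = {}

for benchmark, (gpt, qwen) in scores.items():
    # Assumption: higher score wins; the table contains no ties.
    leader = GPT if gpt > qwen else QWEN
    wins[leader] += 1
    leads[benchmark] = (leader, round(abs(gpt - qwen), 1))

for benchmark, (leader, gap) in leads.items():
    print(f"{benchmark}: {leader} leads by +{gap}")
print(wins)
```

This reproduces the 8 of 9 result. From the rounded table values, the Agentic Index and Quality Index gaps come out as 17.9 and 11.7, which is 0.1 below the +18.0 and +11.8 shown in the Head to head section. Those leads were presumably computed from unrounded scores.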
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| gpt-oss-120b (free) | $0.00 | $0.00 | 131K tokens (~66 books) | — |
| Qwen3.5 397B A17B | $0.39 | $2.34 | 262K tokens (~131 books) | $8.78 |
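The page does not state the input/output mix behind the projected monthly cost. As a hedged sketch, a 75% input and 25% output split over 10M tokens reproduces the $8.78 figure for Qwen3.5 397B A17B. The split and the function name `projected_monthly_cost` are assumptions made for illustration.

```python
from decimal import Decimal, ROUND_HALF_UP


def projected_monthly_cost(input_price, output_price,
                           total_millions=Decimal("10"),
                           input_share=Decimal("0.75")):
    """Cost in dollars for `total_millions` tokens at per-1M prices.

    input_share is an assumed fraction of tokens billed as input;
    the remainder is billed as output.
    """
    input_cost = total_millions * input_share * input_price
    output_cost = total_millions * (1 - input_share) * output_price
    # Round to cents, half up, using exact decimal arithmetic.
    return (input_cost + output_cost).quantize(Decimal("0.01"),
                                               rounding=ROUND_HALF_UP)


qwen_cost = projected_monthly_cost(Decimal("0.39"), Decimal("2.34"))
free_cost = projected_monthly_cost(Decimal("0.00"), Decimal("0.00"))
print(qwen_cost, free_cost)
```

Decimal is used so that the half-cent case (8.775) rounds predictably. Other splits give different totals, so treat the 75/25 mix as a guess that fits the published number and not as a confirmed method.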
People also compared
GPT-5 Chat vs Qwen3.5 397B A17B
Claude Mythos Preview vs Qwen3.5 397B A17B
DeepSeek V3.2 Speciale vs Qwen3.5 397B A17B
Claude Instant vs Qwen3.5 397B A17B
Qwen3.5 397B A17B vs Step 3.5 Flash
DeepSeek-V2 (MoE-236B, May 2024) vs Qwen3.5 397B A17B
MiMo-V2-Flash vs Qwen3.5 397B A17B
GPT-5 Chat vs gpt-oss-120b (free)