Compare · ModelsLive · 2 picked · head to head

Qwen3 Next 80B A3B Thinking vs GPT-5.1

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Qwen3 Next 80B A3B Thinking wins 3 of 6 shared benchmarks. Leads in knowledge · math.

Category leads
arena·GPT-5.1knowledge·Qwen3 Next 80B A3B Thinkinglanguage·GPT-5.1math·Qwen3 Next 80B A3B Thinkingreasoning·GPT-5.1
Hype vs Reality
Qwen3 Next 80B A3B Thinking
#34 by perf·no signal
QUIET
GPT-5.1
#97 by perf·no signal
QUIET
Best value
15.9x better value than GPT-5.1
Qwen3 Next 80B A3B Thinking
140.4 pts/$
$0.44/M
GPT-5.1
8.8 pts/$
$5.63/M
Vendor risk
Alibaba Qwen logo
Alibaba (Qwen)
$293.0B·Tier 1
Low risk
OpenAI logo
OpenAI
$840.0B·Tier 1
Medium risk
Head to head
Qwen3 Next 80B A3B ThinkingGPT-5.1
Chatbot Arena Elo · Overall
GPT-5.1 leads by +69.5
Qwen3 Next 80B A3B Thinking
1369.0
GPT-5.1
1438.5
HELM · GPQA
Qwen3 Next 80B A3B Thinking leads by +18.8
Qwen3 Next 80B A3B Thinking
63.0
GPT-5.1
44.2
HELM · IFEval
GPT-5.1 leads by +12.5
Qwen3 Next 80B A3B Thinking
81.0
GPT-5.1
93.5
HELM · MMLU-Pro
Qwen3 Next 80B A3B Thinking leads by +20.7
Qwen3 Next 80B A3B Thinking
78.6
GPT-5.1
57.9
HELM · Omni-MATH
Qwen3 Next 80B A3B Thinking leads by +0.3
Qwen3 Next 80B A3B Thinking
46.7
GPT-5.1
46.4
HELM · WildBench
GPT-5.1 leads by +5.6
Qwen3 Next 80B A3B Thinking
80.7
GPT-5.1
86.3
Full benchmark table
BenchmarkQwen3 Next 80B A3B ThinkingGPT-5.1
Chatbot Arena Elo · Overall
1369.01438.5
HELM · GPQA
63.044.2
HELM · IFEval
81.093.5
HELM · MMLU-Pro
78.657.9
HELM · Omni-MATH
46.746.4
HELM · WildBench
80.786.3
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
Alibaba Qwen logoQwen3 Next 80B A3B Thinking$0.10$0.78131K tokens (~66 books)$2.68
OpenAI logoGPT-5.1$1.25$10.00400K tokens (~200 books)$34.38