Compare · ModelsLive · 2 picked · head to head

Qwen3 235B A22B vs DeepSeek V3

Side by side · benchmarks, pricing, and signals you can act on.

CiteAdd another

Winner summary

Qwen3 235B A22B wins on 8/8 benchmarks

Qwen3 235B A22B wins 8 of 8 shared benchmarks. Leads in coding · arena · knowledge.

Category leads

coding·Qwen3 235B A22Barena·Qwen3 235B A22Bknowledge·Qwen3 235B A22Bmath·Qwen3 235B A22Breasoning·Qwen3 235B A22B

Hype vs Reality

Attention vs performance

Qwen3 235B A22B

#60 by perf·no signal

QUIET

DeepSeek V3

#45 by perf·no signal

QUIET

See full mindshare →

Best value

DeepSeek V3

2.0x better value than Qwen3 235B A22B

Qwen3 235B A22B

49.6 pts/$

$1.14/M

DeepSeek V3

97.5 pts/$

$0.60/M

Explore pricing →

Vendor risk

Mixed exposure

One or more vendors flagged

Alibaba (Qwen)

$293.0B·Tier 1

Low risk

DeepSeek

$3.4B·Tier 1

Higher risk

See the AI economy →

Head to head

8 benchmarks · 2 models

Qwen3 235B A22BDeepSeek V3

Aider polyglot

Qwen3 235B A22B leads by +11.2

Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.

Qwen3 235B A22B

59.6

DeepSeek V3

48.4

Chatbot Arena Elo · Overall

Qwen3 235B A22B leads by +16.3

Qwen3 235B A22B

1374.4

DeepSeek V3

1358.2

Fiction.LiveBench

Qwen3 235B A22B leads by +17.7

Fiction.LiveBench · a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.

Qwen3 235B A22B

67.7

DeepSeek V3

50.0

GPQA diamond

Qwen3 235B A22B leads by +18.9

Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.

Qwen3 235B A22B

60.9

DeepSeek V3

42.0

Lech Mazur Writing

Qwen3 235B A22B leads by +6.0

Lech Mazur Writing · evaluates creative writing ability, assessing prose quality, narrative coherence, and stylistic sophistication.

Qwen3 235B A22B

83.0

DeepSeek V3

77.0

MATH level 5

Qwen3 235B A22B leads by +4.0

MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.

Qwen3 235B A22B

68.9

DeepSeek V3

64.8

SimpleBench

Qwen3 235B A22B leads by +14.5

SimpleBench · tests fundamental reasoning capabilities with straightforward problems designed to expose gaps in basic logical and spatial thinking.

Qwen3 235B A22B

17.2

DeepSeek V3

2.7

WeirdML

Qwen3 235B A22B leads by +1.2

WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.

Qwen3 235B A22B

37.3

DeepSeek V3

36.1

Full benchmark table

Benchmark	Qwen3 235B A22B	DeepSeek V3
Aider polyglot Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.	59.6	48.4
Chatbot Arena Elo · Overall	1374.4	1358.2
Fiction.LiveBench Fiction.LiveBench · a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.	67.7	50.0
GPQA diamond Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.	60.9	42.0
Lech Mazur Writing Lech Mazur Writing · evaluates creative writing ability, assessing prose quality, narrative coherence, and stylistic sophistication.	83.0	77.0
MATH level 5 MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.	68.9	64.8
SimpleBench SimpleBench · tests fundamental reasoning capabilities with straightforward problems designed to expose gaps in basic logical and spatial thinking.	17.2	2.7
WeirdML WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.	37.3	36.1

Pricing · per 1M tokens · projected $/mo at 10M tokens

Model	Input	Output	Context	Projected $/mo
Qwen3 235B A22B	$0.46	$1.82	131K tokens (~66 books)	$7.96
DeepSeek V3	$0.32	$0.89	164K tokens (~82 books)	$4.63

People also compared

DeepSeek V3 vs GPT-4o GPT-5 Mini vs Qwen3 235B A22B DeepSeek V3 vs Qwen2.5 Coder 32B Instruct DeepSeek V3 vs DeepSeek V3.2 Speciale DeepSeek V3 vs DeepSeek-V2 (MoE-236B, May 2024)DeepSeek V3 vs R1 0528 DeepSeek V3 vs DeepSeek V3 0324