Compare · ModelsLive · 2 picked · head to head

Claude Sonnet 4.5 vs Qwen3 235B A22B Instruct 2507

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Claude Sonnet 4.5 wins 3 of 3 shared benchmarks. Leads in reasoning · coding.

Category leads
reasoning·Claude Sonnet 4.5coding·Claude Sonnet 4.5
Hype vs Reality
Claude Sonnet 4.5
#132 by perf·no signal
QUIET
Qwen3 235B A22B Instruct 2507
#99 by perf·no signal
QUIET
Best value
121.3x better value than Claude Sonnet 4.5
Claude Sonnet 4.5
4.7 pts/$
$9.00/M
Qwen3 235B A22B Instruct 2507
567.3 pts/$
$0.09/M
Vendor risk
Anthropic logo
Anthropic
$380.0B·Tier 1
Medium risk
Alibaba Qwen logo
Alibaba (Qwen)
$293.0B·Tier 1
Low risk
Head to head
Claude Sonnet 4.5Qwen3 235B A22B Instruct 2507
ARC-AGI
Claude Sonnet 4.5 leads by +52.7
ARC-AGI · the original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern recognition tasks without memorization.
Claude Sonnet 4.5
63.7
Qwen3 235B A22B Instruct 2507
11.0
ARC-AGI-2
Claude Sonnet 4.5 leads by +12.4
ARC-AGI-2 · the second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data.
Claude Sonnet 4.5
13.6
Qwen3 235B A22B Instruct 2507
1.3
WeirdML
Claude Sonnet 4.5 leads by +9.0
WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.
Claude Sonnet 4.5
47.7
Qwen3 235B A22B Instruct 2507
38.7
Full benchmark table
BenchmarkClaude Sonnet 4.5Qwen3 235B A22B Instruct 2507
ARC-AGI
ARC-AGI · the original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern recognition tasks without memorization.
63.711.0
ARC-AGI-2
ARC-AGI-2 · the second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data.
13.61.3
WeirdML
WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.
47.738.7
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
Anthropic logoClaude Sonnet 4.5$3.00$15.001.0M tokens (~500 books)$60.00
Alibaba Qwen logoQwen3 235B A22B Instruct 2507$0.07$0.10262K tokens (~131 books)$0.78