Compare · ModelsLive · 2 picked · head to head
Claude 3.5 Sonnet vs Yi 6B
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Claude 3.5 Sonnet wins on 2/2 benchmarks
Claude 3.5 Sonnet wins 2 of 2 shared benchmarks. Leads in math · knowledge.
Category leads
math·Claude 3.5 Sonnetknowledge·Claude 3.5 Sonnet
Hype vs Reality
Attention vs performance
Claude 3.5 Sonnet
#129 by perf·no signal
Yi 6B
#183 by perf·no signal
Vendor risk
Who is behind the model
Anthropic
$380.0B·Tier 1
U
Unknown
private · undisclosed
Head to head
2 benchmarks · 2 models
Claude 3.5 SonnetYi 6B
MATH level 5
Claude 3.5 Sonnet leads by +46.5
MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.
Claude 3.5 Sonnet
51.7
Yi 6B
5.2
MMLU
Claude 3.5 Sonnet leads by +30.0
Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge.
Claude 3.5 Sonnet
82.0
Yi 6B
52.0
Full benchmark table
| Benchmark | Claude 3.5 Sonnet | Yi 6B |
|---|---|---|
MATH level 5 MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics. | 51.7 | 5.2 |
MMLU Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge. | 82.0 | 52.0 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| — | — | — | — | |
U Yi 6B | — | — | — | — |