
Mistral Medium 3 vs Llama 2-13B

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Mistral Medium 3 wins 2 of 2 shared benchmarks and leads in both categories: knowledge and math.

Category leads
knowledge · Mistral Medium 3
math · Mistral Medium 3
Hype vs Reality
Mistral Medium 3 · #145 by perf · no signal · QUIET
Llama 2-13B · #128 by perf · no signal · QUIET
Best value
Mistral Medium 3 · 33.3 pts/$ · $1.20/M
Llama 2-13B · no price
Vendor risk
Mistral AI · $14.0B · Tier 1 · Medium risk
Meta AI · $1.50T · Tier 1 · Low risk
Head to head
Mistral Medium 3 vs Llama 2-13B
GPQA diamond
Mistral Medium 3 leads by +44.3
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
Mistral Medium 3 · 46.0
Llama 2-13B · 1.8
MATH level 5
Mistral Medium 3 leads by +78.3
MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.
Mistral Medium 3 · 81.6
Llama 2-13B · 3.3
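The leads and the win tally quoted above are plain differences between the two models' scores. A minimal Python sketch, using only the rounded scores displayed on this page; the dictionary layout and the `head_to_head` function name are illustrative assumptions, not the site's own code:

```python
# Scores as displayed on this page (rounded to one decimal place).
SCORES = {
    "GPQA diamond": {"Mistral Medium 3": 46.0, "Llama 2-13B": 1.8},
    "MATH level 5": {"Mistral Medium 3": 81.6, "Llama 2-13B": 3.3},
}


def head_to_head(scores, a, b):
    """Return (lead of a over b per shared benchmark, wins for a, shared count)."""
    leads = {}
    for bench, by_model in scores.items():
        # Only benchmarks reported for both models count as shared.
        if a in by_model and b in by_model:
            leads[bench] = round(by_model[a] - by_model[b], 1)
    wins = sum(1 for lead in leads.values() if lead > 0)
    return leads, wins, len(leads)


leads, wins, shared = head_to_head(SCORES, "Mistral Medium 3", "Llama 2-13B")
for bench, lead in leads.items():
    print(f"{bench}: {lead:+.1f}")
print(f"wins {wins} of {shared} shared benchmarks")
```

From the rounded scores this gives +44.2 for GPQA diamond and +78.3 for MATH level 5. The page's +44.3 for GPQA diamond presumably comes from unrounded scores that are not shown here.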
Full benchmark table
Benchmark | Mistral Medium 3 | Llama 2-13B
GPQA diamond | 46.0 | 1.8
MATH level 5 | 81.6 | 3.3
Pricing · per 1M tokens · projected $/mo at 10M tokens
Model | Input | Output | Context | Projected $/mo
Mistral Medium 3 | $0.40 | $2.00 | 131K tokens (~66 books) | $8.00
Llama 2-13B
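The page does not state how the projected monthly cost or the $1.20/M figure is derived. The sketch below shows one reading that reproduces both numbers from the listed prices: a 3:1 input-to-output token split for the projection, and a simple average of the input and output prices for the blended rate. Both mixes are assumptions inferred from the arithmetic, not documented formulas, and the function names are hypothetical.

```python
def projected_monthly_cost(input_price, output_price, total_tokens_m, input_share):
    """Dollar cost for total_tokens_m million tokens.

    Prices are dollars per 1M tokens. input_share is an assumption:
    the page does not say which input/output mix it uses.
    """
    input_tokens_m = total_tokens_m * input_share
    output_tokens_m = total_tokens_m * (1 - input_share)
    return input_tokens_m * input_price + output_tokens_m * output_price


def blended_price(input_price, output_price):
    """Simple average of input and output price (assumed, not documented)."""
    return (input_price + output_price) / 2


# Mistral Medium 3 prices as listed: $0.40 input, $2.00 output per 1M tokens.
monthly = projected_monthly_cost(0.40, 2.00, total_tokens_m=10, input_share=0.75)
blend = blended_price(0.40, 2.00)
print(f"projected: ${monthly:.2f}/mo")
print(f"blended: ${blend:.2f}/M")
```

Under these assumptions the sketch yields $8.00/mo and $1.20/M, matching the figures shown for Mistral Medium 3. Llama 2-13B has no listed price, so no projection can be computed for it.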