Beta
Compare · ModelsLive · 2 picked · head to head

Llama 3.3 70B Instruct vs Llama 3.1 70B Instruct

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Llama 3.3 70B Instruct wins 6 of 8 shared benchmarks. Leads in coding · arena · general.

Category leads
coding·Llama 3.3 70B Instructarena·Llama 3.3 70B Instructgeneral·Llama 3.3 70B Instructknowledge·Llama 3.1 70B Instructlanguage·Llama 3.3 70B Instructmath·Llama 3.3 70B Instructreasoning·Llama 3.1 70B Instruct
Hype vs Reality
Llama 3.3 70B Instruct
#107 by perf·no signal
QUIET
Llama 3.1 70B Instruct
#152 by perf·no signal
QUIET
Best value
2.4x better value than Llama 3.1 70B Instruct
Llama 3.3 70B Instruct
223.3 pts/$
$0.21/M
Llama 3.1 70B Instruct
94.5 pts/$
$0.40/M
Vendor risk
Meta logo
Meta AI
$1.50T·Tier 1
Low risk
Meta logo
Meta AI
$1.50T·Tier 1
Low risk
Head to head
Llama 3.3 70B InstructLlama 3.1 70B Instruct
Aider · Code Editing
Llama 3.3 70B Instruct leads by +0.8
Llama 3.3 70B Instruct
59.4
Llama 3.1 70B Instruct
58.6
Chatbot Arena Elo · Overall
Llama 3.3 70B Instruct leads by +25.2
Llama 3.3 70B Instruct
1318.0
Llama 3.1 70B Instruct
1292.8
BBH (HuggingFace)
Llama 3.3 70B Instruct leads by +0.6
Llama 3.3 70B Instruct
56.6
Llama 3.1 70B Instruct
55.9
GPQA
Llama 3.1 70B Instruct leads by +3.7
Llama 3.3 70B Instruct
10.5
Llama 3.1 70B Instruct
14.2
IFEval
Llama 3.3 70B Instruct leads by +3.3
Llama 3.3 70B Instruct
90.0
Llama 3.1 70B Instruct
86.7
MATH Level 5
Llama 3.3 70B Instruct leads by +10.3
Llama 3.3 70B Instruct
48.3
Llama 3.1 70B Instruct
38.1
MMLU-PRO
Llama 3.3 70B Instruct leads by +0.3
Llama 3.3 70B Instruct
48.1
Llama 3.1 70B Instruct
47.9
MUSR
Llama 3.1 70B Instruct leads by +2.1
Llama 3.3 70B Instruct
15.6
Llama 3.1 70B Instruct
17.7
Full benchmark table
BenchmarkLlama 3.3 70B InstructLlama 3.1 70B Instruct
Aider · Code Editing
59.458.6
Chatbot Arena Elo · Overall
1318.01292.8
BBH (HuggingFace)
56.655.9
GPQA
10.514.2
IFEval
90.086.7
MATH Level 5
48.338.1
MMLU-PRO
48.147.9
MUSR
15.617.7
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
Meta logoLlama 3.3 70B Instruct$0.10$0.32131K tokens (~66 books)$1.55
Meta logoLlama 3.1 70B Instruct$0.40$0.40131K tokens (~66 books)$4.00