Compare · ModelsLive · 2 picked · head to head
Hermes 3 70B Instruct vs Llama 3 8B Instruct
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Hermes 3 70B Instruct wins on 6/6 benchmarks
Hermes 3 70B Instruct wins 6 of 6 shared benchmarks. Leads in general · knowledge · language.
Category leads
general·Hermes 3 70B Instructknowledge·Hermes 3 70B Instructlanguage·Hermes 3 70B Instructmath·Hermes 3 70B Instructreasoning·Hermes 3 70B Instruct
Hype vs Reality
Attention vs performance
Hermes 3 70B Instruct
#147 by perf·no signal
Llama 3 8B Instruct
#182 by perf·no signal
Best value
Llama 3 8B Instruct
6.9x better value than Hermes 3 70B Instruct
Hermes 3 70B Instruct
128.3 pts/$
$0.30/M
Llama 3 8B Instruct
880.0 pts/$
$0.04/M
Vendor risk
Who is behind the model
nousresearch
private · undisclosed
Meta AI
$1.50T·Tier 1
Head to head
6 benchmarks · 2 models
Hermes 3 70B InstructLlama 3 8B Instruct
BBH (HuggingFace)
Hermes 3 70B Instruct leads by +35.4
Hermes 3 70B Instruct
53.8
Llama 3 8B Instruct
18.4
GPQA
Hermes 3 70B Instruct leads by +12.8
Hermes 3 70B Instruct
14.9
Llama 3 8B Instruct
2.1
IFEval
Hermes 3 70B Instruct leads by +52.6
Hermes 3 70B Instruct
76.6
Llama 3 8B Instruct
24.0
MATH Level 5
Hermes 3 70B Instruct leads by +17.1
Hermes 3 70B Instruct
21.0
Llama 3 8B Instruct
3.9
MMLU-PRO
Hermes 3 70B Instruct leads by +23.7
Hermes 3 70B Instruct
41.4
Llama 3 8B Instruct
17.8
MUSR
Hermes 3 70B Instruct leads by +3.5
Hermes 3 70B Instruct
23.4
Llama 3 8B Instruct
19.9
Full benchmark table
| Benchmark | Hermes 3 70B Instruct | Llama 3 8B Instruct |
|---|---|---|
BBH (HuggingFace) | 53.8 | 18.4 |
GPQA | 14.9 | 2.1 |
IFEval | 76.6 | 24.0 |
MATH Level 5 | 21.0 | 3.9 |
MMLU-PRO | 41.4 | 17.8 |
MUSR | 23.4 | 19.9 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.30 | $0.30 | 131K tokens (~66 books) | $3.00 | |
| $0.03 | $0.04 | 8K tokens (~4 books) | $0.33 |