Home/Models/Hermes 2 Pro - Llama-3 8B
nousresearch logo

Hermes 2 Pro - Llama-3 8B

by nousresearch · Released May 2024

Open Source
38.2
avg score
Rank #152
Compare
Better than 35% of all models
Context
8K tokens (~4 books)
Input $/1M
$0.14
Output $/1M
$0.14
Type
text
License
Open Source
Benchmarks
6 tested
Data updated today
About

Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced...

Tested on 6 benchmarks with 22.1% average. Top scores: IFEval (53.6%), BBH (HuggingFace) (30.7%), MMLU-PRO (22.8%).

Looking for similar performance at lower cost?
GLM 4 32B scores 37.8 (99% as good) at $0.10/1M input · 29% cheaper
Capabilities
reasoning
11.3
#128 globally
math
8.4
#190 globally
knowledge
14.3
#205 globally
language
53.6
#101 globally
general
30.7
#29 globally
Benchmark Scores
Compare All
Tested on 6 benchmarks · Ranked across 5 categories
Score Distribution (all 233 models)
0255075100
▲ You are here
MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

11.3
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

8.4
MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

22.8
GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

5.7
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
hermes-2-pro-llama-3-8b
Specifications
  • Typetext
  • Context8K tokens (~4 books)
  • ReleasedMay 2024
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.000
Available On
nousresearch logonousresearch$0.14
Share & Export
Tweet
Hermes 2 Pro - Llama-3 8B is an open-source text AI model by nousresearch, released in May 2024. It has an average benchmark score of 38.2. Context window: 8K tokens.