
Llama 3.1 8B Instruct

by Meta · Released Jul 2024

Open Source
Average score: 34.3
Rank: #167
Better than 28% of all models
Context: 16K tokens (roughly 12,000 English words)
Input $/1M: $0.02
Output $/1M: $0.05
Type: text
License: Open Source
Benchmarks: 16 tested
Data updated today
About

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

Tested on 16 benchmarks with 27.4% average. Top scores: Chatbot Arena Elo, Overall (1211.0, an Elo rating rather than a percentage), GSM8K (82.4%), PIQA (62.4%).

Looking for similar performance at lower cost?
Gemma 3 27B (free) scores 35.0 (102% as good) at $0.00/1M input · 100% cheaper
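
The "102% as good" and "100% cheaper" figures above can be reproduced with simple ratios. The sketch below assumes the site divides the two average scores and compares input prices only; the function names are hypothetical and not part of any BenchGecko API.

```python
def relative_score_percent(candidate_score: float, baseline_score: float) -> float:
    """Candidate score expressed as a percentage of the baseline score."""
    return 100.0 * candidate_score / baseline_score


def percent_cheaper(candidate_price: float, baseline_price: float) -> float:
    """Price reduction of the candidate relative to the baseline, in percent."""
    return 100.0 * (baseline_price - candidate_price) / baseline_price


# Values taken from this page: Gemma 3 27B (free) vs. Llama 3.1 8B Instruct.
print(round(relative_score_percent(35.0, 34.3)))  # 102
print(round(percent_cheaper(0.00, 0.02)))         # 100
```

Note that a 2% score difference between averages is small, so the comparison says more about price than about quality.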
Capabilities
  • coding: 19.7 (#127 globally)
  • reasoning: 8.5 (#145 globally)
  • math: 30.8 (#129 globally)
  • knowledge: 26.8 (#184 globally)
  • general: 29.2 (#30 globally)
  • language: 50.6 (#105 globally)
Benchmark Scores
Tested on 16 benchmarks · Ranked across 7 categories
[Chart: Score Distribution (all 233 models); score axis from 0 to 100, with this model's position marked]
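
The "Better than 28% of all models" figure is consistent with rank #167 among the 233 models in the distribution. The sketch below assumes the percentage is simply the share of models ranked lower; the site's exact formula is not stated, and the function name is hypothetical.

```python
def percent_better_than(rank: int, total_models: int) -> float:
    """Share of models ranked below the given rank, as a percentage."""
    return 100.0 * (total_models - rank) / total_models


# Rank #167 out of 233 models, both taken from this page.
print(round(percent_better_than(167, 233)))  # 28
```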
Aider — Code Editing

Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.

Score: 37.6
WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

Score: 1.7
MuSR

HuggingFace MuSR (Multistep Soft Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

Score: 8.5
GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

Score: 82.4
MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

Score: 22.9
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced multi-step reasoning.

Score: 15.5
Score legend: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
Links
BenchGecko API
llama-3-1-8b-instruct
Specifications
  • Type: text
  • Context: 16K tokens (roughly 12,000 English words)
  • Released: Jul 2024
  • License: Open Source
  • Status: Active
  • Cost / Message: ~$0.000
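
The per-message cost rounds to $0.000 because the per-token prices are so small. The sketch below uses the input and output prices listed on this page; the message size (500 input tokens, 300 output tokens) is a hypothetical assumption, since the page does not say what a "message" contains.

```python
INPUT_PRICE_PER_M = 0.02   # USD per 1M input tokens (from this page)
OUTPUT_PRICE_PER_M = 0.05  # USD per 1M output tokens (from this page)


def message_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one message at the listed per-million-token prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical message: 500 tokens in, 300 tokens out.
cost = message_cost_usd(500, 300)
print(f"${cost:.3f}")  # $0.000
print(f"${cost:.6f}")  # $0.000025
```

At these prices, roughly 40,000 such messages would cost about one dollar.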
Available On
Meta: $0.02
Llama 3.1 8B Instruct is an open-source text AI model by Meta, released in July 2024. It has an average benchmark score of 34.3. Context window: 16K tokens.