Beta
Home/Models/Llama 3.1 405B
Meta logo

Llama 3.1 405B

by Meta · Released Jul 2024

Open Source
44.1
avg score
Rank #126
Compare
Better than 45% of all models
Context
N/A
Input $/1M
TBD
Output $/1M
TBD
Type
text-generation
License
Open Source
Benchmarks
21 tested
Data updated today
About

Meta-llama text generation model. 383K downloads on HuggingFace.

Tested on 21 benchmarks with 38.0% average. Top scores: ARC AI2 (93.7%), HellaSwag (85.6%), TriviaQA (82.7%).

Capabilities
coding
14.4
#131 globally
reasoning
29.0
#84 globally
math
19.8
#163 globally
knowledge
59.0
#51 globally
agentic
7.4
#28 globally
language
18.1
#146 globally
general
7.8
#55 globally
Benchmark Scores
Compare All
Tested on 21 benchmarks · Ranked across 7 categories
Score Distribution (all 231 models)
0255075100
▲ You are here
WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

21.4
Cybench

Capture-the-flag cybersecurity challenges. Tests vulnerability analysis, reverse engineering, cryptography, and exploitation skills.

7.5
BBH

BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.

77.2
SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

7.6
MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

2.2
MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

49.8
OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

9.6
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

0.0
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Info
Documentation
Community
BenchGecko API
meta-llama-llama-31-405b
Specifications
  • Typetext-generation
  • ContextN/A
  • ReleasedJul 2024
  • LicenseOpen Source
  • StatusActive
Available On
Meta logoMetaTBD
Share & Export
Tweet
Llama 3.1 405B is an open-source text-generation AI model by Meta, released in July 2024. It has an average benchmark score of 44.1.