Llama 3.1-405B

Open source

by Meta · Released 2024-01-01

Average score: 49.3
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

Tested on 15 benchmarks, with an average score of 49.3%. Top scores: ARC AI2 (93.7%), HellaSwag (85.6%), TriviaQA (82.7%).

Benchmark scores

Benchmark	Category	Score (%)
ARC AI2	knowledge	93.7
HellaSwag	knowledge	85.6
TriviaQA	knowledge	82.7
MMLU	knowledge	79.3
Winogrande	knowledge	78.4
BBH	reasoning	77.2
PIQA	knowledge	71.8
MATH level 5	math	49.8
GPQA diamond	knowledge	34.5
OpenBookQA	knowledge	32.3
WeirdML	coding	21.4
OTIS Mock AIME 2024-2025	math	9.6
SimpleBench	reasoning	7.6
Cybench	coding	7.5
The Agent Company	agentic	7.4
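
The page does not say how the 49.3 average is derived. The sketch below assumes it is an unweighted mean over all 15 benchmark scores, which does reproduce the reported figure; it also recovers the three top scores and groups scores by category. The names `SCORES`, `average_score`, `top_scores`, and `category_means` are illustrative and not part of the leaderboard.

```python
from collections import defaultdict
from statistics import mean

# (benchmark, category, score in percent), copied from the table above.
SCORES = [
    ("ARC AI2", "knowledge", 93.7),
    ("HellaSwag", "knowledge", 85.6),
    ("TriviaQA", "knowledge", 82.7),
    ("MMLU", "knowledge", 79.3),
    ("Winogrande", "knowledge", 78.4),
    ("BBH", "reasoning", 77.2),
    ("PIQA", "knowledge", 71.8),
    ("MATH level 5", "math", 49.8),
    ("GPQA diamond", "knowledge", 34.5),
    ("OpenBookQA", "knowledge", 32.3),
    ("WeirdML", "coding", 21.4),
    ("OTIS Mock AIME 2024-2025", "math", 9.6),
    ("SimpleBench", "reasoning", 7.6),
    ("Cybench", "coding", 7.5),
    ("The Agent Company", "agentic", 7.4),
]


def average_score(rows):
    """Unweighted mean of all scores, rounded to one decimal (assumed method)."""
    return round(mean(score for _, _, score in rows), 1)


def top_scores(rows, n=3):
    """Return the n highest-scoring rows, best first."""
    return sorted(rows, key=lambda row: row[2], reverse=True)[:n]


def category_means(rows):
    """Unweighted mean score per category (unrounded)."""
    by_category = defaultdict(list)
    for _, category, score in rows:
        by_category[category].append(score)
    return {category: mean(values) for category, values in by_category.items()}


if __name__ == "__main__":
    print(average_score(SCORES))  # 49.3
    print([name for name, _, _ in top_scores(SCORES)])
    for category, value in category_means(SCORES).items():
        print(f"{category}: {value:.1f}")
```

Because eight of the 15 benchmarks fall in the knowledge category, an unweighted mean leans toward knowledge performance; a per-category breakdown such as `category_means` makes that imbalance visible.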