Claude 2.1

by Anthropic · Released 2024-01-01

Average score: 21.0%
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

Tested on 4 benchmarks, with an average score of 21.0%. Top scores: MMLU (64.7%), GPQA diamond (10.6%), WeirdML (7.1%).

| Benchmark | Category | Score (%) |
| --- | --- | --- |
| MMLU | knowledge | 64.7 |
| GPQA diamond | knowledge | 10.6 |
| WeirdML | coding | 7.1 |
| OTIS Mock AIME 2024-2025 | math | 1.9 |
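
The average and the top-three list in the summary can be roughly reproduced from the four listed scores. The sketch below assumes the average is a plain unweighted mean, which the page does not state; the variable names are illustrative only. Because the listed scores are rounded to one decimal place, the recomputed mean (about 21.08) differs slightly from the reported 21.0%, a gap that is consistent with the page averaging unrounded values.

```python
# Scores as displayed on the page, in percent (already rounded to one decimal).
scores = {
    "MMLU": 64.7,
    "GPQA diamond": 10.6,
    "WeirdML": 7.1,
    "OTIS Mock AIME 2024-2025": 1.9,
}

# Assumption: the reported average is an unweighted mean over all benchmarks.
average = sum(scores.values()) / len(scores)

# Top three benchmarks by score, highest first.
top = sorted(scores.items(), key=lambda item: item[1], reverse=True)[:3]

print(f"Benchmarks: {len(scores)}")
print(f"Mean of displayed scores: {average:.3f}%")
print("Top scores: " + ", ".join(f"{name} ({value}%)" for name, value in top))
```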