Claude 3 Sonnet

by Anthropic · Released 2024-01-01

Average score: 28.3%
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

Tested on 6 benchmarks, with an average score of 28.3%. Top scores: MMLU (67.9%), Winogrande (50.2%), GPQA diamond (20.8%).

Benchmark                   Category    Score (%)
MMLU                        knowledge   67.9
Winogrande                  knowledge   50.2
GPQA diamond                knowledge   20.8
MATH level 5                math        18.2
WeirdML                     coding      10.2
OTIS Mock AIME 2024-2025    math        2.4
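
The summary figures can be checked against the table with a short sketch. It assumes the average is an unweighted mean of the six listed scores; the page does not state how the average is computed, although the result matches the reported 28.3%. The names `scores`, `average_score`, and `top_scores` are hypothetical and used only for illustration.

```python
# Scores (%) copied from the benchmark table above.
scores = {
    "MMLU": 67.9,
    "Winogrande": 50.2,
    "GPQA diamond": 20.8,
    "MATH level 5": 18.2,
    "WeirdML": 10.2,
    "OTIS Mock AIME 2024-2025": 2.4,
}


def average_score(results):
    """Unweighted mean of the per-benchmark scores, rounded to one decimal.

    Assumption: every benchmark counts equally; the page does not
    document its averaging method.
    """
    return round(sum(results.values()) / len(results), 1)


def top_scores(results, n=3):
    """Return the n highest-scoring (benchmark, score) pairs, best first."""
    return sorted(results.items(), key=lambda item: item[1], reverse=True)[:n]


if __name__ == "__main__":
    print("Average score:", average_score(scores))
    for name, score in top_scores(scores):
        print(f"{name}: {score}%")
```

Under this equal-weight assumption, the three math and coding benchmarks pull the average well below the MMLU and Winogrande scores.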