Beta
Classificação/Claude Sonnet 4
Anthropic

Claude Sonnet 4

por Anthropic · Lançado em 2025-05-22

36.9
pontuação média
$3.00/1M
Preço de entrada
$15.00/1M
Preço de saída
200K tokens (~100 books)
Janela de contexto
multimodal
Tipo

Tested on 20 benchmarks with 36.9% average. Top scores: MATH level 5 (84.4%), GPQA diamond (72.3%), OTIS Mock AIME 2024-2025 (71.1%).

Pontuações de benchmark

BenchmarkCategoriaPontuaçãoBar
MATH level 5math84.4
GPQA diamondknowledge72.3
OTIS Mock AIME 2024-2025math71.1
SWE-Bench Verified (Bash Only)coding64.9
Aider polyglotcoding61.3
DeepResearch Benchknowledge47.8
Fiction.LiveBenchknowledge46.9
WeirdMLcoding46.1
OSWorldagentic43.9
ARC-AGIreasoning40.0
GeoBenchknowledge37.0
Cybenchcoding35.0
SimpleBenchreasoning34.6
The Agent Companyagentic33.1
ARC-AGI-2reasoning5.9
GSO-Benchcoding4.9
FrontierMath-2025-02-28-Privatemath4.1
HLEknowledge3.1
VPCTknowledge1.0
FrontierMath-Tier-4-2025-07-01-Privatemath0.1

Modelos similares