Claude Opus 4

por Anthropic · Lançado em 2025-05-22

40.1

pontuação média

$15.00/1M

Preço de entrada

$75.00/1M

Preço de saída

200K tokens (~100 books)

Janela de contexto

multimodal

Tipo

Tested on 18 benchmarks with 40.1% average. Top scores: MATH level 5 (85.0%), Aider polyglot (72.0%), GPQA diamond (68.3%).

Pontuações de benchmark

Benchmark	Categoria	Pontuação
MATH level 5	math	85.0
Aider polyglot	coding	72.0
GPQA diamond	knowledge	68.3
SWE-Bench Verified (Bash Only)	coding	67.6
OTIS Mock AIME 2024-2025	math	64.4
Fiction.LiveBench	knowledge	61.1
SimpleBench	reasoning	50.6
GeoBench	knowledge	49.0
DeepResearch Bench	knowledge	49.0
WeirdML	coding	43.4
Cybench	coding	38.0
ARC-AGI	reasoning	35.7
ARC-AGI-2	reasoning	8.6
VPCT	knowledge	7.0
GSO-Bench	coding	6.9
HLE	knowledge	6.2
FrontierMath-2025-02-28-Private	math	4.5
FrontierMath-Tier-4-2025-07-01-Private	math	4.2