
Claude Opus 4.6

By Anthropic · Released 2026-02-04

Average score: 57.5
Input price: $5.00 / 1M tokens
Output price: $25.00 / 1M tokens
Context window: 1.0M tokens (~500 books)
Type: multimodal
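
As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. It assumes flat per-token pricing at the rates shown on this page; the function name `estimate_cost` and the token counts are hypothetical, and real billing may differ (for example, tiered or cached-token pricing is not modeled).

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 25.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD, assuming flat per-token pricing (an assumption)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
cost = estimate_cost(200_000, 10_000)
print(f"${cost:.2f}")  # prints $1.25
```

Output tokens cost five times as much as input tokens at these rates, so long generations dominate the bill even when the prompt is large.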

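Tested on 19 benchmarks. The 57.5% average appears to cover only the 17 percentage-scored benchmarks; the two Chatbot Arena entries are Elo ratings on a different scale, not percentages. Top results: Chatbot Arena Elo — Coding (1542.9 Elo), Chatbot Arena Elo — Overall (1496.6 Elo), OTIS Mock AIME 2024-2025 (94.4%).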

| Benchmark | Category | Score |
| --- | --- | --- |
| Chatbot Arena Elo — Coding | arena | 1542.9 |
| Chatbot Arena Elo — Overall | arena | 1496.6 |
| OTIS Mock AIME 2024-2025 | math | 94.4 |
| ARC-AGI | reasoning | 94.0 |
| Cybench | coding | 93.0 |
| GPQA diamond | knowledge | 87.4 |
| SWE-Bench verified | coding | 78.7 |
| WeirdML | coding | 77.9 |
| Terminal Bench | coding | 74.7 |
| ARC-AGI-2 | reasoning | 69.2 |
| SimpleBench | reasoning | 61.1 |
| SimpleQA Verified | knowledge | 46.5 |
| FrontierMath-2025-02-28-Private | math | 40.7 |
| GSO-Bench | coding | 33.3 |
| APEX-Agents | agentic | 31.7 |
| HLE | knowledge | 31.1 |
| PostTrainBench | knowledge | 23.2 |
| FrontierMath-Tier-4-2025-07-01-Private | math | 22.9 |
| Chess Puzzles | knowledge | 17.0 |
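
The reported 57.5 average can be checked against the listed scores. The sketch below assumes the average is a plain unweighted mean and that the two `arena` rows are excluded because they are Elo ratings; the page does not state its method, so this is an inference that happens to match the published figure.

```python
# (category, score) for each benchmark, copied from the table on this page.
scores = {
    "Chatbot Arena Elo — Coding": ("arena", 1542.9),
    "Chatbot Arena Elo — Overall": ("arena", 1496.6),
    "OTIS Mock AIME 2024-2025": ("math", 94.4),
    "ARC-AGI": ("reasoning", 94.0),
    "Cybench": ("coding", 93.0),
    "GPQA diamond": ("knowledge", 87.4),
    "SWE-Bench verified": ("coding", 78.7),
    "WeirdML": ("coding", 77.9),
    "Terminal Bench": ("coding", 74.7),
    "ARC-AGI-2": ("reasoning", 69.2),
    "SimpleBench": ("reasoning", 61.1),
    "SimpleQA Verified": ("knowledge", 46.5),
    "FrontierMath-2025-02-28-Private": ("math", 40.7),
    "GSO-Bench": ("coding", 33.3),
    "APEX-Agents": ("agentic", 31.7),
    "HLE": ("knowledge", 31.1),
    "PostTrainBench": ("knowledge", 23.2),
    "FrontierMath-Tier-4-2025-07-01-Private": ("math", 22.9),
    "Chess Puzzles": ("knowledge", 17.0),
}

# Assumption: Elo ratings are left out of the average because they are not percentages.
pct_scores = [score for category, score in scores.values() if category != "arena"]
average = sum(pct_scores) / len(pct_scores)

print(len(scores), len(pct_scores), round(average, 1))  # prints 19 17 57.5
```

Including the two Elo rows would push the mean above 200, which is why the published figure is only consistent with the 17 percentage-scored benchmarks.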