测试版
排行榜/Claude Opus 4.1
Anthropic logo

Claude Opus 4.1

来自 Anthropic · 发布于 2025-08-05

41.3
平均分
$15.00/1M
输入价格
$75.00/1M
输出价格
200K tokens (~100 books)
上下文窗口
multimodal
类型

Tested on 14 benchmarks with 41.3% average. Top scores: Lech Mazur Writing (85.4%), SWE-Bench verified (73.3%), GPQA diamond (69.7%).

基准测试类别分数Bar
Lech Mazur Writingknowledge85.4
SWE-Bench verifiedcoding73.3
GPQA diamondknowledge69.7
OTIS Mock AIME 2024-2025math68.9
SimpleBenchreasoning52.0
DeepResearch Benchknowledge49.7
WeirdMLcoding42.8
Cybenchcoding42.0
Terminal Benchcoding38.0
SimpleQA Verifiedknowledge34.8
FrontierMath-2025-02-28-Privatemath7.2
HLEknowledge7.1
FrontierMath-Tier-4-2025-07-01-Privatemath4.2
VPCTknowledge2.5