베타
리더보드/Claude Mythos Preview
Anthropic logo

Claude Mythos Preview

제공 Anthropic · 출시일 2026-04-07

81.8
평균 점수
N/A
입력 가격
N/A
출력 가격
1.0M tokens (~500 books)
컨텍스트 윈도우
text
유형

Tested on 14 benchmarks with 81.8% average. Top scores: USAMO (97.6%), GPQA diamond (94.5%), SWE-Bench verified (93.9%).

벤치마크카테고리점수Bar
USAMOmath97.6
GPQA diamondknowledge94.5
SWE-Bench verifiedcoding93.9
CharXiv Reasoning (with tools)reasoning93.2
MMMLUknowledge92.7
SWE-bench Multilingualcoding87.3
CharXiv Reasoningreasoning86.1
Terminal Benchcoding82.0
GraphWalks BFS 256K-1Mreasoning80.0
OSWorldagentic79.6
SWE-bench Procoding77.8
HLE (with tools)reasoning64.7
SWE-bench Multimodalcoding59.0
HLEknowledge56.8