Benchmark · KnowledgeSettled

OpenCompass · AIME2025

Updated 2026-02-16
Models tested
32
Top score
96.0
DeepSeek V3.2 Speciale
Median
87.3
min 22.4
Top-5 spread
σ 0.7
Settled

Best score over time · one chart, every benchmark

OPENCOMPASS · AIME202532 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Mar 25Jun 25Aug 25Nov 25Feb 26RELEASE DATE →benchgecko.ai/benchmark/oc-aime2025 · frontier
Frontier on OpenCompass · AIME2025 rose from 22.4 to 96.0 in 9 months · +73.6 points · latest leader DeepSeek V3.2 Speciale from DeepSeek.
Pink dots = frontier records · 7 totalClick to open model page

Same category · related evaluations