OTIS Mock AIME 2024-2025
OTIS Mock AIME 2024-2025 · simulated American Invitational Mathematics Examination problems testing advanced problem-solving skills.
The Frontier
Best score over time · one chart, every benchmark
Full rankings
86 models tested · sorted by score
Score distribution
Where models cluster
Correlated benchmarks
Pearson r · original research
Benchmarks that track with OTIS Mock AIME 2024-2025
Pearson correlation across models scored on both benchmarks. Closer to 1 = strongly predictive.
Frequently asked
About OTIS Mock AIME 2024-2025
What does OTIS Mock AIME 2024-2025 measure?
OTIS Mock AIME 2024-2025 · simulated American Invitational Mathematics Examination problems testing advanced problem-solving skills. 86 AI models have been tested on it. Scores range from 0.5 to 96.1 out of 100.
Which model leads on OTIS Mock AIME 2024-2025?
GPT-5.2 from OpenAI leads OTIS Mock AIME 2024-2025 with a score of 96.1. The median score across 86 tested models is 34.9.
Is OTIS Mock AIME 2024-2025 saturated?
Yes · the top model on OTIS Mock AIME 2024-2025 has reached 96.1 out of 100, within 5% of the theoretical ceiling. This benchmark is approaching saturation and may be replaced by a harder successor.
Does OTIS Mock AIME 2024-2025 predict performance on other benchmarks?
Yes · OTIS Mock AIME 2024-2025 scores correlate 0.96 with OpenCompass · MMLU-Pro across 10 shared models. Models that do well on OTIS Mock AIME 2024-2025 tend to do well on OpenCompass · MMLU-Pro.
How often is OTIS Mock AIME 2024-2025 data refreshed?
BenchGecko pulls updates daily. New model scores on OTIS Mock AIME 2024-2025 appear as soon as they are published by Epoch AI or the model provider.
- Category
- Math
- Max score
- 100
- Models
- 86
- Updated
- 2026-03-05
Top on OTIS Mock AIME 2024-2025
GPT-5.2 · 96.1Gemini 3.1 Pro Preview · 95.6GPT-5.4 · 95.3Claude Opus 4.6 · 94.4Gemini 3 Flash Preview · 92.8More math benchmarks
Same category · related evaluations