MATH level 5
MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.
The Frontier
Best score over time · one chart, every benchmark
Full rankings
72 models tested · sorted by score
Score distribution
Where models cluster
Correlated benchmarks
Pearson r · original research
Benchmarks that track with MATH level 5
Pearson correlation across models scored on both benchmarks. Closer to 1 = strongly predictive.
Frequently asked
About MATH level 5
What does MATH level 5 measure?
MATH Level 5 · the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics. 72 AI models have been tested on it. Scores range from 3.3 to 98.1 out of 100.
Which model leads on MATH level 5?
GPT-5 from OpenAI leads MATH level 5 with a score of 98.1. The median score across 72 tested models is 62.7.
Is MATH level 5 saturated?
Yes · the top model on MATH level 5 has reached 98.1 out of 100, within 5% of the theoretical ceiling. This benchmark is approaching saturation and may be replaced by a harder successor.
Does MATH level 5 predict performance on other benchmarks?
Yes · MATH level 5 scores correlate 0.98 with MATH Level 5 across 9 shared models. Models that do well on MATH level 5 tend to do well on MATH Level 5.
How often is MATH level 5 data refreshed?
BenchGecko pulls updates daily. New model scores on MATH level 5 appear as soon as they are published by Epoch AI or the model provider.
- Category
- Math
- Max score
- 100
- Models
- 72
- Updated
- 2025-10-15
More math benchmarks
Same category · related evaluations