FrontierMath-Tier-4-2025-07-01-Private

FrontierMath Tier 4 (Jul 2025): the most challenging tier of frontier mathematics, containing problems that push the absolute limits of AI mathematical reasoning.

Models Tested: 39
Top Score: 37.5
Average Score: 7.6