Benchmark · MathCompetitive

FrontierMath-2025-02-28-Private

FrontierMath (Feb 2025) · original research-level math problems created by mathematicians, testing capabilities at the boundary of current AI mathematical reasoning.

Updated 2026-03-05
Models tested
54
Top score
50.0
GPT-5.4 Pro
Median
6.6
min 0.1
Top-5 spread
σ 4.4
Competitive

Best score over time · one chart, every benchmark

FRONTIERMATH-2025-02-28-PRIVATE46 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Aug 24Dec 24May 25Oct 25Mar 26RELEASE DATE →benchgecko.ai/benchmark/frontiermath-2025-02-28-private · frontier
Frontier on FrontierMath-2025-02-28-Private rose from 9.3 to 50.0 in 15 months · +40.7 points · latest leader GPT-5.4 Pro from OpenAI.
Pink dots = frontier records · 8 totalClick to open model page

54 models tested · sorted by score

#ModelScore
1OpenAI logoGPT-5.4 Pro50.0
2OpenAI logoGPT-5.447.6
3Anthropic logoClaude Opus 4.640.7
4OpenAI logoGPT-5.240.7
5
U
Muse Spark
39.0
6Google DeepMind logoGemini 3 Pro37.6
7Google DeepMind logoGemini 3.1 Pro Preview36.9
8Google DeepMind logoGemini 3 Flash Preview35.6
9OpenAI logoGPT-532.4
10Anthropic logoClaude Sonnet 4.632.4
11OpenAI logoGPT-5.131.0
12moonshotai logoKimi K2.527.9
13OpenAI logoGPT-5 Mini27.2
14OpenAI logoo4 Mini24.8
15DeepSeek logoDeepSeek V3.222.1
16moonshotai logoKimi K2 Thinking21.4
17Anthropic logoClaude Opus 4.520.7
18xAI logoGrok 419.7
19OpenAI logoo318.7
20z-ai logoGLM 516.4
21Anthropic logoClaude Sonnet 4.515.2
22Google DeepMind logoGemini 2.5 Pro14.1
23OpenAI logoo3 Mini12.4
24OpenAI logoo19.3
25Alibaba Qwen logoQwen3 235B A22B Thinking 25078.5
26OpenAI logoGPT-5 Nano8.3
27Anthropic logoClaude Opus 4.17.2
28Anthropic logoClaude Haiku 4.55.9
29xAI logoGrok 3 Mini5.9
30OpenAI logoGPT-4.15.5
31Google DeepMind logoGemini 2.5 Flash4.8
32Anthropic logoClaude Opus 44.5
33OpenAI logoGPT-4.1 Mini4.5
34Anthropic logoClaude 3.7 Sonnet4.1
35Anthropic logoClaude Sonnet 44.1
36z-ai logoGLM 4.63.8
37xAI logoGrok 33.8
38z-ai logoGLM 4.72.4
39DeepSeek logoDeepSeek V31.7
40Google DeepMind logoGemini 2.0 Flash1.7
41OpenAI logoo1-mini1.7
42Anthropic logoClaude 3.5 Sonnet1.0
43OpenAI logoGPT-4.1 Nano1.0
44Alibaba Qwen logoQwen2.5-Max1.0
45xAI logoGrok-2 (Dec 2024)0.7
46Meta logoLlama 4 Maverick0.7
47Mistral AI logoMistral Medium 30.3
48Anthropic logoClaude 3.5 Haiku0.3
49OpenAI logoGPT-4o (2024-08-06)0.3
50OpenAI logoGPT-4o (2024-11-20)0.3
51Mistral AI logoMistral Large0.3
52Mistral AI logoMistral Large 24110.3
53Google DeepMind logoGemini 1.5 Flash (May 2024)0.1
54Meta logoLlama 4 Scout0.1

Same category · related evaluations