Best AI Models for Math

AI models ranked by math benchmarks. Compare MATH-500, GSM8K, and competition-level math scores across all providers.

120 models, 14 providers, 53 open source, $0.72 median $/1M in
| # | Model | Provider | Open | Avg | FrontierMath | GSM8K | MATH Level 5 | OTIS Mock | $/1M in | Context |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Claude Instant | 🇺🇸 Anthropic | | 78.0 | - | 86.7 | - | - | N/A | 0K |
| 2 | GPT-5.4 Pro | 🇺🇸 OpenAI | | 66.7 | 50.0 | - | - | - | $30.00 | 1.1M |
| 3 | Qwen-14B | 🇨🇳 Alibaba Qwen | Open | 60.7 | - | 61.3 | - | - | N/A | 0K |
| 4 | Gemini 3.1 Pro Preview | 🇺🇸 Google DeepMind | | 60.6 | 36.9 | - | - | 95.6 | $2.00 | 1.0M |
| 5 | Gemini 3 Pro | 🇺🇸 Google DeepMind | | 60.5 | 37.6 | - | - | 91.4 | N/A | 0K |
| 6 | DeepSeek V3 | 🇨🇳 DeepSeek | Open | 59.0 | 1.7 | - | 64.8 | 15.8 | $0.32 | 164K |
| 7 | GPT-5.4 | 🇺🇸 OpenAI | | 59.0 | 47.6 | - | - | 95.3 | $2.50 | 1.1M |
| 8 | Muse Spark | Unknown | | 59.0 | 39.0 | - | - | 88.9 | N/A | 0K |
| 9 | phi-3-medium 14B | 🇺🇸 Microsoft | Open | 58.6 | - | - | 17.6 | - | N/A | 0K |
| 10 | Qwen3 Max | 🇨🇳 Alibaba Qwen | Open | 58.3 | - | - | 97.1 | 73.3 | $0.78 | 262K |
| 11 | Falcon 2 11B | TII | Open | 58.0 | - | 53.8 | - | - | N/A | 0K |
| 12 | R1 0528 | 🇨🇳 DeepSeek | Open | 57.9 | - | - | 96.6 | 66.4 | $0.50 | 164K |
| 13 | Mixtral 8x7B Instruct | 🇫🇷 Mistral AI | Open | 57.8 | - | 74.4 | 9.9 | - | $0.54 | 33K |
| 14 | GLM 5 | 🇨🇳 z-ai | Open | 57.6 | 16.4 | - | - | 80.0 | $0.72 | 80K |
| 15 | Claude Opus 4.6 | 🇺🇸 Anthropic | | 57.5 | 40.7 | - | - | 94.4 | $5.00 | 1.0M |
| 16 | o1 | 🇺🇸 OpenAI | | 56.4 | 9.3 | - | 94.7 | 73.3 | $15.00 | 200K |
| 17 | Qwen3 235B A22B | 🇨🇳 Alibaba Qwen | Open | 56.4 | - | - | 68.9 | - | $0.46 | 131K |
| 18 | Gemini 2.5 Pro | 🇺🇸 Google DeepMind | | 56.2 | 14.1 | - | 95.6 | 84.7 | $1.25 | 1.0M |
| 19 | GPT-5.2 Pro | 🇺🇸 OpenAI | | 56.2 | - | - | - | - | $21.00 | 400K |
| 20 | GPT-5 Mini | 🇺🇸 OpenAI | | 56.0 | 27.2 | - | 97.8 | 86.7 | $0.25 | 400K |
| 21 | Qwen3 235B A22B Thinking 2507 | 🇨🇳 Alibaba Qwen | Open | 55.9 | 8.5 | - | - | 86.7 | $0.15 | 131K |
| 22 | o3 | 🇺🇸 OpenAI | | 55.2 | 18.7 | - | 97.8 | 83.9 | $2.00 | 200K |
| 23 | GPT-4 (older v0314) | 🇺🇸 OpenAI | | 55.0 | - | 92.0 | - | 0.5 | $30.00 | 8K |
| 24 | Grok 4 | 🇺🇸 xAI | | 54.8 | 19.7 | - | - | 84.0 | $3.00 | 256K |
| 25 | GPT-5 | 🇺🇸 OpenAI | | 54.4 | 32.4 | - | 98.1 | 91.4 | $1.25 | 400K |
| 26 | GPT-5.2 | 🇺🇸 OpenAI | | 54.0 | 40.7 | - | - | 96.1 | $1.75 | 400K |
| 27 | Gemini 2.0 Pro | 🇺🇸 Google DeepMind | | 53.7 | - | - | 83.5 | - | N/A | 0K |
| 28 | Nemotron-4 15B | Unknown | | 53.4 | - | 46.0 | - | - | N/A | 0K |
| 29 | Kimi K2 Thinking | 🇨🇳 moonshotai | Open | 53.3 | 21.4 | - | - | 83.0 | $0.60 | 262K |
| 30 | o4 Mini | 🇺🇸 OpenAI | | 53.2 | 24.8 | - | 97.8 | 81.7 | $1.10 | 200K |
| 31 | Qwen2.5 72B Instruct | 🇨🇳 Alibaba Qwen | Open | 53.2 | - | - | 63.2 | 8.0 | $0.12 | 33K |
| 32 | Qwen2.5 Coder 32B Instruct | 🇨🇳 Alibaba Qwen | Open | 53.1 | - | 91.1 | - | - | $0.66 | 33K |
| 33 | DeepSeek V3.2 | 🇨🇳 DeepSeek | Open | 53.0 | 22.1 | - | - | 87.8 | $0.26 | 164K |
| 34 | Kimi K2.5 | 🇨🇳 moonshotai | Open | 52.0 | 27.9 | - | - | 92.2 | $0.38 | 262K |
| 35 | GPT-4o (2024-05-13) | 🇺🇸 OpenAI | | 51.1 | - | - | 51.0 | 6.2 | $5.00 | 128K |
| 36 | GPT-4 Turbo | 🇺🇸 OpenAI | | 51.0 | - | 90.0 | 23.0 | 1.0 | $10.00 | 128K |
| 37 | GLM 4.6 | 🇨🇳 z-ai | Open | 50.8 | 3.8 | - | - | - | $0.39 | 205K |
| 38 | GLM 4.7 | 🇨🇳 z-ai | Open | 50.5 | 2.4 | - | - | 83.3 | $0.39 | 203K |
| 39 | GPT-5.1 | 🇺🇸 OpenAI | | 49.6 | 31.0 | - | - | 88.6 | $1.25 | 400K |
| 40 | Gemini 3 Flash Preview | 🇺🇸 Google DeepMind | | 49.1 | 35.6 | - | - | 92.8 | $0.50 | 1.0M |
| 41 | Gemini 2.0 Flash | 🇺🇸 Google DeepMind | | 48.0 | 1.7 | - | 82.2 | 31.0 | $0.10 | 1.0M |
| 42 | Stable Beluga 2 | Unknown | | 47.8 | - | 69.6 | - | - | N/A | 0K |
| 43 | Claude 3.7 Sonnet | 🇺🇸 Anthropic | | 47.7 | 4.1 | - | 91.2 | 57.7 | $3.00 | 200K |
| 44 | Claude Sonnet 4.6 | 🇺🇸 Anthropic | | 47.6 | 32.4 | - | - | 85.8 | $3.00 | 1.0M |
| 45 | Gemini 1.5 Flash (May 2024) | 🇺🇸 Google DeepMind | | 47.4 | 0.1 | 82.4 | 25.1 | 3.8 | N/A | 0K |
| 46 | gpt-oss-120b | 🇺🇸 OpenAI | Open | 46.9 | - | - | - | 88.9 | $0.04 | 131K |
| 47 | Grok 3 Mini | 🇺🇸 xAI | | 46.6 | 5.9 | - | 90.9 | 77.8 | $0.30 | 131K |
| 48 | GPT-3.5 Turbo (older v0613) | 🇺🇸 OpenAI | | 45.8 | - | 57.8 | 11.6 | - | $1.00 | 4K |
| 49 | Mistral Large 2411 | 🇫🇷 Mistral AI | Open | 45.8 | 0.3 | - | 50.3 | 7.7 | $2.00 | 131K |
| 50 | Claude Opus 4.5 | 🇺🇸 Anthropic | | 45.4 | 20.7 | - | - | 86.1 | $5.00 | 200K |
| 51 | GPT-5 Nano | 🇺🇸 OpenAI | | 45.3 | 8.3 | - | 95.2 | 81.1 | $0.05 | 400K |
| 52 | R1 | 🇨🇳 DeepSeek | Open | 45.1 | - | - | 93.0 | 53.3 | $0.70 | 64K |
| 53 | Claude Sonnet 4 | 🇺🇸 Anthropic | | 44.6 | 4.1 | - | 84.4 | 71.1 | $3.00 | 1.0M |
| 54 | GPT-4.1 Mini | 🇺🇸 OpenAI | | 44.5 | 4.5 | - | 87.3 | 44.7 | $0.40 | 1.0M |
| 55 | Falcon-180B | TII | Open | 44.4 | - | 54.4 | - | - | N/A | 0K |
| 56 | Qwen2.5 Coder 7B Instruct | 🇨🇳 Alibaba Qwen | Open | 44.4 | - | 86.7 | - | - | $0.03 | 33K |
| 57 | GPT-4.1 | 🇺🇸 OpenAI | | 43.3 | 5.5 | - | 83.0 | 38.3 | $2.00 | 1.0M |
| 58 | GPT-5 Pro | 🇺🇸 OpenAI | | 43.3 | - | - | - | - | $15.00 | 400K |
| 59 | GPT-4o-mini (2024-07-18) | 🇺🇸 OpenAI | | 43.2 | - | 91.3 | 52.6 | 6.8 | $0.15 | 128K |
| 60 | Phi 4 | 🇺🇸 Microsoft | Open | 43.2 | - | - | 64.9 | 13.7 | $0.07 | 16K |
| 61 | Llama 2-13B | 🇺🇸 Meta | Open | 42.5 | - | 36.9 | 3.3 | - | N/A | 0K |
| 62 | Claude 3.5 Sonnet | 🇺🇸 Anthropic | | 42.3 | 1.0 | - | 51.7 | 6.4 | N/A | 0K |
| 63 | Gemma 3 27B | 🇺🇸 Google DeepMind | Open | 42.2 | - | - | 74.0 | 19.6 | $0.08 | 131K |
| 64 | Gemma 3 27B (free) | 🇺🇸 Google DeepMind | Open | 42.2 | - | - | 74.0 | 19.6 | Free | 131K |
| 65 | Claude Sonnet 4.5 | 🇺🇸 Anthropic | | 42.1 | 15.2 | - | 97.7 | 77.8 | $3.00 | 1.0M |
| 66 | Claude Opus 4 | 🇺🇸 Anthropic | | 41.7 | 4.5 | - | 85.0 | 64.4 | $15.00 | 200K |
| 67 | Mistral 7B V0.1 | 🇫🇷 Mistral AI | Open | 41.6 | - | 54.4 | - | - | N/A | 0K |
| 68 | o1-preview | 🇺🇸 OpenAI | | 41.5 | - | - | 81.7 | 31.0 | N/A | 0K |
| 69 | Claude Opus 4.1 | 🇺🇸 Anthropic | | 41.3 | 7.2 | - | - | 68.9 | $15.00 | 200K |
| 70 | Gemini 1.5 Pro (Feb 2024) | 🇺🇸 Google DeepMind | | 41.3 | - | - | 40.8 | 6.7 | N/A | 0K |
| 71 | Qwen2-72B | 🇨🇳 Alibaba Qwen | Open | 41.3 | - | - | 39.1 | - | N/A | 0K |
| 72 | Qwen2.5-Max | 🇨🇳 Alibaba Qwen | Open | 41.0 | 1.0 | - | 67.2 | 16.0 | N/A | 0K |
| 73 | Baichuan 2-7B | Unknown | | 40.3 | - | 24.6 | - | - | N/A | 0K |
| 74 | Gemini 2.5 Flash | 🇺🇸 Google DeepMind | | 40.0 | 4.8 | - | - | 73.0 | $0.30 | 1.0M |
| 75 | Mistral Medium 3 | 🇫🇷 Mistral AI | Open | 40.0 | 0.3 | - | 81.6 | 32.1 | $0.40 | 131K |
| 76 | GPT-4o-mini | 🇺🇸 OpenAI | | 39.6 | - | 91.3 | 52.6 | 6.8 | $0.15 | 128K |
| 77 | Mistral Large 2407 | 🇫🇷 Mistral AI | Open | 39.1 | - | - | 44.8 | 8.4 | $2.00 | 131K |
| 78 | Qwen2.5 Coder 1.5B Instruct | 🇨🇳 Alibaba | Open | 38.8 | - | 65.8 | - | - | N/A | 0K |
| 79 | Grok 3 | 🇺🇸 xAI | | 38.4 | 3.8 | - | 88.8 | 55.5 | $3.00 | 131K |
| 80 | o3 Mini | 🇺🇸 OpenAI | | 38.4 | 12.4 | - | 96.5 | 76.9 | $1.10 | 200K |
| 81 | Llama 3.1 405B | 🇺🇸 Meta | Open | 38.0 | - | - | 49.8 | 9.6 | N/A | 0K |
| 82 | Llama 3.1 70B Instruct | 🇺🇸 Meta | Open | 37.8 | - | - | 36.7 | 3.5 | $0.40 | 131K |
| 83 | Gemini 2.0 Flash Thinking (Jan 2025) | 🇺🇸 Google DeepMind | | 37.7 | - | - | - | 57.7 | N/A | 0K |
| 84 | GPT-4o (2024-11-20) | 🇺🇸 OpenAI | | 37.7 | 0.3 | - | 53.3 | 6.3 | $2.50 | 128K |
| 85 | Claude 2 | 🇺🇸 Anthropic | | 37.2 | - | - | 11.7 | 2.4 | N/A | 0K |
| 86 | Claude 3.5 Haiku | 🇺🇸 Anthropic | | 37.2 | 0.3 | - | 46.4 | 4.2 | $0.80 | 200K |
| 87 | Mistral Nemo | 🇫🇷 Mistral AI | Open | 37.2 | - | 84.2 | 10.8 | - | $0.02 | 131K |
| 88 | Claude Haiku 4.5 | 🇺🇸 Anthropic | | 37.1 | 5.9 | - | 96.4 | 66.6 | $1.00 | 200K |
| 89 | Llama 3.2 90B | 🇺🇸 Meta | Open | 36.1 | - | - | 39.4 | 2.5 | N/A | 0K |
| 90 | Gemma 2 9B | 🇺🇸 Google DeepMind | Open | 36.0 | - | 84.9 | 21.0 | 0.5 | $0.03 | 8K |
| 91 | GPT-4.5 | 🇺🇸 OpenAI | | 35.9 | - | - | 78.6 | 37.7 | N/A | 0K |
| 92 | GPT-4o (2024-08-06) | 🇺🇸 OpenAI | | 35.6 | 0.3 | - | 53.3 | 6.3 | $2.50 | 128K |
| 93 | GPT-4.1 Nano | 🇺🇸 OpenAI | | 35.2 | 1.0 | - | 70.0 | 28.8 | $0.10 | 1.0M |
| 94 | LLaMA-13B | 🇺🇸 Meta | Open | 34.9 | - | 20.6 | - | - | N/A | 0K |
| 95 | o1-mini | 🇺🇸 OpenAI | | 34.9 | 1.7 | - | 89.2 | 46.9 | N/A | 0K |
| 96 | Claude 3 Opus | 🇺🇸 Anthropic | | 33.7 | - | - | 37.5 | 4.6 | N/A | 0K |
| 97 | Grok-2 (Dec 2024) | 🇺🇸 xAI | | 33.2 | 0.7 | - | 63.5 | 11.4 | N/A | 0K |
| 98 | Gemma 2 27B | 🇺🇸 Google DeepMind | Open | 32.9 | - | - | 27.9 | 1.3 | $0.65 | 8K |
| 99 | Llama 3 70B Instruct | 🇺🇸 Meta | Open | 32.4 | - | - | 22.6 | 4.2 | $0.51 | 8K |
| 100 | MPT-30B | Unknown | | 31.7 | - | 34.4 | - | - | N/A | 0K |
| 101 | Yi 6B | Unknown | Open | 31.4 | - | 44.9 | 5.2 | - | N/A | 0K |
| 102 | Llama 3 8B Instruct | 🇺🇸 Meta | Open | 30.8 | - | - | 6.1 | 0.7 | $0.03 | 8K |
| 103 | Mistral Large | 🇫🇷 Mistral AI | Open | 30.0 | 0.3 | - | 24.5 | 1.9 | $2.00 | 128K |
| 104 | Gemma 2B | 🇺🇸 Google DeepMind | Open | 29.1 | - | 17.7 | - | - | N/A | 0K |
| 105 | Llama 3.3 70B Instruct (free) | 🇺🇸 Meta | Open | 29.1 | - | - | 41.6 | 5.0 | Free | 66K |
| 106 | Claude 3 Haiku | 🇺🇸 Anthropic | | 28.7 | - | - | 14.9 | 1.7 | $0.25 | 200K |
| 107 | Claude 3 Sonnet | 🇺🇸 Anthropic | | 28.3 | - | - | 18.2 | 2.4 | N/A | 0K |
| 108 | Llama 4 Maverick | 🇺🇸 Meta | Open | 28.0 | 0.7 | - | 73.0 | 20.5 | $0.15 | 1.0M |
| 109 | Llama 3.1 8B Instruct | 🇺🇸 Meta | Open | 27.4 | - | 82.4 | 22.9 | 2.4 | $0.02 | 16K |
| 110 | DeepSeek Coder 33B | 🇨🇳 DeepSeek | Open | 25.4 | - | 35.4 | - | - | N/A | 0K |
| 111 | StarCoder 2 15B | Unknown | Open | 24.3 | - | 57.7 | - | - | N/A | 0K |
| 112 | Baichuan1-7B | Unknown | | 23.7 | - | 9.2 | - | - | N/A | 0K |
| 113 | Mixtral 8x22B Instruct | 🇫🇷 Mistral AI | Open | 23.5 | - | - | 24.2 | - | $2.00 | 66K |
| 114 | Gemini 1.0 Pro | 🇺🇸 Google DeepMind | | 21.1 | - | - | 11.2 | 1.0 | N/A | 0K |
| 115 | Claude 2.1 | 🇺🇸 Anthropic | | 21.0 | - | - | - | 1.9 | N/A | 0K |
| 116 | INTELLECT-1 | Unknown | | 20.2 | - | 38.6 | - | - | N/A | 0K |
| 117 | Llama 4 Scout | 🇺🇸 Meta | Open | 18.9 | 0.1 | - | 62.3 | 7.7 | $0.08 | 328K |
| 118 | DeepSeek Coder 6.7B | 🇨🇳 DeepSeek | Open | 16.7 | - | 21.3 | - | - | N/A | 0K |
| 119 | Magistral Small 1.1 | Unknown | | 16.6 | - | - | - | 29.9 | N/A | 0K |
| 120 | DeepSeek Coder 1.3B | 🇨🇳 DeepSeek | Open | 3.2 | - | 4.4 | - | - | N/A | 0K |
Scores in % unless noted. Avg = unweighted mean across tested benchmarks.
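
The note above defines Avg as an unweighted mean across tested benchmarks. A minimal sketch of that arithmetic follows, under two assumptions that the page does not confirm: an untested benchmark is skipped rather than counted as zero, and the result is rounded to one decimal place. The function name, benchmark names, and scores are hypothetical and are not taken from the leaderboard.

```python
from statistics import mean
from typing import Optional


def unweighted_avg(scores: dict[str, Optional[float]]) -> Optional[float]:
    """Average only the benchmarks a model was tested on.

    Assumption: None marks an untested benchmark and is skipped,
    not treated as a score of 0.
    """
    tested = [s for s in scores.values() if s is not None]
    if not tested:
        return None  # no benchmark results at all, so no average
    # Every tested benchmark carries the same weight.
    return round(mean(tested), 1)


# Hypothetical model with one untested benchmark.
example = {"benchmark_a": 90.0, "benchmark_b": None, "benchmark_c": 70.0}
print(unweighted_avg(example))  # 80.0
```

Skipping untested benchmarks means two models can be averaged over different benchmark sets, so an Avg built this way is only loosely comparable between a model tested on one easy benchmark and a model tested on several hard ones.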

Models ranked by mathematical ability across MATH-500, GSM8K, FrontierMath, and competition-level problems. Scores reflect arithmetic, algebra, geometry, combinatorics, and proof-based reasoning.

Which AI model is best at math?

Math rankings are updated live. Check the leaderboard above for the current leader on MATH-500, GSM8K, and competition-level math benchmarks.

What is MATH-500?

MATH-500 is a curated subset of 500 problems drawn from the test set of the MATH benchmark, a collection of competition-level math problems. It spans multiple difficulty levels and topics, including algebra, number theory, and geometry.

Can AI solve competition math problems?

Top models now score above 95% on MATH-500 and are approaching human-competitive levels on AIME and Olympiad-style problems.