MATH Level 5

MATH Level 5 is the hardest tier of the MATH benchmark, featuring competition-level problems from AMC, AIME, and Olympiad-style mathematics.

Models Tested: 89
Top Score: 98.1
Average Score: 56.9

Rank  Provider         Score
1     OpenAI           98.1
2     OpenAI           98.1
3     OpenAI           97.8
4     OpenAI           97.8
5     OpenAI           97.8
6     Anthropic        97.7
7     Alibaba Qwen     97.1
8     DeepSeek         96.6
9     OpenAI           96.5
10    Anthropic        96.4
11    OpenAI           95.2
12    OpenAI           94.7
13    DeepSeek         93.0
14    Anthropic        91.2
15    Anthropic        91.2
16    xAI              90.9
17    xAI              90.9
18    OpenAI           89.2
19    xAI              88.8
20    xAI              88.8
21    OpenAI           87.3
22    Anthropic        85.0
23    Anthropic        84.4
24    Google           83.5
25    OpenAI           83.0
26    OpenAI           81.7
27    Mistral AI       81.6
28    OpenAI           78.6
29    Google DeepMind  74.0
30    Google DeepMind  74.0
31    Google DeepMind  74.0
32    Google DeepMind  74.0
33    Google DeepMind  74.0
34    Google DeepMind  74.0
35    Meta             73.0
36    Google           70.4
37    OpenAI           70.0
38    Alibaba Qwen     68.9
39    Alibaba          67.2
40    Microsoft        64.9
41    DeepSeek         64.8
42    xAI              63.5
43    Alibaba Qwen     63.2
44    Meta             62.3
45    Google           61.9
46    OpenAI           53.3
47    OpenAI           53.3
48    OpenAI           53.3
49    OpenAI           53.3
50    OpenAI           52.6
51    OpenAI           52.6
52    Anthropic        51.7
53    Mistral AI       50.3
54    OpenAI           49.8
55    Meta             49.8
56    OpenAI           46.7
57    Anthropic        46.4
58    Mistral AI       44.8
59    Meta             41.6
60    Meta             41.6
61    Google           40.8
62    Meta             39.4
63    Alibaba          39.1
64    Anthropic        37.5
65    Meta             36.7
66    Google DeepMind  27.9
67    Google           25.1
68    Mistral AI       24.5
69    Mistral AI       24.2
70    OpenAI           23.0
71    OpenAI           23.0
72    OpenAI           23.0
73    OpenAI           23.0
74    Meta             22.9
75    Meta             22.6
76    Google DeepMind  21.0
77    Anthropic        18.2
78    Microsoft        17.6
79    Anthropic        14.9
80    Anthropic        11.7
81    OpenAI           11.6
82    OpenAI           11.6
83    OpenAI           11.6
84    Google           11.2
85    Mistral AI       10.8
86    Mistral AI       9.9
87    Meta             6.1
88    unknown          5.2
89    Meta             3.3