Best AI Models for Knowledge

AI models ranked by knowledge benchmarks. Compare MMLU-Pro, GPQA Diamond, SimpleQA, and other knowledge tests.

Models: 125
Providers: 14
Open Source: 57
Median $/1M in: $0.72
# | Model | Provider | Open | Avg | GPQA Diamond | MMLU | SimpleQA Verified | TriviaQA | $/1M in | Context
1 | Claude Mythos Preview | 🇺🇸 Anthropic | - | 81.8 | 94.5 | - | - | - | N/A | 1.0M
2 | Claude Instant | 🇺🇸 Anthropic | - | 78.0 | - | 64.5 | - | 78.9 | N/A | 0K
3 | DeepSeek-V2 (MoE-236B, May 2024) | 🇨🇳 DeepSeek | Open | 76.5 | - | 71.2 | - | 80.0 | N/A | 0K
4 | phi-3-small 7.4B | 🇺🇸 Microsoft | Open | 67.4 | - | 67.6 | - | 58.1 | N/A | 0K
5 | GPT-5.4 Pro | 🇺🇸 OpenAI | - | 66.7 | 92.8 | - | 47.8 | - | $30.00 | 1.1M
6 | phi-3-mini 3.8B | 🇺🇸 Microsoft | Open | 61.0 | - | 58.4 | - | 64.0 | N/A | 0K
7 | Qwen-14B | 🇨🇳 Alibaba Qwen | Open | 60.7 | - | 55.1 | - | - | N/A | 0K
8 | Gemini 3.1 Pro Preview | 🇺🇸 Google DeepMind | - | 60.6 | 92.1 | - | 77.3 | - | $2.00 | 1.0M
9 | Gemini 3 Pro | 🇺🇸 Google DeepMind | - | 60.5 | 90.2 | - | 72.9 | - | N/A | 0K
10 | DeepSeek V3 | 🇨🇳 DeepSeek | Open | 59.0 | 42.0 | 82.9 | - | 82.9 | $0.32 | 164K
11 | GPT-5.4 | 🇺🇸 OpenAI | - | 59.0 | 91.1 | - | 44.8 | - | $2.50 | 1.1M
12 | Muse Spark | Unknown | - | 59.0 | 86.4 | - | 66.3 | - | N/A | 0K
13 | phi-3-medium 14B | 🇺🇸 Microsoft | Open | 58.6 | 3.5 | 70.7 | - | 73.9 | N/A | 0K
14 | Qwen3 Max | 🇨🇳 Alibaba Qwen | Open | 58.3 | 63.5 | - | 67.5 | - | $0.78 | 262K
15 | Falcon 2 11B | TII | Open | 58.0 | - | 44.5 | - | - | N/A | 0K
16 | R1 0528 | 🇨🇳 DeepSeek | Open | 57.9 | 68.4 | - | 27.4 | - | $0.50 | 164K
17 | Mixtral 8x7B Instruct | 🇫🇷 Mistral AI | Open | 57.8 | 7.5 | 60.8 | - | 82.2 | $0.54 | 33K
18 | GLM 5 | 🇨🇳 z-ai | Open | 57.6 | 83.8 | - | - | - | $0.72 | 80K
19 | Claude Opus 4.6 | 🇺🇸 Anthropic | - | 57.5 | 87.4 | - | 46.5 | - | $5.00 | 1.0M
20 | o1 | 🇺🇸 OpenAI | - | 56.4 | 69.0 | - | - | - | $15.00 | 200K
21 | Qwen3 235B A22B | 🇨🇳 Alibaba Qwen | Open | 56.4 | 60.9 | - | - | - | $0.46 | 131K
22 | Gemini 2.5 Pro | 🇺🇸 Google DeepMind | - | 56.2 | 80.4 | - | 56.0 | - | $1.25 | 1.0M
23 | GPT-5 Mini | 🇺🇸 OpenAI | - | 56.0 | 66.7 | - | 21.0 | - | $0.25 | 400K
24 | Qwen3 235B A22B Thinking 2507 | 🇨🇳 Alibaba Qwen | Open | 55.9 | 73.4 | - | 50.1 | - | $0.15 | 131K
25 | o3 | 🇺🇸 OpenAI | - | 55.2 | 75.8 | - | 53.0 | - | $2.00 | 200K
26 | GPT-4 (older v0314) | 🇺🇸 OpenAI | - | 55.0 | 14.3 | 81.9 | - | - | $30.00 | 8K
27 | Grok 4 | 🇺🇸 xAI | - | 54.8 | 82.7 | - | 47.9 | - | $3.00 | 256K
28 | GPT-5 | 🇺🇸 OpenAI | - | 54.4 | 81.6 | - | 50.6 | - | $1.25 | 400K
29 | GPT-5.2 | 🇺🇸 OpenAI | - | 54.0 | 88.5 | - | 38.9 | - | $1.75 | 400K
30 | Gemini 2.0 Pro | 🇺🇸 Google DeepMind | - | 53.7 | 54.2 | - | - | - | N/A | 0K
31 | Nemotron-4 15B | Unknown | - | 53.4 | - | 44.9 | - | - | N/A | 0K
32 | Kimi K2 Thinking | 🇨🇳 moonshotai | Open | 53.3 | 79.0 | - | 31.6 | - | $0.60 | 262K
33 | o4 Mini | 🇺🇸 OpenAI | - | 53.2 | 72.8 | - | 23.9 | - | $1.10 | 200K
34 | Qwen2.5 72B Instruct | 🇨🇳 Alibaba Qwen | Open | 53.2 | 32.2 | 80.4 | - | 71.9 | $0.12 | 33K
35 | Qwen2.5 Coder 32B Instruct | 🇨🇳 Alibaba Qwen | Open | 53.1 | - | 72.1 | - | - | $0.66 | 33K
36 | DeepSeek V3.2 | 🇨🇳 DeepSeek | Open | 53.0 | 77.9 | - | 27.5 | - | $0.26 | 164K
37 | Kimi K2.5 | 🇨🇳 moonshotai | Open | 52.0 | 83.5 | - | 33.9 | - | $0.38 | 262K
38 | GPT-4o (2024-05-13) | 🇺🇸 OpenAI | - | 51.1 | 31.9 | 78.9 | - | - | $5.00 | 128K
39 | GPT-4 Turbo | 🇺🇸 OpenAI | - | 51.0 | 7.5 | 76.5 | - | 84.8 | $10.00 | 128K
40 | GLM 4.7 | 🇨🇳 z-ai | Open | 50.5 | 77.8 | - | 31.5 | - | $0.39 | 203K
41 | GPT-5.1 | 🇺🇸 OpenAI | - | 49.6 | 83.5 | - | 48.9 | - | $1.25 | 400K
42 | Gemini 3 Flash Preview | 🇺🇸 Google DeepMind | - | 49.1 | 77.6 | - | 67.4 | - | $0.50 | 1.0M
43 | Gemini 2.0 Flash | 🇺🇸 Google DeepMind | - | 48.0 | 52.2 | 72.9 | - | - | $0.10 | 1.0M
44 | Stable Beluga 2 | Unknown | - | 47.8 | - | 58.1 | - | - | N/A | 0K
45 | Claude 3.7 Sonnet | 🇺🇸 Anthropic | - | 47.7 | 73.0 | - | - | - | $3.00 | 200K
46 | Claude Sonnet 4.6 | 🇺🇸 Anthropic | - | 47.6 | 83.2 | - | 29.0 | - | $3.00 | 1.0M
47 | Gemini 1.5 Flash (May 2024) | 🇺🇸 Google DeepMind | - | 47.4 | 20.5 | 70.5 | - | - | N/A | 0K
48 | gpt-oss-120b | 🇺🇸 OpenAI | Open | 46.9 | 67.7 | - | 13.9 | - | $0.04 | 131K
49 | Grok 3 Mini | 🇺🇸 xAI | - | 46.6 | 68.3 | - | 21.1 | - | $0.30 | 131K
50 | GPT-3.5 Turbo (older v0613) | 🇺🇸 OpenAI | - | 45.8 | 2.9 | 56.4 | - | 85.8 | $1.00 | 4K
51 | Mistral Large 2411 | 🇫🇷 Mistral AI | Open | 45.8 | 35.1 | - | - | - | $2.00 | 131K
52 | Claude Opus 4.5 | 🇺🇸 Anthropic | - | 45.4 | 81.4 | - | 41.8 | - | $5.00 | 200K
53 | GPT-5 Nano | 🇺🇸 OpenAI | - | 45.3 | 59.3 | - | 12.2 | - | $0.05 | 400K
54 | R1 | 🇨🇳 DeepSeek | Open | 45.1 | 62.3 | - | 27.4 | - | $0.70 | 64K
55 | Claude Sonnet 4 | 🇺🇸 Anthropic | - | 44.6 | 72.3 | - | - | - | $3.00 | 1.0M
56 | GPT-4.1 Mini | 🇺🇸 OpenAI | - | 44.5 | 54.5 | - | - | - | $0.40 | 1.0M
57 | Falcon-180B | TII | Open | 44.4 | - | 60.8 | - | 79.9 | N/A | 0K
58 | Qwen2.5 Coder 7B Instruct | 🇨🇳 Alibaba Qwen | Open | 44.4 | - | 57.3 | - | - | $0.03 | 33K
59 | GPT-4.1 | 🇺🇸 OpenAI | - | 43.3 | 55.9 | - | - | - | $2.00 | 1.0M
60 | GPT-4o-mini (2024-07-18) | 🇺🇸 OpenAI | - | 43.2 | 17.0 | 75.7 | - | - | $0.15 | 128K
61 | Phi 4 | 🇺🇸 Microsoft | Open | 43.2 | 41.4 | 79.7 | - | - | $0.07 | 16K
62 | Llama 2-13B | 🇺🇸 Meta | Open | 42.5 | 1.8 | 40.8 | - | 79.6 | N/A | 0K
63 | Claude 3.5 Sonnet | 🇺🇸 Anthropic | - | 42.3 | 38.7 | 82.0 | - | - | N/A | 0K
64 | Gemma 3 27B | 🇺🇸 Google DeepMind | Open | 42.2 | 31.8 | - | - | - | $0.08 | 131K
65 | Gemma 3 27B (free) | 🇺🇸 Google DeepMind | Open | 42.2 | 31.8 | - | - | - | Free | 131K
66 | Claude Sonnet 4.5 | 🇺🇸 Anthropic | - | 42.1 | 76.4 | - | 23.6 | - | $3.00 | 1.0M
67 | Claude Opus 4 | 🇺🇸 Anthropic | - | 41.7 | 68.3 | - | - | - | $15.00 | 200K
68 | Mistral 7B V0.1 | 🇫🇷 Mistral AI | Open | 41.6 | - | 50.0 | - | 75.2 | N/A | 0K
69 | o1-preview | 🇺🇸 OpenAI | - | 41.5 | 33.8 | - | - | - | N/A | 0K
70 | Claude Opus 4.1 | 🇺🇸 Anthropic | - | 41.3 | 69.7 | - | 34.8 | - | $15.00 | 200K
71 | Gemini 1.5 Pro (Feb 2024) | 🇺🇸 Google DeepMind | - | 41.3 | 27.8 | 76.9 | - | - | N/A | 0K
72 | Qwen2-72B | 🇨🇳 Alibaba Qwen | Open | 41.3 | 21.0 | 76.5 | - | - | N/A | 0K
73 | Qwen2.5-Max | 🇨🇳 Alibaba Qwen | Open | 41.0 | 41.5 | - | - | - | N/A | 0K
74 | Baichuan 2-7B | Unknown | - | 40.3 | - | 38.9 | - | - | N/A | 0K
75 | Mistral Medium 3 | 🇫🇷 Mistral AI | Open | 40.0 | 46.0 | - | - | - | $0.40 | 131K
76 | GPT-4o-mini | 🇺🇸 OpenAI | - | 39.6 | 17.0 | 75.7 | - | - | $0.15 | 128K
77 | Mistral Large 2407 | 🇫🇷 Mistral AI | Open | 39.1 | 32.0 | 73.3 | - | - | $2.00 | 131K
78 | Qwen2.5 Coder 1.5B Instruct | 🇨🇳 Alibaba | Open | 38.8 | - | 38.1 | - | - | N/A | 0K
79 | Grok 3 | 🇺🇸 xAI | - | 38.4 | 67.7 | - | - | - | $3.00 | 131K
80 | o3 Mini | 🇺🇸 OpenAI | - | 38.4 | 69.4 | - | - | - | $1.10 | 200K
81 | Llama 3.1 405B | 🇺🇸 Meta | Open | 38.0 | 34.5 | 79.3 | - | 82.7 | N/A | 0K
82 | Llama 3.1 70B Instruct | 🇺🇸 Meta | Open | 37.8 | 25.6 | 73.5 | - | - | $0.40 | 131K
83 | Gemini 2.0 Flash Thinking (Jan 2025) | 🇺🇸 Google DeepMind | - | 37.7 | 42.8 | - | - | - | N/A | 0K
84 | GPT-4o (2024-11-20) | 🇺🇸 OpenAI | - | 37.7 | 32.3 | 79.1 | - | - | $2.50 | 128K
85 | Claude 2 | 🇺🇸 Anthropic | - | 37.2 | 12.9 | 71.3 | - | 87.5 | N/A | 0K
86 | Claude 3.5 Haiku | 🇺🇸 Anthropic | - | 37.2 | 17.5 | 65.7 | 6.7 | - | $0.80 | 200K
87 | Mistral Nemo | 🇫🇷 Mistral AI | Open | 37.2 | 6.5 | - | - | - | $0.02 | 131K
88 | Claude Haiku 4.5 | 🇺🇸 Anthropic | - | 37.1 | 61.6 | - | 5.9 | - | $1.00 | 200K
89 | Llama 3.2 90B | 🇺🇸 Meta | Open | 36.1 | 21.4 | 73.7 | - | - | N/A | 0K
90 | Gemma 2 9B | 🇺🇸 Google DeepMind | Open | 36.0 | 3.3 | 62.8 | - | - | $0.03 | 8K
91 | GPT-4.5 | 🇺🇸 OpenAI | - | 35.9 | 58.3 | - | - | - | N/A | 0K
92 | GPT-4o (2024-08-06) | 🇺🇸 OpenAI | - | 35.6 | 32.3 | 79.1 | - | - | $2.50 | 128K
93 | GPT-4.1 Nano | 🇺🇸 OpenAI | - | 35.2 | 31.9 | - | - | - | $0.10 | 1.0M
94 | LLaMA-13B | 🇺🇸 Meta | Open | 34.9 | - | 30.3 | - | 77.9 | N/A | 0K
95 | o1-mini | 🇺🇸 OpenAI | - | 34.9 | 49.8 | - | - | - | N/A | 0K
96 | XGen-7B | Unknown | - | 33.9 | - | 15.1 | - | - | N/A | 0K
97 | Claude 3 Opus | 🇺🇸 Anthropic | - | 33.7 | 29.6 | 79.5 | - | - | N/A | 0K
98 | Grok-2 (Dec 2024) | 🇺🇸 xAI | - | 33.2 | 38.4 | - | - | - | N/A | 0K
99 | Gemma 2 27B | 🇺🇸 Google DeepMind | Open | 32.9 | 15.3 | 67.6 | - | - | $0.65 | 8K
100 | Llama 3 70B Instruct | 🇺🇸 Meta | Open | 32.4 | 20.8 | 72.4 | - | - | $0.51 | 8K
101 | MPT-30B | Unknown | - | 31.7 | - | 30.5 | - | 73.6 | N/A | 0K
102 | Yi 6B | Unknown | Open | 31.4 | - | 52.0 | - | - | N/A | 0K
103 | Llama 3 8B Instruct | 🇺🇸 Meta | Open | 30.8 | 1.4 | 58.4 | - | 67.7 | $0.03 | 8K
104 | Phi 2 | 🇺🇸 Microsoft | Open | 30.2 | - | 44.5 | - | 45.2 | N/A | 0K
105 | Mistral Large | 🇫🇷 Mistral AI | Open | 30.0 | 18.4 | 58.4 | - | - | $2.00 | 128K
106 | Dolly 2.0-12b | Unknown | - | 29.2 | - | 1.6 | - | - | N/A | 0K
107 | Gemma 2B | 🇺🇸 Google DeepMind | Open | 29.1 | - | 23.1 | - | 53.2 | N/A | 0K
108 | Llama 3.3 70B Instruct (free) | 🇺🇸 Meta | Open | 29.1 | 29.9 | 81.7 | - | - | Free | 66K
109 | Claude 3 Haiku | 🇺🇸 Anthropic | - | 28.7 | 15.1 | 65.1 | - | - | $0.25 | 200K
110 | Claude 3 Sonnet | 🇺🇸 Anthropic | - | 28.3 | 20.8 | 67.9 | - | - | N/A | 0K
111 | Llama 4 Maverick | 🇺🇸 Meta | Open | 28.0 | 56.0 | - | - | - | $0.15 | 1.0M
112 | Llama 3.1 8B Instruct | 🇺🇸 Meta | Open | 27.4 | 1.3 | 41.5 | - | - | $0.02 | 16K
113 | DeepSeek Coder 33B | 🇨🇳 DeepSeek | Open | 25.4 | - | 19.2 | - | - | N/A | 0K
114 | StarCoder 2 15B | Unknown | Open | 24.3 | - | 52.1 | - | - | N/A | 0K
115 | Baichuan1-7B | Unknown | - | 23.7 | - | 23.1 | - | - | N/A | 0K
116 | Mixtral 8x22B Instruct | 🇫🇷 Mistral AI | Open | 23.5 | 12.1 | 70.4 | - | - | $2.00 | 66K
117 | Cerebras-GPT-13B | 🇺🇸 OpenAI | - | 23.4 | - | 1.6 | - | - | N/A | 0K
118 | Gemini 1.0 Pro | 🇺🇸 Google DeepMind | - | 21.1 | 11.9 | 60.0 | - | - | N/A | 0K
119 | Claude 2.1 | 🇺🇸 Anthropic | - | 21.0 | 10.6 | 64.7 | - | - | N/A | 0K
120 | INTELLECT-1 | Unknown | - | 20.2 | - | 33.2 | - | - | N/A | 0K
121 | Llama 4 Scout | 🇺🇸 Meta | Open | 18.9 | 35.8 | - | - | - | $0.08 | 328K
122 | DeepSeek Coder 6.7B | 🇨🇳 DeepSeek | Open | 16.7 | - | 15.2 | - | - | N/A | 0K
123 | Magistral Small 1.1 | Unknown | - | 16.6 | 31.2 | - | - | - | N/A | 0K
124 | Phi-1.5 | 🇺🇸 Microsoft | Open | 16.3 | - | 16.8 | - | - | N/A | 0K
125 | DeepSeek Coder 1.3B | 🇨🇳 DeepSeek | Open | 3.2 | - | 1.1 | - | - | N/A | 0K
Color bands: 90+ (gold), 80-89, 70-79, 60-69, <60.
Scores in % unless noted. Avg = unweighted mean across tested benchmarks.
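
As a rough illustration of the Avg definition, the sketch below averages only the benchmarks a model was actually tested on and skips the dashes. The function name `unweighted_mean` and the dictionary layout are assumptions made for this example, not the leaderboard's own code. The listed Avg values appear to include benchmarks that are not shown as columns (MMLU-Pro, for instance), so the mean of the visible columns will generally differ from the Avg in the table.

```python
from typing import Optional

def unweighted_mean(scores: dict[str, Optional[float]]) -> Optional[float]:
    """Average the benchmarks that have a score; None marks an untested benchmark."""
    tested = [s for s in scores.values() if s is not None]
    if not tested:
        return None  # no benchmark was run, so no average exists
    return sum(tested) / len(tested)

# Visible columns for DeepSeek V3 (rank 10); a "-" in the table becomes None.
deepseek_v3 = {
    "GPQA Diamond": 42.0,
    "MMLU": 82.9,
    "SimpleQA Verified": None,
    "TriviaQA": 82.9,
}

visible_avg = unweighted_mean(deepseek_v3)
print(round(visible_avg, 1))  # 69.3, versus the listed Avg of 59.0
```

The gap between 69.3 and 59.0 suggests that the site's average draws on more benchmarks than the four shown here.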

Models ranked by factual knowledge across MMLU-Pro, MMLU, GPQA Diamond, SimpleQA Verified, and other knowledge benchmarks. These tests cover 57+ academic subjects from STEM to humanities.

Which AI model has the most knowledge?

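Knowledge rankings depend on which benchmarks are counted. By the Avg score in the leaderboard above, Claude Mythos Preview ranks first (81.8), followed by Claude Instant (78.0) and DeepSeek-V2 (76.5). Many models have scores on only some of the benchmarks, so a high Avg can rest on one or two tests.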
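
To see how much the ordering depends on the benchmark chosen, here is a small sketch that sorts five rows from the leaderboard by two different columns. The helper `rank_by` and the tuple layout are hypothetical names for this example; the scores are copied from the table.

```python
# A few rows copied from the leaderboard: (model, Avg, GPQA Diamond).
rows = [
    ("Claude Mythos Preview", 81.8, 94.5),
    ("GPT-5.4 Pro", 66.7, 92.8),
    ("DeepSeek V3", 59.0, 42.0),
    ("phi-3-medium 14B", 58.6, 3.5),
    ("GPT-5.2", 54.0, 88.5),
]

def rank_by(rows, column):
    """Return model names sorted from highest to lowest on the given column index."""
    return [r[0] for r in sorted(rows, key=lambda r: r[column], reverse=True)]

by_avg = rank_by(rows, 1)    # order by the Avg column
by_gpqa = rank_by(rows, 2)   # order by GPQA Diamond only

print(by_avg)
print(by_gpqa)
```

In this subset, GPT-5.2 is last by Avg but third by GPQA Diamond, while phi-3-medium 14B drops from fourth to last.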

What is MMLU-Pro?

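MMLU-Pro is a harder version of the Massive Multitask Language Understanding (MMLU) benchmark, with up to 10 answer choices per question instead of 4 and more reasoning-heavy questions. The original MMLU spans 57 subjects; MMLU-Pro groups its questions into 14 broader disciplines.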
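
One way to see why more answer choices make the test harder: the score a model would get by guessing uniformly at random drops from 25% to 10%. The sketch below assumes every question has exactly the stated number of options, which is a simplification, and `chance_accuracy` is a name invented for this example.

```python
def chance_accuracy(num_choices: int) -> float:
    """Expected accuracy (in %) of uniform random guessing among num_choices options."""
    if num_choices < 1:
        raise ValueError("num_choices must be at least 1")
    return 100.0 / num_choices

print(chance_accuracy(4))   # 25.0 (four-option format)
print(chance_accuracy(10))  # 10.0 (ten-option format)
```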