Best AI Models for Knowledge

AI models ranked by knowledge benchmarks. Compare MMLU-Pro, GPQA Diamond, SimpleQA, and other knowledge tests.

127
Models
14
Providers
57
Open Source
$0.78
Median $/1M in
#ModelAvgGPQA diamondMMLUsimpleqa vTriviaQA$/1M inContext
1OpenAI logoGPT-5.5 Pro🇺🇸 OpenAI87.894.2---$30.00400K
2OpenAI logoGPT-5.5🇺🇸 OpenAI85.093.6---$5.00400K
3Anthropic logoClaude Mythos Preview🇺🇸 Anthropic81.894.5---N/A1.0M
4Anthropic logoClaude Instant🇺🇸 Anthropic78.0-64.5-78.9N/A0K
5DeepSeek logoDeepSeek-V2 (MoE-236B, May 2024)🇨🇳 DeepSeekOpen76.5-71.2-80.0N/A0K
6Microsoft logophi-3-small 7.4B🇺🇸 MicrosoftOpen67.4-67.6-58.1N/A0K
7OpenAI logoGPT-5.4 Pro🇺🇸 OpenAI66.792.8-47.8-$30.001.1M
8Microsoft logophi-3-mini 3.8B🇺🇸 MicrosoftOpen61.0-58.4-64.0N/A0K
9Alibaba Qwen logoQwen-14B🇨🇳 Alibaba QwenOpen60.7-55.1--N/A0K
10Google DeepMind logoGemini 3.1 Pro Preview🇺🇸 Google DeepMind60.692.1-77.3-$2.001.0M
11Google DeepMind logoGemini 3 Pro🇺🇸 Google DeepMind60.590.2-72.9-N/A0K
12DeepSeek logoDeepSeek V3🇨🇳 DeepSeekOpen59.042.082.9-82.9$0.32164K
13OpenAI logoGPT-5.4🇺🇸 OpenAI59.091.1-44.8-$2.501.1M
14
U
Muse Spark Unknown
59.086.4-66.3-N/A0K
15Microsoft logophi-3-medium 14B🇺🇸 MicrosoftOpen58.63.570.7-73.9N/A0K
16Alibaba Qwen logoQwen3 Max🇨🇳 Alibaba QwenOpen58.363.5-67.5-$0.78262K
17TII logoFalcon 2 11B TIIOpen58.0-44.5--N/A0K
18DeepSeek logoR1 0528🇨🇳 DeepSeekOpen57.968.4-27.4-$0.50164K
19Mistral AI logoMixtral 8x7B Instruct🇫🇷 Mistral AIOpen57.87.560.8-82.2$0.5433K
20z-ai logoGLM 5🇨🇳 z-aiOpen57.683.8---$0.60203K
21Anthropic logoClaude Opus 4.6🇺🇸 Anthropic57.587.4-46.5-$5.001.0M
22OpenAI logoo1🇺🇸 OpenAI56.469.0---$15.00200K
23Alibaba Qwen logoQwen3 235B A22B🇨🇳 Alibaba QwenOpen56.460.9---$0.46131K
24Google DeepMind logoGemini 2.5 Pro🇺🇸 Google DeepMind56.280.4-56.0-$1.251.0M
25OpenAI logoGPT-5 Mini🇺🇸 OpenAI56.066.7-21.0-$0.25400K
26Alibaba Qwen logoQwen3 235B A22B Thinking 2507🇨🇳 Alibaba QwenOpen55.973.4-50.1-$0.15131K
27OpenAI logoo3🇺🇸 OpenAI55.275.8-53.0-$2.00200K
28OpenAI logoGPT-4 (older v0314)🇺🇸 OpenAI55.014.381.9--$30.008K
29xAI logoGrok 4🇺🇸 xAI54.882.7-47.9-$3.00256K
30OpenAI logoGPT-5🇺🇸 OpenAI54.481.6-50.6-$1.25400K
31OpenAI logoGPT-5.2🇺🇸 OpenAI54.088.5-38.9-$1.75400K
32Google DeepMind logoGemini 2.0 Pro🇺🇸 Google DeepMind53.754.2---N/A0K
33
U
Nemotron-4 15B Unknown
53.4-44.9--N/A0K
34moonshotai logoKimi K2 Thinking🇨🇳 moonshotaiOpen53.379.0-31.6-$0.60262K
35OpenAI logoo4 Mini🇺🇸 OpenAI53.272.8-23.9-$1.10200K
36Alibaba Qwen logoQwen2.5 72B Instruct🇨🇳 Alibaba QwenOpen53.232.280.4-71.9$0.3633K
37Alibaba Qwen logoQwen2.5 Coder 32B Instruct🇨🇳 Alibaba QwenOpen53.1-72.1--$0.6633K
38DeepSeek logoDeepSeek V3.2🇨🇳 DeepSeekOpen53.077.9-27.5-$0.25131K
39moonshotai logoKimi K2.5🇨🇳 moonshotaiOpen52.083.5-33.9-$0.44262K
40OpenAI logoGPT-4o (2024-05-13)🇺🇸 OpenAI51.131.978.9--$5.00128K
41OpenAI logoGPT-4 Turbo🇺🇸 OpenAI51.07.576.5-84.8$10.00128K
42z-ai logoGLM 4.7🇨🇳 z-aiOpen50.577.8-31.5-$0.38203K
43OpenAI logoGPT-5.1🇺🇸 OpenAI49.683.5-48.9-$1.25400K
44Google DeepMind logoGemini 3 Flash Preview🇺🇸 Google DeepMind49.177.6-67.4-$0.501.0M
45Google DeepMind logoGemini 2.0 Flash🇺🇸 Google DeepMind48.052.272.9--$0.101.0M
46
U
Stable Beluga 2 Unknown
47.8-58.1--N/A0K
47Anthropic logoClaude 3.7 Sonnet🇺🇸 Anthropic47.773.0---$3.00200K
48Anthropic logoClaude Sonnet 4.6🇺🇸 Anthropic47.683.2-29.0-$3.001.0M
49Google DeepMind logoGemini 1.5 Flash (May 2024)🇺🇸 Google DeepMind47.420.570.5--N/A0K
50OpenAI logogpt-oss-120b🇺🇸 OpenAIOpen46.967.7-13.9-$0.04131K
51xAI logoGrok 3 Mini🇺🇸 xAI46.668.3-21.1-$0.30131K
52OpenAI logoGPT-3.5 Turbo (older v0613)🇺🇸 OpenAI45.82.956.4-85.8$1.004K
53Mistral AI logoMistral Large 2411🇫🇷 Mistral AIOpen45.835.1---$2.00131K
54Anthropic logoClaude Opus 4.5🇺🇸 Anthropic45.481.4-41.8-$5.00200K
55OpenAI logoGPT-5 Nano🇺🇸 OpenAI45.359.3-12.2-$0.05400K
56DeepSeek logoR1🇨🇳 DeepSeekOpen45.162.3-27.4-$0.7064K
57Anthropic logoClaude Sonnet 4🇺🇸 Anthropic44.672.3---$3.001.0M
58OpenAI logoGPT-4.1 Mini🇺🇸 OpenAI44.554.5---$0.401.0M
59TII logoFalcon-180B TIIOpen44.4-60.8-79.9N/A0K
60Alibaba Qwen logoQwen2.5 Coder 7B Instruct🇨🇳 Alibaba QwenOpen44.4-57.3--$0.0333K
61OpenAI logoGPT-4.1🇺🇸 OpenAI43.355.9---$2.001.0M
62OpenAI logoGPT-4o-mini (2024-07-18)🇺🇸 OpenAI43.217.075.7--$0.15128K
63Microsoft logoPhi 4🇺🇸 MicrosoftOpen43.241.479.7--$0.0716K
64Meta logoLlama 2-13B🇺🇸 MetaOpen42.51.840.8-79.6N/A0K
65Anthropic logoClaude 3.5 Sonnet🇺🇸 Anthropic42.338.782.0--N/A0K
66Google DeepMind logoGemma 3 27B🇺🇸 Google DeepMindOpen42.231.8---$0.08131K
67Google DeepMind logoGemma 3 27B (free)🇺🇸 Google DeepMindOpen42.231.8---Free131K
68Anthropic logoClaude Sonnet 4.5🇺🇸 Anthropic42.176.4-23.6-$3.001.0M
69Anthropic logoClaude Opus 4🇺🇸 Anthropic41.768.3---$15.00200K
70Mistral AI logoMistral 7B V0.1🇫🇷 Mistral AIOpen41.6-50.0-75.2N/A0K
71OpenAI logoo1-preview🇺🇸 OpenAI41.533.8---N/A0K
72Anthropic logoClaude Opus 4.1🇺🇸 Anthropic41.369.7-34.8-$15.00200K
73Google DeepMind logoGemini 1.5 Pro (Feb 2024)🇺🇸 Google DeepMind41.327.876.9--N/A0K
74Alibaba Qwen logoQwen2-72B🇨🇳 Alibaba QwenOpen41.321.076.5--N/A0K
75Alibaba Qwen logoQwen2.5-Max🇨🇳 Alibaba QwenOpen41.041.5---N/A0K
76
U
Baichuan 2-7B Unknown
40.3-38.9--N/A0K
77Mistral AI logoMistral Medium 3🇫🇷 Mistral AIOpen40.046.0---$0.40131K
78OpenAI logoGPT-4o-mini🇺🇸 OpenAI39.617.075.7--$0.15128K
79Mistral AI logoMistral Large 2407🇫🇷 Mistral AIOpen39.132.073.3--$2.00131K
80Alibaba logoQwen2.5 Coder 1.5B Instruct🇨🇳 AlibabaOpen38.8-38.1--N/A0K
81xAI logoGrok 3🇺🇸 xAI38.467.7---$3.00131K
82OpenAI logoo3 Mini🇺🇸 OpenAI38.469.4---$1.10200K
83Meta logoLlama 3.1 405B🇺🇸 MetaOpen38.034.579.3-82.7N/A0K
84Meta logoLlama 3.1 70B Instruct🇺🇸 MetaOpen37.825.673.5--$0.40131K
85Google DeepMind logoGemini 2.0 Flash Thinking (Jan 2025)🇺🇸 Google DeepMind37.742.8---N/A0K
86OpenAI logoGPT-4o (2024-11-20)🇺🇸 OpenAI37.732.379.1--$2.50128K
87Anthropic logoClaude 2🇺🇸 Anthropic37.212.971.3-87.5N/A0K
88Anthropic logoClaude 3.5 Haiku🇺🇸 Anthropic37.217.565.76.7-$0.80200K
89Mistral AI logoMistral Nemo🇫🇷 Mistral AIOpen37.26.5---$0.02131K
90Anthropic logoClaude Haiku 4.5🇺🇸 Anthropic37.161.6-5.9-$1.00200K
91Meta logoLlama 3.2 90B🇺🇸 MetaOpen36.121.473.7--N/A0K
92Google DeepMind logoGemma 2 9B🇺🇸 Google DeepMindOpen36.03.362.8--$0.038K
93OpenAI logoGPT-4.5🇺🇸 OpenAI35.958.3---N/A0K
94OpenAI logoGPT-4o (2024-08-06)🇺🇸 OpenAI35.632.379.1--$2.50128K
95OpenAI logoGPT-4.1 Nano🇺🇸 OpenAI35.231.9---$0.101.0M
96Meta logoLLaMA-13B🇺🇸 MetaOpen34.9-30.3-77.9N/A0K
97OpenAI logoo1-mini🇺🇸 OpenAI34.949.8---N/A0K
98
U
XGen-7B Unknown
33.9-15.1--N/A0K
99Anthropic logoClaude 3 Opus🇺🇸 Anthropic33.729.679.5--N/A0K
100xAI logoGrok-2 (Dec 2024)🇺🇸 xAI33.238.4---N/A0K
101Google DeepMind logoGemma 2 27B🇺🇸 Google DeepMindOpen32.915.367.6--$0.658K
102Meta logoLlama 3 70B Instruct🇺🇸 MetaOpen32.420.872.4--$0.518K
103
U
MPT-30B Unknown
31.7-30.5-73.6N/A0K
104
U
Yi 6B UnknownOpen
31.4-52.0--N/A0K
105Meta logoLlama 3 8B Instruct🇺🇸 MetaOpen30.81.458.4-67.7$0.038K
106Microsoft logoPhi 2🇺🇸 MicrosoftOpen30.2-44.5-45.2N/A0K
107Mistral AI logoMistral Large🇫🇷 Mistral AIOpen30.018.458.4--$2.00128K
108
U
Dolly 2.0-12b Unknown
29.2-1.6--N/A0K
109Google DeepMind logoGemma 2B🇺🇸 Google DeepMindOpen29.1-23.1-53.2N/A0K
110Meta logoLlama 3.3 70B Instruct (free)🇺🇸 MetaOpen29.129.981.7--Free66K
111Anthropic logoClaude 3 Haiku🇺🇸 Anthropic28.715.165.1--$0.25200K
112Anthropic logoClaude 3 Sonnet🇺🇸 Anthropic28.320.867.9--N/A0K
113Meta logoLlama 4 Maverick🇺🇸 MetaOpen28.056.0---$0.151.0M
114Meta logoLlama 3.1 8B Instruct🇺🇸 MetaOpen27.41.341.5--$0.0216K
115DeepSeek logoDeepSeek Coder 33B🇨🇳 DeepSeekOpen25.4-19.2--N/A0K
116
U
StarCoder 2 15B UnknownOpen
24.3-52.1--N/A0K
117
U
Baichuan1-7B Unknown
23.7-23.1--N/A0K
118Mistral AI logoMixtral 8x22B Instruct🇫🇷 Mistral AIOpen23.512.170.4--$2.0066K
119OpenAI logoCerebras-GPT-13B🇺🇸 OpenAI23.4-1.6--N/A0K
120Google DeepMind logoGemini 1.0 Pro🇺🇸 Google DeepMind21.111.960.0--N/A0K
121Anthropic logoClaude 2.1🇺🇸 Anthropic21.010.664.7--N/A0K
122
U
INTELLECT-1 Unknown
20.2-33.2--N/A0K
123Meta logoLlama 4 Scout🇺🇸 MetaOpen18.935.8---$0.08328K
124DeepSeek logoDeepSeek Coder 6.7B🇨🇳 DeepSeekOpen16.7-15.2--N/A0K
125
U
Magistral Small 1.1 Unknown
16.631.2---N/A0K
126Microsoft logoPhi-1.5🇺🇸 MicrosoftOpen16.3-16.8--N/A0K
127DeepSeek logoDeepSeek Coder 1.3B🇨🇳 DeepSeekOpen3.2-1.1--N/A0K
90+ Gold 80-89 70-79 60-69 <60Scores in % unless noted. Avg = unweighted mean across tested benchmarks.

Models ranked by factual knowledge across MMLU-Pro, MMLU, GPQA Diamond, SimpleQA Verified, and other knowledge benchmarks. These tests cover 57+ academic subjects from STEM to humanities.

Which AI model has the most knowledge?

Knowledge rankings depend on the benchmarks used. The leaderboard above ranks models by MMLU-Pro and other knowledge benchmarks covering 57+ academic subjects.

What is MMLU-Pro?

MMLU-Pro is a harder version of the Massive Multitask Language Understanding benchmark, with 10 answer choices instead of 4 and more reasoning-heavy questions across 57 subjects.