#	Model	Score	Price
1	GPT-5.2-Codex· OpenAI	74.3	$1.75
2	GPT-5.1-Codex-Max· OpenAI	72.0	$1.25
3	Qwen3.6 Plus· Alibaba Qwen	70.8	$0.33
4	GLM 5.1· z-ai	70.2	$0.95
5	GLM 5· z-ai	68.8	$0.72
6	GPT-5.1-Codex· OpenAI	68.6	$1.25
7	MiniMax M2.7· minimax	63.5	$0.30
8	Gemma 4 31B· Google DeepMind	61.6	$0.13
9	Kimi K2 Thinking· moonshotai	61.6	$0.60
10	GPT-5 Mini· OpenAI	61.0	$0.25
11	GPT-5.1-Codex-Mini· OpenAI	60.4	$0.25
12	MiniMax M2.5· minimax	60.1	$0.12
13	MiMo-V2-Pro· xiaomi	58.1	$1.00
14	GLM 4.7· z-ai	58.1	$0.39
15	GLM 4.6· z-ai	55.2	$0.39
16	Qwen3 235B A22B Thinking 2507· Alibaba Qwen	53.0	$0.15
17	DeepSeek V3.2· DeepSeek	51.8	$0.26
18	Qwen3 Next 80B A3B Thinking· Alibaba Qwen	50.4	$0.10
19	DeepSeek V3.2 Exp· DeepSeek	49.9	$0.27
20	GLM 5V Turbo· z-ai	49.6	$1.20
21	Qwen3 235B A22B Instruct 2507· Alibaba Qwen	48.8	$0.07
22	GPT-5 Nano· OpenAI	48.6	$0.05
23	Qwen3 Next 80B A3B Instruct· Alibaba Qwen	48.4	$0.09
24	gpt-oss-120b· OpenAI	46.1	$0.04
25	Devstral 2 2512· Mistral AI	41.2	$0.40
26	GLM 4.6V· z-ai	40.1	$0.30
27	GPT-5.4 Mini· OpenAI	37.0	$0.75
28	Nemotron 3 Super· NVIDIA	32.5	$0.10
29	GPT-5.4 Nano· OpenAI	32.4	$0.20

Frequently asked

Pulled from the LiveBench · Overall dataset · updated daily

What does LiveBench · Overall measure?

LiveBench · Overall is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 32.4 to 74.3 out of 100.

Which model leads on LiveBench · Overall?

GPT-5.2-Codex from OpenAI leads LiveBench · Overall with a score of 74.3. The median score across 29 tested models is 55.2.

Is LiveBench · Overall saturated?

No · the top score is 74.3 out of 100 (74%). There is still meaningful room for improvement on LiveBench · Overall.

Does LiveBench · Overall predict performance on other benchmarks?

Yes · LiveBench · Overall scores correlate 0.92 with LiveBench · Reasoning across 29 shared models. Models that do well on LiveBench · Overall tend to do well on LiveBench · Reasoning.

How often is LiveBench · Overall data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · Overall appear as soon as they are published by Epoch AI or the model provider.