#	Model	Score	Price
1	GLM 5.1· z-ai	68.5	$0.95
2	Gemma 4 31B· Google DeepMind	67.6	$0.13
3	GPT-5.1-Codex-Max· OpenAI	67.1	$1.25
4	GPT-5.2-Codex· OpenAI	66.5	$1.75
5	GPT-5 Mini· OpenAI	64.2	$0.25
6	GPT-5.1-Codex· OpenAI	63.4	$1.25
7	Kimi K2 Thinking· moonshotai	62.0	$0.60
8	MiniMax M2.7· minimax	61.1	$0.30
9	GPT-5.1-Codex-Mini· OpenAI	59.0	$0.25
10	Qwen3.6 Plus· Alibaba Qwen	58.3	$0.33
11	MiniMax M2.5· minimax	57.2	$0.12
12	GLM 5· z-ai	55.3	$0.72
13	GPT-5 Nano· OpenAI	52.0	$0.05
14	gpt-oss-120b· OpenAI	50.3	$0.04
15	MiMo-V2-Pro· xiaomi	43.2	$1.00
16	Qwen3 Next 80B A3B Thinking· Alibaba Qwen	41.5	$0.10
17	Qwen3 235B A22B Thinking 2507· Alibaba Qwen	40.6	$0.15
18	GLM 4.7· z-ai	35.7	$0.39
19	Nemotron 3 Super· NVIDIA	28.4	$0.10
20	GLM 5V Turbo· z-ai	27.2	$1.20
21	GLM 4.6· z-ai	26.2	$0.39
22	DeepSeek V3.2· DeepSeek	23.1	$0.26
23	Qwen3 235B A22B Instruct 2507· Alibaba Qwen	21.7	$0.07
24	DeepSeek V3.2 Exp· DeepSeek	19.3	$0.27
25	Qwen3 Next 80B A3B Instruct· Alibaba Qwen	19.2	$0.09
26	GPT-5.4 Mini· OpenAI	18.9	$0.75
27	GLM 4.6V· z-ai	17.1	$0.30
28	GPT-5.4 Nano· OpenAI	16.5	$0.20
29	Devstral 2 2512· Mistral AI	13.5	$0.40

Frequently asked

Pulled from the LiveBench · If dataset · updated daily

What does LiveBench · If measure?

LiveBench · If is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 13.5 to 68.5 out of 100.

Which model leads on LiveBench · If?

GLM 5.1 from z-ai leads LiveBench · If with a score of 68.5. The median score across 29 tested models is 43.2.

Is LiveBench · If saturated?

No · the top score is 68.5 out of 100 (68%). There is still meaningful room for improvement on LiveBench · If.

Does LiveBench · If predict performance on other benchmarks?

Yes · LiveBench · If scores correlate 0.83 with LiveBench · Overall across 29 shared models. Models that do well on LiveBench · If tend to do well on LiveBench · Overall.

How often is LiveBench · If data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · If appear as soon as they are published by Epoch AI or the model provider.