What does LiveBench · If measure?

LiveBench · If is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 13.5 to 68.5 out of 100.

Which model leads on LiveBench · If?

GLM 5.1 from z-ai leads LiveBench · If with a score of 68.5. The median score across 29 tested models is 43.2.
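The leader and median quoted above can be re-derived from the scores in the full rankings on this page. The following is a minimal sketch, assuming the 29 listed scores are the complete set behind those figures; the variable names are illustrative and not part of any BenchGecko API.

```python
from statistics import median

# Scores copied from the "Full rankings" table on this page, highest first.
scores = [
    68.5, 67.6, 67.1, 66.5, 64.2, 63.4, 62.0, 61.1, 59.0, 58.3,
    57.2, 55.3, 52.0, 50.3, 43.2, 41.5, 40.6, 35.7, 28.4, 27.2,
    26.2, 23.1, 21.7, 19.3, 19.2, 18.9, 17.1, 16.5, 13.5,
]

summary = {
    "models": len(scores),
    "top": max(scores),
    "median": median(scores),  # middle value of the sorted list (15th of 29)
    "min": min(scores),
}
print(summary)
```

With an odd number of models the median is simply the 15th-ranked score, so it corresponds to one specific row of the rankings rather than an average of two.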

Is LiveBench · If saturated?

No · the top score is 68.5 out of 100, which leaves 31.5 points of headroom on LiveBench · If.

Does LiveBench · If predict performance on other benchmarks?

Yes · LiveBench · If scores have a correlation of 0.83 with LiveBench · Overall scores across the 29 models tested on both. Models that do well on LiveBench · If tend to do well on LiveBench · Overall.
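The page does not say which correlation coefficient the 0.83 refers to. As a rough sketch, assuming it is a Pearson correlation over paired scores, the calculation could look like the code below. The `pearson` helper and the sample values are hypothetical placeholders for illustration, not real benchmark scores.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations, and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)

# Placeholder values for illustration only, NOT real benchmark scores.
subscore = [10.0, 20.0, 30.0, 40.0]
overall = [15.0, 22.0, 35.0, 41.0]
r = pearson(subscore, overall)
print(round(r, 2))
```

A value near 1 means the two score lists rise together almost linearly. A value such as 0.83 indicates a strong but imperfect relationship, so individual models can still rank differently on the two benchmarks.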

How often is LiveBench · If data refreshed?

BenchGecko pulls updates daily, so new model scores on LiveBench · If appear within a day of being published by Epoch AI or the model provider.

Benchmark · Knowledge · Settled

LiveBench · If

Name: LiveBench · If Benchmark
Creator: BenchGecko
License: https://creativecommons.org/licenses/by/4.0/

Updated 2026-04-07

Models tested: 29
Top score: 68.5 (GLM 5.1)
Median: 43.2 (min 13.5)
Top-5 spread: σ 1.4 (Settled)
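The page does not define how the top-5 spread is calculated. One reading that is consistent with the displayed σ 1.4 is the population standard deviation of the five highest scores, which comes to roughly 1.45. The sketch below compares that against the sample standard deviation. Treat the choice of formula as an assumption, not BenchGecko's documented method.

```python
from statistics import pstdev, stdev

# Five highest scores from the rankings on this page.
top5 = [68.5, 67.6, 67.1, 66.5, 64.2]

population_sigma = pstdev(top5)  # divides the squared deviations by n
sample_sigma = stdev(top5)       # divides the squared deviations by n - 1

print(f"population sigma: {population_sigma:.2f}")
print(f"sample sigma:     {sample_sigma:.2f}")
```

The sample version comes out noticeably higher (about 1.62), so only the population form lines up with the figure shown on the page.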

The Frontier

Best score over time · one chart, every benchmark

The frontier on LiveBench · If rose from 21.7 to 68.5 in 9 months, a gain of 46.8 points. The latest leader is GLM 5.1 from z-ai.

Frontier records: 7 total

Full rankings

29 models tested · sorted by score

#	Model	Provider	Score	Price
1	GLM 5.1	z-ai	68.5	$1.05
2	Gemma 4 31B	Google DeepMind	67.6	$0.13
3	GPT-5.1-Codex-Max	OpenAI	67.1	$1.25
4	GPT-5.2-Codex	OpenAI	66.5	$1.75
5	GPT-5 Mini	OpenAI	64.2	$0.25
6	GPT-5.1-Codex	OpenAI	63.4	$1.25
7	Kimi K2 Thinking	moonshotai	62.0	$0.60
8	MiniMax M2.7	minimax	61.1	$0.30
9	GPT-5.1-Codex-Mini	OpenAI	59.0	$0.25
10	Qwen3.6 Plus	Alibaba Qwen	58.3	$0.33
11	MiniMax M2.5	minimax	57.2	$0.15
12	GLM 5	z-ai	55.3	$0.60
13	GPT-5 Nano	OpenAI	52.0	$0.05
14	gpt-oss-120b	OpenAI	50.3	$0.04
15	MiMo-V2-Pro	xiaomi	43.2	$1.00
16	Qwen3 Next 80B A3B Thinking	Alibaba Qwen	41.5	$0.10
17	Qwen3 235B A22B Thinking 2507	Alibaba Qwen	40.6	$0.15
18	GLM 4.7	z-ai	35.7	$0.38
19	Nemotron 3 Super	NVIDIA	28.4	$0.09
20	GLM 5V Turbo	z-ai	27.2	$1.20
21	GLM 4.6	z-ai	26.2	$0.39
22	DeepSeek V3.2	DeepSeek	23.1	$0.25
23	Qwen3 235B A22B Instruct 2507	Alibaba Qwen	21.7	$0.07
24	DeepSeek V3.2 Exp	DeepSeek	19.3	$0.27
25	Qwen3 Next 80B A3B Instruct	Alibaba Qwen	19.2	$0.09
26	GPT-5.4 Mini	OpenAI	18.9	$0.75
27	GLM 4.6V	z-ai	17.1	$0.30
28	GPT-5.4 Nano	OpenAI	16.5	$0.20
29	Devstral 2 2512	Mistral AI	13.5	$0.40
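Because the table pairs each score with a price, one way to read it is to ask which models clear a score threshold at the lowest listed price. The sketch below does that for the top ten rows. The `cheapest_above` helper and the threshold of 60 are illustrative assumptions. The page does not state the unit behind the Price column, so the code only compares prices with each other.

```python
# (model, score, listed price) copied from the top ten rows of the table above.
rows = [
    ("GLM 5.1", 68.5, 1.05),
    ("Gemma 4 31B", 67.6, 0.13),
    ("GPT-5.1-Codex-Max", 67.1, 1.25),
    ("GPT-5.2-Codex", 66.5, 1.75),
    ("GPT-5 Mini", 64.2, 0.25),
    ("GPT-5.1-Codex", 63.4, 1.25),
    ("Kimi K2 Thinking", 62.0, 0.60),
    ("MiniMax M2.7", 61.1, 0.30),
    ("GPT-5.1-Codex-Mini", 59.0, 0.25),
    ("Qwen3.6 Plus", 58.3, 0.33),
]

def cheapest_above(rows, min_score):
    """Return rows scoring at least min_score, cheapest first.

    Ties on price are broken by the higher score.
    """
    eligible = [row for row in rows if row[1] >= min_score]
    return sorted(eligible, key=lambda row: (row[2], -row[1]))

shortlist = cheapest_above(rows, 60.0)
for model, score, price in shortlist:
    print(f"{model}: {score} at ${price:.2f}")
```

With a threshold of 60, Gemma 4 31B comes first: it is within a point of the leader while carrying the lowest listed price among the eight models that qualify.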