#	Model	Score	Price
1	GLM 5· z-ai	77.5	$0.72
2	GPT-5.1-Codex-Max· OpenAI	75.4	$1.25
3	Qwen3.6 Plus· Alibaba Qwen	75.0	$0.33
4	GPT-5.2-Codex· OpenAI	73.7	$1.75
5	GLM 5.1· z-ai	71.8	$0.95
6	Gemma 4 31B· Google DeepMind	71.3	$0.13
7	Qwen3 235B A22B Thinking 2507· Alibaba Qwen	69.5	$0.15
8	GPT-5.1-Codex· OpenAI	69.5	$1.25
9	GPT-5 Mini· OpenAI	69.2	$0.25
10	MiMo-V2-Pro· xiaomi	69.1	$1.00
11	MiniMax M2.7· minimax	66.8	$0.30
12	Kimi K2 Thinking· moonshotai	66.5	$0.60
13	Qwen3 Next 80B A3B Instruct· Alibaba Qwen	66.3	$0.09
14	Qwen3 235B A22B Instruct 2507· Alibaba Qwen	66.1	$0.07
15	DeepSeek V3.2 Exp· DeepSeek	65.6	$0.27
16	GLM 4.7· z-ai	65.2	$0.39
17	DeepSeek V3.2· DeepSeek	64.2	$0.26
18	GPT-5.1-Codex-Mini· OpenAI	63.0	$0.25
19	GLM 5V Turbo· z-ai	62.3	$1.20
20	GLM 4.6· z-ai	59.0	$0.39
21	Qwen3 Next 80B A3B Thinking· Alibaba Qwen	56.3	$0.10
22	MiniMax M2.5· minimax	55.1	$0.12
23	GLM 4.6V· z-ai	49.7	$0.30
24	gpt-oss-120b· OpenAI	48.6	$0.04
25	GPT-5 Nano· OpenAI	47.7	$0.05
26	Devstral 2 2512· Mistral AI	45.7	$0.40
27	GPT-5.4 Mini· OpenAI	41.8	$0.75
28	Nemotron 3 Super· NVIDIA	30.0	$0.10
29	GPT-5.4 Nano· OpenAI	28.7	$0.20

Frequently asked

Pulled from the LiveBench · Language dataset · updated daily

What does LiveBench · Language measure?

LiveBench · Language is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 28.7 to 77.5 out of 100.

Which model leads on LiveBench · Language?

GLM 5 from z-ai leads LiveBench · Language with a score of 77.5. The median score across 29 tested models is 65.6.

Is LiveBench · Language saturated?

No · the top score is 77.5 out of 100 (78%). There is still meaningful room for improvement on LiveBench · Language.

Does LiveBench · Language predict performance on other benchmarks?

Yes · LiveBench · Language scores correlate 0.88 with LiveBench · Mathematics across 29 shared models. Models that do well on LiveBench · Language tend to do well on LiveBench · Mathematics.

How often is LiveBench · Language data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · Language appear as soon as they are published by Epoch AI or the model provider.