#	Model · Provider	Score	Price
1	GPT-5.2-Codex · OpenAI	83.6	$1.75
2	GPT-5.1-Codex-Max · OpenAI	81.4	$1.25
3	Qwen3.6 Plus · Alibaba Qwen	78.2	$0.33
4	GPT-5 Mini · OpenAI	76.1	$0.25
5	DeepSeek V3.2 · DeepSeek	75.7	$0.26
6	GLM 5.1 · z-ai	75.4	$0.95
7	GPT-5.4 Mini · OpenAI	74.7	$0.75
8	GLM 5V Turbo · z-ai	73.9	$1.20
9	GLM 5 · z-ai	73.6	$0.72
10	DeepSeek V3.2 Exp · DeepSeek	73.2	$0.27
11	GLM 4.7 · z-ai	73.1	$0.39
12	GPT-5.1-Codex · OpenAI	71.8	$1.25
13	GLM 4.6 · z-ai	71.0	$0.39
14	MiniMax M2.5 · minimax	70.7	$0.12
15	GPT-5.1-Codex-Mini · OpenAI	69.9	$0.25
16	Qwen3 235B A22B Instruct 2507 · Alibaba Qwen	69.6	$0.07
17	Qwen3 235B A22B Thinking 2507 · Alibaba Qwen	69.0	$0.15
18	MiMo-V2-Pro · xiaomi	68.8	$1.00
19	Qwen3 Next 80B A3B Instruct · Alibaba Qwen	68.2	$0.09
20	Kimi K2 Thinking · moonshotai	67.4	$0.60
21	GPT-5 Nano · OpenAI	67.4	$0.05
22	Devstral 2 2512 · Mistral AI	66.8	$0.40
23	GLM 4.6V · z-ai	64.2	$0.30
24	GPT-5.4 Nano · OpenAI	61.9	$0.20
25	Qwen3 Next 80B A3B Thinking · Alibaba Qwen	60.7	$0.10
26	Gemma 4 31B · Google DeepMind	60.3	$0.13
27	gpt-oss-120b · OpenAI	60.2	$0.04
28	MiniMax M2.7 · minimax	54.9	$0.30
29	Nemotron 3 Super · NVIDIA	54.1	$0.10

Frequently asked questions

Pulled from the LiveBench · Coding dataset, updated daily.

What does LiveBench · Coding measure?

LiveBench · Coding is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 54.1 to 83.6 out of 100.

Which model leads on LiveBench · Coding?

GPT-5.2-Codex from OpenAI leads LiveBench · Coding with a score of 83.6. The median score across 29 tested models is 69.9.
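The range and median quoted in these answers can be checked directly against the table. The following is a minimal sketch, assuming the scores are copied by hand from the table in rank order; the `summarize` helper is a hypothetical name used for illustration and is not part of BenchGecko.

```python
from statistics import median

# Scores copied from the leaderboard table, in rank order (29 models).
scores = [
    83.6, 81.4, 78.2, 76.1, 75.7, 75.4, 74.7, 73.9, 73.6, 73.2,
    73.1, 71.8, 71.0, 70.7, 69.9, 69.6, 69.0, 68.8, 68.2, 67.4,
    67.4, 66.8, 64.2, 61.9, 60.7, 60.3, 60.2, 54.9, 54.1,
]


def summarize(values):
    """Return the count, lowest, highest, and median of a list of scores.

    Hypothetical helper for illustration only.
    """
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "median": median(values),
    }


summary = summarize(scores)
print(summary)
```

With an odd number of models (29), the median is simply the 15th score in rank order, which is the GPT-5.1-Codex-Mini row in the table.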

Is LiveBench · Coding saturated?

No. The top score is 83.6 out of 100, so there is still meaningful room for improvement on LiveBench · Coding.

Does LiveBench · Coding predict performance on other benchmarks?

Yes. LiveBench · Coding scores correlate at 0.74 with Chatbot Arena Elo · Overall across 15 shared models, so models that do well on LiveBench · Coding tend to do well on Chatbot Arena Elo · Overall.
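A cross-benchmark correlation like this one is typically computed over only the models that appear in both datasets. The page does not say which coefficient is used, so the sketch below assumes a Pearson correlation. The function names and the placeholder model names and values are hypothetical; they are not real LiveBench or Chatbot Arena scores.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def correlate_shared(bench_a, bench_b):
    """Correlate two {model: score} mappings over the models they share.

    Returns (number_of_shared_models, correlation).
    """
    shared = sorted(set(bench_a) & set(bench_b))
    xs = [bench_a[m] for m in shared]
    ys = [bench_b[m] for m in shared]
    return len(shared), pearson(xs, ys)


# Placeholder data, NOT real benchmark scores: it only demonstrates that
# models missing from either side are dropped before correlating.
bench_a = {"model_a": 1.0, "model_b": 2.0, "model_c": 3.0, "model_d": 9.0}
bench_b = {"model_a": 10.0, "model_b": 20.0, "model_c": 30.0, "model_e": 5.0}

n_shared, r = correlate_shared(bench_a, bench_b)
print(n_shared, r)
```

Restricting the calculation to shared models is why the figure above is reported over 15 models rather than all 29 on this leaderboard.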

How often is LiveBench · Coding data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · Coding appear as soon as they are published by Epoch AI or the model provider.