#	Model	Score	Price
1	GPT-5.2-Codex· OpenAI	88.8	$1.75
2	GLM 5.1· z-ai	84.9	$0.95
3	Qwen3.6 Plus· Alibaba Qwen	83.7	$0.33
4	GPT-5.1-Codex-Max· OpenAI	83.7	$1.25
5	GLM 5· z-ai	83.5	$0.72
6	GLM 4.6· z-ai	81.1	$0.39
7	Kimi K2 Thinking· moonshotai	81.1	$0.60
8	MiniMax M2.7· minimax	80.5	$0.30
9	GPT-5.1-Codex· OpenAI	79.6	$1.25
10	MiniMax M2.5· minimax	77.4	$0.12
11	MiMo-V2-Pro· xiaomi	77.0	$1.00
12	GPT-5.1-Codex-Mini· OpenAI	76.3	$0.25
13	GLM 4.7· z-ai	76.0	$0.39
14	GPT-5 Mini· OpenAI	74.4	$0.25
15	Qwen3 Next 80B A3B Thinking· Alibaba Qwen	74.3	$0.10
16	Gemma 4 31B· Google DeepMind	73.9	$0.13
17	Qwen3 235B A22B Thinking 2507· Alibaba Qwen	73.4	$0.15
18	GLM 5V Turbo· z-ai	70.4	$1.20
19	Qwen3 Next 80B A3B Instruct· Alibaba Qwen	70.2	$0.09
20	gpt-oss-120b· OpenAI	68.9	$0.04
21	Qwen3 235B A22B Instruct 2507· Alibaba Qwen	68.0	$0.07
22	GPT-5 Nano· OpenAI	64.7	$0.05
23	DeepSeek V3.2 Exp· DeepSeek	64.4	$0.27
24	DeepSeek V3.2· DeepSeek	64.0	$0.26
25	GLM 4.6V· z-ai	62.5	$0.30
26	Devstral 2 2512· Mistral AI	52.5	$0.40
27	GPT-5.4 Mini· OpenAI	37.0	$0.75
28	Nemotron 3 Super· NVIDIA	36.4	$0.10
29	GPT-5.4 Nano· OpenAI	36.0	$0.20

Frequently asked

Pulled from the LiveBench · Mathematics dataset · updated daily

What does LiveBench · Mathematics measure?

LiveBench · Mathematics is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 36.0 to 88.8 out of 100.

Which model leads on LiveBench · Mathematics?

GPT-5.2-Codex from OpenAI leads LiveBench · Mathematics with a score of 88.8. The median score across 29 tested models is 74.3.

Is LiveBench · Mathematics saturated?

No · the top score is 88.8 out of 100 (89%). There is still meaningful room for improvement on LiveBench · Mathematics.

Does LiveBench · Mathematics predict performance on other benchmarks?

Yes · LiveBench · Mathematics scores correlate 0.91 with LiveBench · Overall across 29 shared models. Models that do well on LiveBench · Mathematics tend to do well on LiveBench · Overall.

How often is LiveBench · Mathematics data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · Mathematics appear as soon as they are published by Epoch AI or the model provider.