Benchmark · KnowledgeSettled

LiveBench · Mathematics

Updated 2026-04-07
Models tested
29
Top score
88.8
GPT-5.2-Codex
Median
74.3
min 36.0
Top-5 spread
σ 2.0
Settled

Best score over time · one chart, every benchmark

LIVEBENCH · MATHEMATICS29 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 25Sep 25Nov 25Feb 26Apr 26RELEASE DATE →benchgecko.ai/benchmark/livebench-mathematics · frontier
Frontier on LiveBench · Mathematics rose from 68.0 to 88.8 in 6 months · +20.7 points · latest leader GPT-5.2-Codex from OpenAI.
Pink dots = frontier records · 6 totalClick to open model page

Same category · related evaluations