Benchmark · KnowledgeSettled

LiveBench · Language

Updated 2026-04-07
Models tested
29
Top score
77.5
GLM 5
Median
65.6
min 28.7
Top-5 spread
σ 1.9
Settled

Best score over time · one chart, every benchmark

LIVEBENCH · LANGUAGE29 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 25Sep 25Nov 25Feb 26Apr 26RELEASE DATE →benchgecko.ai/benchmark/livebench-language · frontier
Frontier on LiveBench · Language rose from 66.1 to 77.5 in 7 months · +11.5 points · latest leader GLM 5 from z-ai.
Pink dots = frontier records · 4 totalClick to open model page

Same category · related evaluations