Benchmark · KnowledgeSettled

LiveBench · If

Updated 2026-04-07
Models tested
29
Top score
68.5
GLM 5.1
Median
43.2
min 13.5
Top-5 spread
σ 1.4
Settled

Best score over time · one chart, every benchmark

LIVEBENCH · IF29 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 25Sep 25Nov 25Feb 26Apr 26RELEASE DATE →benchgecko.ai/benchmark/livebench-if · frontier
Frontier on LiveBench · If rose from 21.7 to 68.5 in 9 months · +46.7 points · latest leader GLM 5.1 from z-ai.
Pink dots = frontier records · 7 totalClick to open model page

Same category · related evaluations