Benchmark · KnowledgeSettled

HELM · Omni-MATH

Updated 2026-01-21
Models tested
34
Top score
72.2
GPT-5 Mini
Median
44.1
min 22.4
Top-5 spread
σ 2.6
Competitive

Best score over time · one chart, every benchmark

HELM · OMNI-MATH30 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 24Dec 24Apr 25Sep 25Jan 26RELEASE DATE →benchgecko.ai/benchmark/helm-omni-math · frontier
Frontier on HELM · Omni-MATH rose from 28.0 to 72.2 in 13 months · +44.2 points · latest leader GPT-5 Mini from OpenAI.
Pink dots = frontier records · 11 totalClick to open model page

Same category · related evaluations