Benchmark · KnowledgeSettled

OpenCompass · IFEval

Updated 2026-02-16
Models tested
32
Top score
93.9
Kimi K2.5
Median
89.2
min 60.3
Top-5 spread
σ 0.8
Settled

Best score over time · one chart, every benchmark

OPENCOMPASS · IFEVAL32 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Mar 25Jun 25Aug 25Nov 25Feb 26RELEASE DATE →benchgecko.ai/benchmark/oc-ifeval · frontier
Frontier on OpenCompass · IFEval rose from 81.0 to 93.9 in 11 months · +12.9 points · latest leader Kimi K2.5 from moonshotai.
Pink dots = frontier records · 7 totalClick to open model page

Same category · related evaluations