Benchmark · KnowledgeSettled

OpenCompass · HLE

Updated 2026-02-16
Models tested
32
Top score
28.6
DeepSeek V3.2 Speciale
Median
13.9
min 4.2
Top-5 spread
σ 1.2
Settled

Best score over time · one chart, every benchmark

OPENCOMPASS · HLE32 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Mar 25Jun 25Aug 25Nov 25Feb 26RELEASE DATE →benchgecko.ai/benchmark/oc-hle · frontier
Frontier on OpenCompass · HLE rose from 4.2 to 28.6 in 9 months · +24.4 points · latest leader DeepSeek V3.2 Speciale from DeepSeek.
Pink dots = frontier records · 8 totalClick to open model page

Same category · related evaluations