Beta
Benchmark · Knowledge

OpenCompass · LiveCodeBenchV6

Updated 2026-02-16
Models tested
32
Top score
86.2
GLM 5
Median
67.3
min 30.8
Top-5 spread
σ 1.7
settled

Best score over time · one chart, every benchmark

OPENCOMPASS · LIVECODEBENCHV632 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Mar 25Jun 25Aug 25Nov 25Feb 26RELEASE DATE →benchgecko.ai/benchmark/oc-livecodebenchv6 · frontier
Frontier on OpenCompass · LiveCodeBenchV6 rose from 30.8 to 86.2 in 11 months · +55.4 points · latest leader GLM 5 from z-ai.
Pink dots = frontier records · 9 totalClick to open model page

Where models cluster

SCORE DISTRIBUTION0–1010–2020–30330–40540–50550–60460–70970–80680–9090–100MEDIAN · 67.3SCORE BUCKET → (0 TO 100)MODELSbenchgecko.ai

Pearson r · original research

32 models tested · sorted by score

Pulled from the OpenCompass · LiveCodeBenchV6 dataset · updated daily

What does OpenCompass · LiveCodeBenchV6 measure?

OpenCompass · LiveCodeBenchV6 is a knowledge benchmark in the BenchGecko catalog. 32 AI models have been tested on it. Scores range from 30.8 to 86.2 out of 100.

Which model leads on OpenCompass · LiveCodeBenchV6?

GLM 5 from z-ai leads OpenCompass · LiveCodeBenchV6 with a score of 86.2. The median score across 32 tested models is 67.3.

Is OpenCompass · LiveCodeBenchV6 saturated?

No · the top score is 86.2 out of 100 (86%). There is still meaningful room for improvement on OpenCompass · LiveCodeBenchV6.

Does OpenCompass · LiveCodeBenchV6 predict performance on other benchmarks?

Yes · OpenCompass · LiveCodeBenchV6 scores correlate 0.96 with Fiction.LiveBench across 6 shared models. Models that do well on OpenCompass · LiveCodeBenchV6 tend to do well on Fiction.LiveBench.

How often is OpenCompass · LiveCodeBenchV6 data refreshed?

BenchGecko pulls updates daily. New model scores on OpenCompass · LiveCodeBenchV6 appear as soon as they are published by Epoch AI or the model provider.

Same category · related evaluations