Artificial Analysis · Coding Index
The Frontier
Best score over time · one chart, every benchmark
Distribution
Where models cluster
Correlated benchmarks
Pearson r · original research
Benchmarks that track with Artificial Analysis · Coding Index
Pearson correlation computed across the models scored on both benchmarks. Values closer to 1 mean scores on one benchmark strongly predict scores on the other.
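As a rough illustration of the calculation described above, the sketch below computes Pearson r over only the models that appear in both score tables. The function names, the dictionary layout, and the sample scores are assumptions made for this example; they are not taken from the BenchGecko dataset or its pipeline.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return cov / sqrt(var_x * var_y)


def shared_model_correlation(scores_a, scores_b):
    """Correlate two benchmarks using only models scored on both.

    Each argument maps a model name to its score (hypothetical layout).
    Returns (r, number_of_shared_models).
    """
    shared = sorted(set(scores_a) & set(scores_b))
    xs = [scores_a[m] for m in shared]
    ys = [scores_b[m] for m in shared]
    return pearson_r(xs, ys), len(shared)


# Hypothetical placeholder scores, for illustration only.
benchmark_a = {"model-1": 50.0, "model-2": 41.5, "model-3": 30.2, "model-4": 12.0}
benchmark_b = {"model-1": 38.0, "model-2": 30.1, "model-3": 24.9, "model-5": 9.0}

r, n_shared = shared_model_correlation(benchmark_a, benchmark_b)
print(f"r = {r:.2f} across {n_shared} shared models")
```

Restricting the calculation to shared models is why the number of models behind a correlation can be much smaller than the number tested on either benchmark alone.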
Full rankings
66 models tested · sorted by score
Frequently asked
Pulled from the Artificial Analysis · Coding Index dataset · updated daily
What does Artificial Analysis · Coding Index measure?
Artificial Analysis · Coding Index is a knowledge benchmark in the BenchGecko catalog. 66 AI models have been tested on it. Scores range from 0.8 to 57.3 out of 60.
Which model leads on Artificial Analysis · Coding Index?
GPT-5.4 from OpenAI leads Artificial Analysis · Coding Index with a score of 57.3. The median score across 66 tested models is 29.4.
Is Artificial Analysis · Coding Index saturated?
Yes · the top model on Artificial Analysis · Coding Index has reached 57.3 out of 60, about 95.5% of the maximum and within 5% of the theoretical ceiling. This benchmark is approaching saturation and may be replaced by a harder successor.
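The arithmetic behind that answer can be sketched as follows. The 5% threshold comes from the wording above; the function names are hypothetical, and treating the gap as a fraction of the maximum score is an assumption about how the threshold is applied.

```python
def ceiling_gap(top_score, max_score):
    """Fraction of the maximum score still unclaimed by the top model."""
    return (max_score - top_score) / max_score


def is_near_saturation(top_score, max_score, threshold=0.05):
    """True when the top score is within `threshold` of the ceiling."""
    return ceiling_gap(top_score, max_score) <= threshold


# Figures quoted on this page: top score 57.3 out of a maximum of 60.
gap = ceiling_gap(57.3, 60.0)
print(f"gap to ceiling: {gap:.1%}")
print("near saturation:", is_near_saturation(57.3, 60.0))
```

By the same measure, the median score of 29.4 sits far from the ceiling, so the saturation claim applies to the frontier models rather than to the field as a whole.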
Does Artificial Analysis · Coding Index predict performance on other benchmarks?
Yes · Artificial Analysis · Coding Index scores have a Pearson correlation of 0.98 with OpenCompass · HLE across 11 shared models. Models that do well on Artificial Analysis · Coding Index tend to do well on OpenCompass · HLE.
How often is Artificial Analysis · Coding Index data refreshed?
BenchGecko pulls updates daily. New model scores on Artificial Analysis · Coding Index appear as soon as they are published by Epoch AI or the model provider.
Top on Artificial Analysis · Coding Index
GPT-5.4 · 57.3
Gemini 3.1 Pro Preview · 55.5
GPT-5.3-Codex · 53.1
GPT-5.4 Mini · 51.5
Claude Sonnet 4.6 · 50.9
More knowledge benchmarks
Same category · related evaluations