
Chatbot Arena Elo · Coding

Updated 2026-04-07
Models tested: 27
Top score: 1546.2 · Claude Opus 4.6 (Fast)
Median: 1386.1 · min 1182.2
Top-5 spread: σ 38.4 · wide open

Best score over time · one chart, every benchmark

[Chart: Chatbot Arena Elo · Coding · 26 models · frontier running max. Y-axis: score, 0 to 1600. X-axis: release date, Jun 25 to Apr 26. Source: benchgecko.ai/benchmark/arena-elo-coding]
Frontier on Chatbot Arena Elo · Coding rose from 1202.0 to 1546.2 in 10 months · +344.2 points · latest leader Claude Opus 4.6 (Fast) from Anthropic.
Pink dots = frontier records · 6 total
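A frontier "running max" like the one charted above can be sketched as follows. This is a minimal illustration, not BenchGecko's actual pipeline: the function name `frontier_records`, the model names, and the release dates are hypothetical, and only the endpoint scores 1202.0 and 1546.2 come from the page.

```python
from datetime import date

def frontier_records(results):
    """Return the entries that set a new best score, in release order.

    `results` is an iterable of (release_date, model_name, score) tuples.
    """
    records = []
    best = float("-inf")
    for released, name, score in sorted(results, key=lambda r: r[0]):
        if score > best:  # strictly higher than every earlier release
            best = score
            records.append((released, name, score))
    return records

# Hypothetical sample: names, dates, and the middle scores are made up.
sample = [
    (date(2025, 6, 1), "model-a", 1202.0),
    (date(2025, 8, 1), "model-b", 1190.5),   # below the frontier, not a record
    (date(2025, 11, 1), "model-c", 1410.0),
    (date(2026, 4, 1), "model-d", 1546.2),
]
records = frontier_records(sample)
gain = records[-1][2] - records[0][2]  # last record minus first record
```

With this sample, three of the four entries are frontier records, and the gain between the first and last record matches the +344.2 points quoted above.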

Where models cluster

[Chart: score distribution · models per score bucket, 0 to 1600 in buckets of 160 · median 1386.1]
1120–1280: 5 models
1280–1440: 16 models
1440–1600: 6 models
All buckets below 1120 are empty.
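The bucketing behind a distribution like this one can be sketched in a few lines. The 160-point bucket width and the 0 to 1600 range are read off the chart; the helper names `bucket_label` and `histogram` are hypothetical, and the only scores used are the three the page states (minimum, median, top).

```python
from collections import Counter

BUCKET_WIDTH = 160   # 0 to 1600 split into ten buckets, as on the chart
SCALE_MAX = 1600

def bucket_label(score):
    """Return the 'low–high' label of the 160-point bucket holding `score`."""
    # Clamp so a score of exactly 1600 lands in the last bucket.
    index = min(int(score // BUCKET_WIDTH), SCALE_MAX // BUCKET_WIDTH - 1)
    low = index * BUCKET_WIDTH
    return f"{low}–{low + BUCKET_WIDTH}"

def histogram(scores):
    """Count how many scores fall into each bucket."""
    return Counter(bucket_label(s) for s in scores)

# The three scores the page states: minimum, median, and top.
counts = histogram([1182.2, 1386.1, 1546.2])
```

Each of the three stated scores falls into a different one of the three populated buckets, which is consistent with the distribution shown.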


27 models tested · sorted by score

Pulled from the Chatbot Arena Elo · Coding dataset · updated daily

What does Chatbot Arena Elo · Coding measure?

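Chatbot Arena Elo · Coding is listed as a knowledge benchmark in the BenchGecko catalog. Chatbot Arena ratings are derived from head-to-head human preference votes between model responses, and this entry covers the coding category. 27 AI models have been tested on it. Scores range from 1182.2 to 1546.2 on a scale that BenchGecko charts from 0 to 1600.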

Which model leads on Chatbot Arena Elo · Coding?

Claude Opus 4.6 (Fast) from Anthropic leads Chatbot Arena Elo · Coding with a score of 1546.2. The median score across 27 tested models is 1386.1.
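To give the gap between the top score and the median some intuition, the sketch below applies the classic Elo expected-score formula. It assumes the standard logistic curve with base 10 and a 400-point scale; the page does not state how these ratings are fitted, so treat the result as a rough reading rather than the leaderboard's own figure. The function name `expected_win_rate` is hypothetical.

```python
def expected_win_rate(rating_a, rating_b, scale=400.0):
    """Classic Elo expected score of A against B (a tie counts as half a win)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))

top, median = 1546.2, 1386.1          # both values are from the page
p = expected_win_rate(top, median)    # gap of 160.1 rating points
```

Under these assumptions, the leader would be preferred over a median-rated model in roughly 72% of head-to-head comparisons.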

Is Chatbot Arena Elo · Coding saturated?

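By the 5% threshold used here, yes · the top model on Chatbot Arena Elo · Coding has reached 1546.2 out of 1600, a gap of about 3.4%. Elo ratings are relative and have no fixed maximum, so 1600 is the upper bound of the charted scale rather than a hard ceiling, and the saturation flag should be read with that caveat. If the trend continues, this benchmark may be replaced by a harder successor.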
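The arithmetic behind the "within 5%" statement can be checked directly. The sketch takes the page's two numbers (top score 1546.2, scale maximum 1600) and its 5% threshold as given; the function name `headroom_fraction` is hypothetical.

```python
def headroom_fraction(top_score, scale_max):
    """Fraction of the scale that remains above the top score."""
    return (scale_max - top_score) / scale_max

gap = headroom_fraction(1546.2, 1600)   # 53.8 points of headroom out of 1600
is_within_5_percent = gap <= 0.05
```

The remaining headroom is about 3.4% of the scale, so the 5% condition holds with some margin.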

Does Chatbot Arena Elo · Coding predict performance on other benchmarks?

Yes · Chatbot Arena Elo · Coding scores correlate with SWE-bench Verified at Pearson r = 0.95 across 10 shared models. Models that do well on Chatbot Arena Elo · Coding tend to do well on SWE-bench Verified, though with only 10 models in common the estimate is approximate.
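A cross-benchmark correlation of this kind can be sketched as: keep only the models scored on both benchmarks, then compute Pearson r over the paired scores. The helper names and all scores below are hypothetical toy values (chosen to be perfectly linear), not BenchGecko data, and the page does not describe its exact method.

```python
import math

def shared_scores(bench_a, bench_b):
    """Pair up scores for the models that appear in both dicts."""
    shared = sorted(bench_a.keys() & bench_b.keys())
    return [bench_a[m] for m in shared], [bench_b[m] for m in shared]

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy scores; only three models appear in both benchmarks.
arena = {"model-a": 1200.0, "model-b": 1300.0, "model-c": 1400.0, "model-d": 1500.0}
swe = {"model-b": 40.0, "model-c": 50.0, "model-d": 60.0, "model-e": 70.0}
xs, ys = shared_scores(arena, swe)
r = pearson_r(xs, ys)
```

Restricting to shared models first matters: models scored on only one benchmark contribute nothing to the pairing, which is why the reported figure covers 10 models rather than all 27.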

How often is Chatbot Arena Elo · Coding data refreshed?

BenchGecko pulls updates daily. New model scores on Chatbot Arena Elo · Coding appear in the next daily pull after they are published by Epoch AI or the model provider.
