#	Model · Provider	Score	Price
1	Claude Opus 4.6 (Fast) · Anthropic	1546.2	$30.00
2	Claude Opus 4.6 · Anthropic	1542.9	$5.00
3	Claude Sonnet 4.6 · Anthropic	1521.0	$3.00
4	Claude Opus 4.5 · Anthropic	1465.2	$5.00
5	Gemini 3.1 Pro Preview · Google DeepMind	1455.7	$2.00
6	GLM 5 · z-ai	1441.0	$0.72
7	GLM 4.7 · z-ai	1439.2	$0.39
8	Gemini 3 Pro · Google DeepMind	1437.6	n/a
9	Gemini 3 Flash Preview · Google DeepMind	1436.4	$0.50
10	MiMo-V2-Pro · xiaomi	1433.4	$1.00
11	MiniMax M2.7 · minimax	1427.7	$0.30
12	GPT-5.2 · OpenAI	1403.1	$1.75
13	MiniMax M2.5 · minimax	1396.3	$0.12
14	Qwen3.5 397B A17B · Alibaba Qwen	1386.1	$0.39
15	Qwen3.5-122B-A10B · Alibaba Qwen	1362.3	$0.26
16	GLM 4.6 · z-ai	1353.7	$0.39
17	Qwen3.5-27B · Alibaba Qwen	1344.0	$0.20
18	GPT-5.1 · OpenAI	1338.8	$1.25
19	MiMo-V2-Flash · xiaomi	1336.5	$0.09
20	DeepSeek V3.2 · DeepSeek	1326.9	$0.26
21	MiniMax M2 · minimax	1303.3	$0.26
22	DeepSeek V3.2 Exp · DeepSeek	1285.5	$0.27
23	Qwen3.5-35B-A3B · Alibaba Qwen	1246.5	$0.16
24	Gemini 3.1 Flash Lite Preview · Google DeepMind	1238.1	$0.25
25	Qwen3.5-Flash · Alibaba Qwen	1235.4	$0.07
26	Gemini 2.5 Pro · Google DeepMind	1202.0	$1.25
27	Mercury 2 · inception	1182.2	$0.25

Frequently asked

Pulled from the Chatbot Arena Elo · Coding dataset · updated daily

What does Chatbot Arena Elo · Coding measure?

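Chatbot Arena Elo · Coding is listed as a knowledge benchmark in the BenchGecko catalog. Its scores are Elo ratings derived from head-to-head human preference votes on coding prompts, so a score shows how a model fares against the other models rather than a share of questions answered correctly. 27 AI models have been tested on it, with scores ranging from 1182.2 to 1546.2 against a catalog maximum of 1600.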
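
To make the rating scale concrete, here is a minimal sketch of how an Elo gap translates into an expected head-to-head preference rate. It assumes the standard Elo logistic curve (base 10, scale 400); the page does not state which exact model the leaderboard fits, and the function name `expected_win_probability` is illustrative.

```python
def expected_win_probability(rating_a: float, rating_b: float,
                             scale: float = 400.0) -> float:
    """Expected rate at which A is preferred over B.

    Assumption: the standard Elo logistic curve (base 10, scale 400).
    The leaderboard's actual fitting procedure is not described on the page.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Ratings taken from the table above.
top = 1546.2        # Claude Opus 4.6 (Fast)
runner_up = 1542.9  # Claude Opus 4.6
bottom = 1182.2     # Mercury 2

p_top_vs_runner_up = expected_win_probability(top, runner_up)
p_top_vs_bottom = expected_win_probability(top, bottom)

print(f"top vs runner-up: {p_top_vs_runner_up:.3f}")
print(f"top vs bottom:    {p_top_vs_bottom:.3f}")
```

Under this assumption, the 3.3-point gap between the top two entries is close to a coin flip, while the 364-point gap between first and last place implies a strong preference for the leader.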

Which model leads on Chatbot Arena Elo · Coding?

Claude Opus 4.6 (Fast) from Anthropic leads Chatbot Arena Elo · Coding with a score of 1546.2. The median score across 27 tested models is 1386.1.
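
The leader and median quoted above can be reproduced from the table with a few lines of Python. This is only a sketch: the scores are copied by hand from the 27 rows, and the variable names are illustrative.

```python
from statistics import median

# Scores copied from the 27 table rows, in rank order.
scores = [
    1546.2, 1542.9, 1521.0, 1465.2, 1455.7, 1441.0, 1439.2, 1437.6, 1436.4,
    1433.4, 1427.7, 1403.1, 1396.3, 1386.1, 1362.3, 1353.7, 1344.0, 1338.8,
    1336.5, 1326.9, 1303.3, 1285.5, 1246.5, 1238.1, 1235.4, 1202.0, 1182.2,
]

model_count = len(scores)
top_score = max(scores)
lowest_score = min(scores)
# With an odd number of entries the median is the middle (14th) value.
median_score = median(scores)

print(model_count, top_score, lowest_score, median_score)
# → 27 1546.2 1182.2 1386.1
```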

Is Chatbot Arena Elo · Coding saturated?

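By BenchGecko's measure, yes · the top model on Chatbot Arena Elo · Coding has reached 1546.2 against the catalog's 1600 maximum, about 3.4% below it and inside the 5% band used to flag saturation. Elo ratings are relative and have no fixed upper bound, however, so 1600 works as a reference point rather than a hard ceiling, and a stronger model can still raise the top score. The benchmark is flagged as approaching saturation and may be replaced by a harder successor.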
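
The saturation check reduces to simple arithmetic. The sketch below assumes the 1600 maximum and the 5% threshold quoted in the answer; the function names `headroom_fraction` and `is_near_ceiling` are hypothetical and not part of any BenchGecko API.

```python
def headroom_fraction(top_score: float, ceiling: float) -> float:
    """Fraction of the scale that remains above the top score."""
    return (ceiling - top_score) / ceiling


def is_near_ceiling(top_score: float, ceiling: float,
                    threshold: float = 0.05) -> bool:
    """True when the top score is within `threshold` of the ceiling."""
    return headroom_fraction(top_score, ceiling) <= threshold


TOP_SCORE = 1546.2  # leader's score from the table
CEILING = 1600.0    # maximum quoted on the page (assumed, not a hard Elo limit)

headroom = headroom_fraction(TOP_SCORE, CEILING)
flagged = is_near_ceiling(TOP_SCORE, CEILING)

print(f"headroom: {headroom:.2%}, flagged as saturated: {flagged}")
```

The remaining headroom comes out at roughly 3.4%, which is why the 5% rule fires.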

Does Chatbot Arena Elo · Coding predict performance on other benchmarks?

Yes · Chatbot Arena Elo · Coding scores correlate 0.95 with SWE-bench Verified across 10 shared models. Models that do well on Chatbot Arena Elo · Coding tend to do well on SWE-bench Verified, though 10 models is a small sample and the correlation may shift as more are added.
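
A cross-benchmark correlation like this could be computed as follows. This sketch assumes a Pearson correlation, which the page does not specify, and the input lists are toy values chosen only to exercise the function. They are not real benchmark scores.

```python
from math import sqrt
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / sqrt(var_x * var_y)


# Toy data (NOT benchmark scores): a perfectly linear relationship.
toy_a = [1.0, 2.0, 3.0, 4.0]
toy_b = [2.0, 4.0, 6.0, 8.0]
r_toy = pearson(toy_a, toy_b)

print(f"r = {r_toy:.2f}")
```

In practice the two lists would hold each shared model's score on the two benchmarks, paired in the same model order.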

How often is Chatbot Arena Elo · Coding data refreshed?

BenchGecko pulls updates daily. New model scores on Chatbot Arena Elo · Coding appear as soon as they are published by Epoch AI or the model provider.