Which model leads on GeoBench?

Gemini 3 Flash Preview from Google DeepMind leads GeoBench with a score of 88.0. The median score across 26 tested models is 66.0.

Is GeoBench saturated?

No · the top score is 88.0 out of 100 (88%). There is still meaningful room for improvement on GeoBench.

Does GeoBench predict performance on other benchmarks?

Yes · GeoBench scores correlate 0.96 with Artificial Analysis · Agentic Index across 5 shared models. Models that do well on GeoBench tend to do well on Artificial Analysis · Agentic Index.

How often is GeoBench data refreshed?

BenchGecko pulls updates daily. New model scores on GeoBench appear as soon as they are published by Epoch AI or the model provider.

Benchmark · KnowledgeSettled

GeoBench

Name: GeoBench Benchmark
Creator: BenchGecko
License: https://creativecommons.org/licenses/by/4.0/

GeoBench · tests geographic knowledge and spatial reasoning across countries, landmarks, coordinates, and geopolitical understanding.

Updated 2025-12-17

Models tested

Top score

88.0

Gemini 3 Flash Preview

Median

66.0

min 34.0

Top-5 spread

σ 2.9

Competitive

The Frontier

Best score over time · one chart, every benchmark

Chart type

Frontier on GeoBench rose from 64.0 to 88.0 in 17 months · +24.0 points · latest leader Gemini 3 Flash Preview from Google DeepMind.

Pink dots = frontier records · 5 totalClick to open model page

Full rankings

26 models tested · sorted by score

#	Model	Score	Price
1	Gemini 3 Flash Preview· Google DeepMind	88.0	$0.50
2	Gemini 3 Pro· Google DeepMind	84.0	—
3	Gemini 2.5 Pro· Google DeepMind	81.0	$1.25
4	GPT-5· OpenAI	81.0	$1.25
5	o1· OpenAI	80.0	$15.00
6	Gemini 2.0 Flash· Google DeepMind	77.0	$0.10
7	Gemini 1.5 Flash (May 2024)· Google DeepMind	76.0	—
8	Claude Opus 4.5· Anthropic	75.0	$5.00
9	o3· OpenAI	74.0	$2.00
10	Gemini 2.5 Flash· Google DeepMind	73.0	$0.30
11	GPT-4.1· OpenAI	72.0	$2.00
12	GPT-4o (2024-11-20)· OpenAI	71.0	$2.50
13	Claude 3.7 Sonnet· Anthropic	68.0	$3.00
14	GPT-4o-mini· OpenAI	64.0	$0.15
15	GPT-4o-mini (2024-07-18)· OpenAI	64.0	$0.15
16	o4 Mini· OpenAI	64.0	$1.10
17	Claude 3.5 Sonnet· Anthropic	62.0	—
18	Qwen2.5 72B Instruct· Alibaba Qwen	62.0	$0.36
19	Gemma 3 27B· Google DeepMind	52.0	$0.08
20	Gemma 3 27B (free)· Google DeepMind	52.0	$0.00
21	Llama 3.2 90B· Meta	52.0	—
22	Llama 4 Maverick· Meta	52.0	$0.15
23	Claude Opus 4· Anthropic	49.0	$15.00
24	Grok 4· xAI	45.0	$3.00
25	Claude Sonnet 4· Anthropic	37.0	$3.00
26	Claude 3.5 Haiku· Anthropic	34.0	$0.80