Artificial Analysis · Agentic Index
The Frontier
Best score over time · one chart, every benchmark
Distribution
Where models cluster
Correlated benchmarks
Pearson r · original research
Benchmarks that track with Artificial Analysis · Agentic Index
Pearson correlation across models scored on both benchmarks. Closer to 1 = strongly predictive.
Full rankings
62 models tested · sorted by score
Frequently asked
Pulled from the Artificial Analysis · Agentic Index dataset · updated daily
What does Artificial Analysis · Agentic Index measure?
Artificial Analysis · Agentic Index is a knowledge benchmark in the BenchGecko catalog. 62 AI models have been tested on it. Scores range from 2.7 to 69.4 out of 60.
Which model leads on Artificial Analysis · Agentic Index?
GPT-5.4 from OpenAI leads Artificial Analysis · Agentic Index with a score of 69.4. The median score across 62 tested models is 38.8.
Is Artificial Analysis · Agentic Index saturated?
Yes · the top model on Artificial Analysis · Agentic Index has reached 69.4 out of 60, within 5% of the theoretical ceiling. This benchmark is approaching saturation and may be replaced by a harder successor.
Does Artificial Analysis · Agentic Index predict performance on other benchmarks?
Yes · Artificial Analysis · Agentic Index scores correlate 0.96 with GeoBench across 5 shared models. Models that do well on Artificial Analysis · Agentic Index tend to do well on GeoBench.
How often is Artificial Analysis · Agentic Index data refreshed?
BenchGecko pulls updates daily. New model scores on Artificial Analysis · Agentic Index appear as soon as they are published by Epoch AI or the model provider.
Top on Artificial Analysis · Agentic Index
GPT-5.4 · 69.4Claude Opus 4.6 (Fast) · 67.6GLM 5.1 · 67.0GLM 5 Turbo · 63.1Claude Sonnet 4.6 · 63.0More knowledge benchmarks
Same category · related evaluations