Artificial Analysis · Quality Index
The Frontier
Best score over time · one chart, every benchmark
Distribution
Where models cluster
Correlated benchmarks
Pearson r · original research
Benchmarks that track with Artificial Analysis · Quality Index
Pearson correlation, computed across the models scored on both benchmarks. Values closer to 1 indicate a stronger positive linear relationship between the two sets of scores.
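As a rough illustration of how such a figure can be computed, here is a minimal sketch in Python. The function names `shared_scores` and `pearson_r` and the dictionary layout (model name mapped to score) are assumptions made for this example, not BenchGecko's actual pipeline; the model names and numbers in the usage lines are placeholders, not real results.

```python
import math


def shared_scores(bench_a, bench_b):
    """Return paired score lists for the models present in both benchmarks.

    bench_a and bench_b are hypothetical dicts of model name -> score.
    """
    shared = sorted(bench_a.keys() & bench_b.keys())
    return [bench_a[m] for m in shared], [bench_b[m] for m in shared]


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return cov / math.sqrt(var_x * var_y)


# Placeholder data: only models scored on both benchmarks are compared.
quality = {"model-a": 10.0, "model-b": 30.0, "model-c": 50.0, "model-d": 42.0}
coding = {"model-a": 8.0, "model-b": 25.0, "model-c": 47.0}
xs, ys = shared_scores(quality, coding)
print(len(xs), round(pearson_r(xs, ys), 3))
```

Restricting the calculation to shared models matters: the shared-model count can be smaller than the number of models tested on either benchmark alone.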
Full rankings
68 models tested · sorted by score
Frequently asked
Pulled from the Artificial Analysis · Quality Index dataset · updated daily
What does Artificial Analysis · Quality Index measure?
Artificial Analysis · Quality Index is a knowledge benchmark in the BenchGecko catalog. 68 AI models have been tested on it. Scores range from 7.7 to 57.2 out of 60.
Which model leads on Artificial Analysis · Quality Index?
Gemini 3.1 Pro Preview from Google DeepMind leads Artificial Analysis · Quality Index with a score of 57.2; GPT-5.4 is listed with the same score. The median score across the 68 tested models is 32.1.
Is Artificial Analysis · Quality Index saturated?
Yes · the top model on Artificial Analysis · Quality Index has reached 57.2 out of 60, which leaves less than 5% of the theoretical ceiling as headroom. The benchmark is approaching saturation and may be replaced by a harder successor.
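The arithmetic behind that answer can be sketched as follows. The 5% threshold and the 57.2 and 60 figures come from the text above; the function names `headroom_fraction` and `is_near_saturation` are hypothetical and only illustrate one way to express the check.

```python
def headroom_fraction(top_score, max_score):
    """Fraction of the maximum score that the top model has not yet reached."""
    if max_score <= 0:
        raise ValueError("max_score must be positive")
    return (max_score - top_score) / max_score


def is_near_saturation(top_score, max_score, threshold=0.05):
    """True when the remaining headroom is at or below the threshold (assumed 5%)."""
    return headroom_fraction(top_score, max_score) <= threshold


# Figures quoted on this page: top score 57.2 out of a maximum of 60.
print(f"{headroom_fraction(57.2, 60):.1%}")  # 4.7%
print(is_near_saturation(57.2, 60))  # True
```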
Does Artificial Analysis · Quality Index predict performance on other benchmarks?
Yes · Artificial Analysis · Quality Index scores have a Pearson correlation of 0.97 with Artificial Analysis · Coding Index scores across 66 shared models. Models that do well on Artificial Analysis · Quality Index tend to do well on Artificial Analysis · Coding Index.
How often is Artificial Analysis · Quality Index data refreshed?
BenchGecko pulls updates daily. New model scores on Artificial Analysis · Quality Index appear as soon as they are published by Epoch AI or the model provider.
Top on Artificial Analysis · Quality Index
Gemini 3.1 Pro Preview · 57.2
GPT-5.4 · 57.2
GPT-5.3-Codex · 54.0
Claude Opus 4.6 (Fast) · 53.0
Muse Spark · 52.1
More knowledge benchmarks
Same category · related evaluations