Beta
Benchmark · Knowledge

LiveBench · Data Analysis

Updated 2026-04-07
Models tested
29
Top score
78.2
GPT-5.2-Codex
Median
49.8
min 21.2
Top-5 spread
σ 6.1
wide open

Best score over time · one chart, every benchmark

LIVEBENCH · DATA ANALYSIS29 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 25Sep 25Nov 25Feb 26Apr 26RELEASE DATE →benchgecko.ai/benchmark/livebench-data-analysis · frontier
Frontier on LiveBench · Data Analysis rose from 44.7 to 78.2 in 6 months · +33.5 points · latest leader GPT-5.2-Codex from OpenAI.
Pink dots = frontier records · 5 totalClick to open model page

Where models cluster

SCORE DISTRIBUTION0–1010–20120–30330–401140–50950–60460–70170–8080–9090–100MEDIAN · 49.8SCORE BUCKET → (0 TO 100)MODELSbenchgecko.ai

Pearson r · original research

29 models tested · sorted by score

Pulled from the LiveBench · Data Analysis dataset · updated daily

What does LiveBench · Data Analysis measure?

LiveBench · Data Analysis is a knowledge benchmark in the BenchGecko catalog. 29 AI models have been tested on it. Scores range from 21.2 to 78.2 out of 100.

Which model leads on LiveBench · Data Analysis?

GPT-5.2-Codex from OpenAI leads LiveBench · Data Analysis with a score of 78.2. The median score across 29 tested models is 49.8.

Is LiveBench · Data Analysis saturated?

No · the top score is 78.2 out of 100 (78%). There is still meaningful room for improvement on LiveBench · Data Analysis.

Does LiveBench · Data Analysis predict performance on other benchmarks?

Yes · LiveBench · Data Analysis scores correlate 0.84 with LiveBench · Overall across 29 shared models. Models that do well on LiveBench · Data Analysis tend to do well on LiveBench · Overall.

How often is LiveBench · Data Analysis data refreshed?

BenchGecko pulls updates daily. New model scores on LiveBench · Data Analysis appear as soon as they are published by Epoch AI or the model provider.

Same category · related evaluations