Beta
Benchmark · Knowledge

AudioMultiChallenge · Text Output

Updated 2025-10-30
Models tested
3
Top score
46.9
Gemini 2.5 Pro
Median
40.0
min 26.3
Top-5 spread
σ 8.6
wide open

Where models cluster

SCORE DISTRIBUTION0–1010–20120–3030–40240–5050–6060–7070–8080–9090–100MEDIAN · 40.0SCORE BUCKET → (0 TO 100)MODELSbenchgecko.ai

Pearson r · original research

Not enough overlapping models yet.

3 models tested · sorted by score

Pulled from the AudioMultiChallenge · Text Output dataset · updated daily

What does AudioMultiChallenge · Text Output measure?

AudioMultiChallenge · Text Output is a knowledge benchmark in the BenchGecko catalog. 3 AI models have been tested on it. Scores range from 26.3 to 46.9 out of 100.

Which model leads on AudioMultiChallenge · Text Output?

Gemini 2.5 Pro from Google DeepMind leads AudioMultiChallenge · Text Output with a score of 46.9. The median score across 3 tested models is 40.0.

Is AudioMultiChallenge · Text Output saturated?

No · the top score is 46.9 out of 100 (47%). There is still meaningful room for improvement on AudioMultiChallenge · Text Output.

What makes AudioMultiChallenge · Text Output distinctive?

AudioMultiChallenge · Text Output is a knowledge benchmark with limited overlap to the rest of the catalog · it measures capabilities that are not well-covered by other benchmarks we track.

How often is AudioMultiChallenge · Text Output data refreshed?

BenchGecko pulls updates daily. New model scores on AudioMultiChallenge · Text Output appear as soon as they are published by Epoch AI or the model provider.

Same category · related evaluations