Question 1

What does AudioMultiChallenge · Text Output measure?

Accepted Answer

AudioMultiChallenge · Text Output is a knowledge benchmark in the BenchGecko catalog. 3 AI models have been tested on it. Scores range from 26.3 to 46.9 out of 100.

Question 2

Which model leads on AudioMultiChallenge · Text Output?

Accepted Answer

Gemini 2.5 Pro from Google DeepMind leads AudioMultiChallenge · Text Output with a score of 46.9. The median score across 3 tested models is 40.0.

Question 3

Is AudioMultiChallenge · Text Output saturated?

Accepted Answer

No · the top score is 46.9 out of 100 (47%). There is still meaningful room for improvement on AudioMultiChallenge · Text Output.

Question 4

What makes AudioMultiChallenge · Text Output distinctive?

Accepted Answer

AudioMultiChallenge · Text Output is a knowledge benchmark with limited overlap to the rest of the catalog · it measures capabilities that are not well-covered by other benchmarks we track.

Question 5

How often is AudioMultiChallenge · Text Output data refreshed?

Accepted Answer

BenchGecko pulls updates daily. New model scores on AudioMultiChallenge · Text Output appear as soon as they are published by Epoch AI or the model provider.

#	Model	Score	Price
1	Gemini 2.5 Pro· Google DeepMind	46.9	$1.25
2	Gemini 2.5 Flash· Google DeepMind	40.0	$0.30
3	Voxtral Small 24B 2507· Mistral AI	26.3	$0.10

AudioMultiChallenge · Text Output

Distribution

Correlated benchmarks

Full rankings

Frequently asked

Top on AudioMultiChallenge · Text Output

Related topics

Compare models

More knowledge benchmarks