Question 1

What does MASK measure?

Accepted Answer

MASK is a knowledge benchmark in the BenchGecko catalog. 2 AI models have been tested on it. Scores range from 95.3 to 96.3 out of 100.

Question 2

Which model leads on MASK?

Accepted Answer

Claude Opus 4.6 (Fast) from Anthropic leads MASK with a score of 96.3. The median score across 2 tested models is 95.8.

Question 3

Is MASK saturated?

Accepted Answer

Yes · the top model on MASK has reached 96.3 out of 100, within 5% of the theoretical ceiling. This benchmark is approaching saturation and may be replaced by a harder successor.

Question 4

What makes MASK distinctive?

Accepted Answer

MASK is a knowledge benchmark with limited overlap to the rest of the catalog · it measures capabilities that are not well-covered by other benchmarks we track.

Question 5

How often is MASK data refreshed?

Accepted Answer

BenchGecko pulls updates daily. New model scores on MASK appear as soon as they are published by Epoch AI or the model provider.

#	Model	Score	Price	Bar
1	Claude Opus 4.6 (Fast)· Anthropic	96.3	$30.00
2	Claude Sonnet 4· Anthropic	95.3	$3.00

MASK

Distribution

Correlated benchmarks

Full rankings

Frequently asked

Top on MASK

Related topics

Compare models

More knowledge benchmarks