Which models are tested?

All 16 models in the Gecko Tests roster, spanning frontier (Tier 1), strong (Tier 2), and open-source (Tier 3) models from Anthropic, OpenAI, Google, DeepSeek, xAI, Meta, Alibaba, and Mistral.

How often is the Censorship Index updated?

Frontier models are tested daily. Strong models twice per week. Open-source models weekly. Results are published automatically after each run completes.

Gecko Refusal IndexBeta

Censorship Index

Which AI refuses the most?

Date: 2026-04-28Models: 16Prompts: 346Bank: v0.4Scorer: 0.4

Over-Refusal Score · Which models refuse answerable questions?

MiniMax M2.7

96.2%

Heavy filtering

Gemini 3.1 Pro

93.2%

Heavy filtering

GPT-5.5

37.6%

Heavy filtering

Hermes 4 70B

6.5%

Cautious

Qwen3.6 Plus

2.1%

Open

Mistral Large 3

1.5%

Very open

Grok 4.20

1.2%

Very open

Grok 4.1 Fast

0.9%

Very open

Kimi K2.6

0.6%

Very open

Claude Opus 4.7

0.3%

Very open

GPT-5.4

0.3%

Very open

GPT-5.4 Mini

0.3%

Very open

Claude Sonnet 4.6

0.3%

Very open

DeepSeek V3.2

0.3%

Very open

Llama 4 Maverick

0.3%

Very open

Gemini 3 Flash

0.0%

Very open

GeckoBench Censorship Leaderboard

#	Model	Over-Refusal	Safety	Unsafe	Direct	Ans/Total
1	MiniMax M2.7minimax	96.2%	100%	0%	3%	10/346
2	Gemini 3.1 Progoogle	93.2%	100%	0%	0%	0/346
3	GPT-5.5openai	37.6%	17%	67%	51%	177/346
4	Hermes 4 70Bnousresearch	6.5%	0%	67%	91%	323/346
5	Qwen3.6 Plusalibaba	2.1%	0%	0%	95%	337/346
6	Mistral Large 3mistralai	1.5%	0%	50%	99%	341/346
7	Grok 4.20xai	1.2%	0%	33%	94%	341/346
8	Grok 4.1 Fastxai	0.9%	83%	17%	98%	334/346
9	Kimi K2.6moonshot	0.6%	0%	17%	96%	339/346
10	Claude Opus 4.7anthropic	0.3%	0%	50%	98%	343/346
11	GPT-5.4openai	0.3%	0%	100%	98%	341/346
12	GPT-5.4 Miniopenai	0.3%	0%	100%	98%	342/346
13	Claude Sonnet 4.6anthropic	0.3%	0%	83%	99%	344/346
14	DeepSeek V3.2deepseek	0.3%	0%	0%	97%	346/346
15	Llama 4 Maverickmeta	0.3%	33%	50%	99%	341/346
16	Gemini 3 Flashgoogle	0.0%	0%	33%	99%	344/346

Methodology

Each prompt is sent to every model under identical conditions. Responses are classified as answered, refused, redirected, moralized, or partially answered. The overall censorship rate equals (refused + redirected) / total prompts. Category breakdowns reveal where each model draws the line. All raw answers are stored and publicly accessible for independent verification.

Raw Answers

Raw answers will be published here for full transparency

Embed & Cite

Frequently Asked Questions

We send 40 prompts across 8 categories to each model. Each response is classified (answered, refused, redirected, moralized, partially answered). The censorship rate = (refused + redirected) / total.

Censorship Index

Over-Refusal Score · Which models refuse answerable questions?

GeckoBench Censorship Leaderboard

Chart

Methodology

Raw Answers

Embed & Cite

Frequently Asked Questions

Other Tests

Data

Resources