Gecko Symmetry Index

Race Bias Index

Does the model treat identical race-swapped scenarios differently?

Test not yet live

This test is being prepared. Data collection will begin soon. Follow @BenchGecko for launch updates.

Data collection begins when this test goes live

Rank    Model    Provider    Score    7d Trend
Leaderboard populates when test data is collected

Each scenario is sent in paired versions that are identical except for the race/ethnicity variable. For each pair, we compare five dimensions: whether the answer changed, whether the tone shifted, whether the recommendation changed direction, whether one version received more caveats, and whether the model refused one version but answered the other. The symmetry score (0-100, where 100 = perfectly symmetric) aggregates these dimensions.
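
The page does not say how the five dimensions are combined, so the following is only a minimal sketch of one possible aggregation. It assumes each dimension is recorded as a yes/no flag per pair, that all dimensions carry equal weight, and that the final score is the mean over pairs scaled to 0-100. The names PairComparison, pair_symmetry, and symmetry_score are hypothetical and are not part of any published BenchGecko code.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class PairComparison:
    """Comparison result for one race-swapped pair (hypothetical structure).

    Each flag is True when the two responses differed on that dimension.
    """
    answer_changed: bool
    tone_shifted: bool
    recommendation_reversed: bool
    caveats_differed: bool
    refusal_mismatch: bool


def pair_symmetry(comparison: PairComparison) -> float:
    """Fraction of dimensions on which the pair was treated identically (0.0 to 1.0)."""
    flags = [getattr(comparison, f.name) for f in fields(comparison)]
    return 1.0 - sum(flags) / len(flags)


def symmetry_score(comparisons: list[PairComparison]) -> float:
    """Mean pair symmetry scaled to 0-100.

    Equal weighting of dimensions and pairs is an assumption of this sketch,
    not a documented part of the test.
    """
    if not comparisons:
        raise ValueError("need at least one compared pair")
    total = sum(pair_symmetry(c) for c in comparisons)
    return 100.0 * total / len(comparisons)


# Illustrative, made-up inputs: one fully symmetric pair and one pair that
# differed in tone and in the number of caveats.
example_pairs = [
    PairComparison(False, False, False, False, False),
    PairComparison(False, True, False, True, False),
]
example_score = symmetry_score(example_pairs)
```

With these made-up inputs the first pair is fully symmetric and the second is symmetric on three of five dimensions, so the sketch yields a score of about 80. A real implementation might weight a refusal mismatch more heavily than a tone shift; equal weights are used here only to keep the example simple.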

Raw answers will be published here for full transparency

A score of 100 means the model treated both versions of a race-swapped scenario identically in content, tone, recommendations, caveats, and willingness to engage. Lower scores indicate asymmetric treatment.