How often are Gecko Tests updated?

Frontier models are tested daily, strong models twice per week, and open-source models weekly. Results are published automatically, with raw answers available for verification.
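As a rough illustration of that cadence, here is a minimal Python sketch. The tier names and frequencies come from the answer above. The specific weekdays for the twice-weekly and weekly runs, and the names `RUN_WEEKDAYS` and `tiers_due`, are assumptions made for the example and do not describe the actual scheduler.

```python
from datetime import date

# Tier cadences follow the FAQ answer; the chosen weekdays are assumptions.
# Weekday numbers follow date.weekday(): Monday is 0, Sunday is 6.
RUN_WEEKDAYS = {
    "frontier": {0, 1, 2, 3, 4, 5, 6},  # daily
    "strong": {0, 3},                   # twice per week (assumed Monday, Thursday)
    "open_source": {0},                 # weekly (assumed Monday)
}


def tiers_due(day: date) -> list[str]:
    """Return the model tiers scheduled to be tested on the given day."""
    return [tier for tier, days in RUN_WEEKDAYS.items() if day.weekday() in days]


if __name__ == "__main__":
    print(tiers_due(date(2024, 1, 1)))  # a Monday
    print(tiers_due(date(2024, 1, 2)))  # a Tuesday
```

Under these assumed weekdays, all three tiers run on Mondays and only the frontier tier runs on Tuesdays.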

Can I embed BenchGecko Labs charts?

Yes. Every chart on BenchGecko Labs comes with a free embed code. Copy the iframe snippet and paste it into your article or dashboard. An attribution link is required.

BenchGecko Labs

Daily AI Tests, Behavior Data & Charts People Cite

BenchGecko Labs runs Gecko Tests powered by GeckoBench, our proprietary AI behavior benchmark. We test selected frontier and widely used models on a published cadence, publish raw answers, and turn results into embeddable charts.

GeckoBench

GeckoBench is the benchmark engine behind Gecko Tests: 206 prompts with expected-behavior metadata, deterministic scoring, mirror-pair symmetry, and raw-answer transparency.
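This page does not define mirror-pair symmetry or the scoring rule. One plausible reading is that each prompt has a mirrored twin that differs only in the subject, and that a model is expected to treat both the same way. The Python sketch below illustrates that reading only. The `MirrorPair` schema and the `asymmetry_rate` function are hypothetical names for the example, not GeckoBench's actual data model or scoring code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MirrorPair:
    """Outcome of two prompts that differ only in the swapped subject (hypothetical schema)."""
    answered_a: bool  # model answered the prompt about subject A
    answered_b: bool  # model answered the mirrored prompt about subject B


def asymmetry_rate(pairs: list[MirrorPair]) -> float:
    """Fraction of pairs in which the model treated the two mirrored prompts differently."""
    if not pairs:
        return 0.0
    mismatches = sum(1 for p in pairs if p.answered_a != p.answered_b)
    return mismatches / len(pairs)


if __name__ == "__main__":
    results = [
        MirrorPair(True, True),
        MirrorPair(True, False),  # answered one side, refused the mirror
        MirrorPair(False, False),
        MirrorPair(True, True),
    ]
    print(asymmetry_rate(results))
```

The function uses no randomness and no model-based judging, so the same raw answers always produce the same score, which is the sense of "deterministic" assumed here.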

What is BenchGecko Labs?

Traditional benchmarks measure how well a model performs. Labs measures how a model behaves. We track censorship patterns, bias asymmetries, political orientations, moral reasoning, and behavioral drift that standard benchmarks miss entirely.

Every test runs the same prompts on every model, on a published cadence. Results are scored, charted, and published with full raw answers. No black box. No editorial spin. Just data.

Does the model give useful advice in real situations?

Frequently Asked Questions

What is BenchGecko Labs?

BenchGecko Labs runs proprietary daily tests on AI models to measure censorship, bias, political orientation, reasoning ability, moral decision-making, and behavioral drift. Same prompts, same models, every day.

Featured Tests

Censorship Index

AI Political Compass

Race Bias Index

Would AI Let People Die?

AI IQ Test

Real-Life AI Test
