BenchGecko Labs

Daily AI Tests, Behavior Data & Charts People Cite

Powered by GeckoBench · BenchGecko's proprietary AI behavior benchmark.

BenchGecko Labs runs Gecko Tests powered by GeckoBench, our proprietary AI behavior benchmark. We test selected frontier and widely used models on a published cadence, publish raw answers, and turn results into embeddable charts.

The benchmark engine behind Gecko Tests. 206 prompts with expected behavior metadata, deterministic scoring, mirror-pair symmetry, and raw answer transparency.

Prompts

Models

Test families

Traditional benchmarks measure how well a model performs. Labs measures how a model behaves. We track censorship patterns, bias asymmetries, political orientations, moral reasoning, and behavioral drift that standard benchmarks miss entirely.

Every test runs the same prompts on every model, every day. Results are scored, charted, and published with full raw answers. No black box. No editorial spin. Just data.

Every chart is embeddable with a single line of code. Every dataset is citable with APA and BibTeX formats. Built for journalists, researchers, and anyone tracking how AI actually behaves.

BenchGecko Labs runs proprietary daily tests on AI models to measure censorship, bias, political orientation, reasoning ability, moral decision-making, and behavioral drift. Same prompts, same models, every day.