BenchGecko Labs

Daily AI Tests, Behavior Data & Charts People Cite

We run the same prompts on every frontier model, every day. Raw answers. Public charts. Embeddable data. The AI behavior layer nobody else is building.

Live signals will appear here once Gecko Tests go live. First test: Censorship Index.

Traditional benchmarks measure how well a model performs. Labs measures how a model behaves. We track censorship patterns, bias asymmetries, political orientations, moral reasoning, and behavioral drift that standard benchmarks miss entirely.

Every test runs the same prompts on every model, every day. Results are scored, charted, and published with full raw answers. No black box. No editorial spin. Just data.
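For readers who want a concrete picture of "same prompts, every model, every day," here is a minimal sketch. It is not the Labs pipeline: the model callables, the `Result` record, and the keyword-based refusal rule are all hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List


@dataclass
class Result:
    day: str        # ISO date of the run
    model: str      # model name (hypothetical labels here)
    prompt: str
    answer: str     # raw answer, stored verbatim
    refused: bool   # score from the illustrative rule below


# Assumption: a crude keyword rule, used only to make the sketch runnable.
REFUSAL_MARKERS = ("i can't", "i cannot", "i won't")


def is_refusal(answer: str) -> bool:
    text = answer.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def run_daily(prompts: List[str],
              models: Dict[str, Callable[[str], str]],
              day: date) -> List[Result]:
    """Send every prompt to every model and keep the raw answers."""
    results = []
    for name, ask in models.items():
        for prompt in prompts:
            answer = ask(prompt)
            results.append(
                Result(day.isoformat(), name, prompt, answer, is_refusal(answer))
            )
    return results


def refusal_rate(results: List[Result]) -> Dict[str, float]:
    """Fraction of prompts each model refused on the given run."""
    totals: Dict[str, int] = {}
    refused: Dict[str, int] = {}
    for r in results:
        totals[r.model] = totals.get(r.model, 0) + 1
        refused[r.model] = refused.get(r.model, 0) + int(r.refused)
    return {m: refused[m] / totals[m] for m in totals}
```

Because each `Result` keeps the raw answer next to its score, anyone reading the chart can check how a given data point was scored.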

Every chart is embeddable with a single line of code. Every dataset is citable in APA and BibTeX formats. Built for journalists, researchers, and anyone tracking how AI actually behaves.
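As a rough illustration of what a dataset citation in those two formats might look like, the sketch below builds a BibTeX `@misc` entry and an APA-style data set reference from a few fields. The field names, citation key, year, and URL are placeholders, not the citation strings Labs actually publishes.

```python
def to_bibtex(key: str, title: str, author: str, year: int, url: str) -> str:
    """Format a dataset as a BibTeX @misc entry."""
    return (
        f"@misc{{{key},\n"
        # Double braces keep an organization name from being split into first/last.
        f"  author = {{{{{author}}}}},\n"
        f"  title = {{{title}}},\n"
        f"  year = {{{year}}},\n"
        f"  howpublished = {{\\url{{{url}}}}}\n"
        f"}}"
    )


def to_apa(title: str, author: str, year: int, url: str) -> str:
    """Format a dataset as an APA-style reference with a [Data set] label."""
    return f"{author}. ({year}). {title} [Data set]. {url}"


# Placeholder values for illustration only.
example = dict(
    title="Censorship Index",
    author="BenchGecko Labs",
    year=2025,                                   # hypothetical year
    url="https://example.org/censorship-index",  # hypothetical URL
)

bibtex_entry = to_bibtex("censorship_index", **example)
apa_entry = to_apa(**example)
```

The `\url` command in the BibTeX entry assumes the LaTeX document loads the `url` or `hyperref` package.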

BenchGecko Labs runs proprietary daily tests on AI models to measure censorship, bias, political orientation, reasoning ability, moral decision-making, and behavioral drift. Same prompts, same models, every day.