Beta
Models · FamiliesLive · 65 families · 317 variants · 19 providers

Every Model Family · Tracked

One page per family · every Claude Opus variant, every Qwen 3 size, every Llama 4 release. Aggregate stats, price range, and the full lineage of each lab's work.

Lineage trackedBenchmark aggregatedVariant rollupRefreshed daily
All variants
Families
65
distinct lineages
Variants
317
across all families
Top variant score
81.9
GPT-5
Biggest family
36
Qwen 3
OpenAI logo· 12 families
Alibaba Qwen logo· 10 families
DeepSeek logo· 4 families
Google DeepMind logo· 4 families
stepfun logo· 1 family
xiaomi logo· 1 family
xAI logo· 3 families
minimax logo· 2 families
z-ai logo· 4 families
Microsoft logo· 2 families
Anthropic logo· 3 families
Mistral AI logo· 4 families
Meta logo· 6 families
Cohere logo· 1 family
NVIDIA logo· 2 families
sao10k logo· 2 families
RH
· 1 family
U
· 1 family
ByteDance logo· 2 families

How the hierarchy works

What is a model family?

A model family is a group of related AI model variants released by the same lab · for example Claude Opus ships as Opus 3, 3.5, 4, 4.6 and Qwen 3 ships as dozens of parameter sizes. BenchGecko tracks 65 families with 317 total variants.

Which family has the most variants?

Qwen 3 has 36 tracked variants, released by Alibaba.

Which family has the top-scoring variant?

The highest-scoring variant is GPT-5 Chat at 81.9 · part of the GPT-5 family by OpenAI.

Why not dedupe families into one canonical model?

Because per-variant pricing, benchmarks, and context windows differ materially. Deduping would destroy data quality. Instead we keep every variant in /models and expose the family as a second navigation layer · see ADR-0005.

Keep exploring the BenchGecko graph