Live11 categories · 994 models tracked

Models by Category.
Ranked.

Every model category we track · one leaderboard per task. Pick the skill you care about and see which models dominate it.

Best AI Models for Coding

85 models

AI models ranked by coding benchmarks. Compare HumanEval+, SWE-bench Verified, Aider Polyglot, and more across all providers.

3Claude Mythos Preview

81.8

Best AI Models for Reasoning

135 models

AI models ranked by reasoning benchmarks. Compare GPQA Diamond, ARC-AGI, BBH, and other reasoning tests across all providers.

3Claude Mythos Preview

81.8

Best AI Models for Math

120 models

AI models ranked by math benchmarks. Compare MATH-500, GSM8K, and competition-level math scores across all providers.

Best AI Models for Knowledge

127 models

AI models ranked by knowledge benchmarks. Compare MMLU-Pro, GPQA Diamond, SimpleQA, and other knowledge tests.

3Claude Mythos Preview

81.8

Best AI Models for Vision

27 models

AI models ranked by vision and multimodal benchmarks. Compare MMMU, VideoMME, and visual reasoning scores.

Best Open Source AI Models

142 models

Open-source AI models ranked by benchmark score. Compare Llama, Mistral, DeepSeek, Qwen, and other open-weight models.

Top 3

1Qwen3.5 397B A17B

78.4

2DeepSeek V3.2 Speciale

78.2

3Step 3.5 Flash

76.9

AI Models Under $1/1M Tokens

91 models

Cheapest AI models ranked by score. All models with input pricing under $1 per million tokens, sorted by benchmark performance.

Top 3

1Qwen3.5 397B A17B

78.4

2DeepSeek V3.2 Speciale

78.2

3Step 3.5 Flash

76.9

AI Models Under $5/1M Tokens

130 models

AI models under $5 per million tokens ranked by benchmark score. The sweet spot of price and performance.

3DeepSeek V3.2 Speciale

78.2

AI Models with 1M+ Context

25 models

AI models with 1 million+ token context windows ranked by score. Compare Gemini, Claude, and other long-context models.

Top 3

1Claude Mythos Preview

81.8

2Gemini 2.5 Pro Preview 05-06

76.9

3Qwen3.6 Plus

70.9

Best Small AI Models

84 models

Small AI models (under 10B parameters) ranked by benchmark score. Lightweight models you can run locally.

Top 3

1Gemini 2.5 Pro Preview 05-06

Flagship AI Models

243 models

The best AI model from each provider, ranked by benchmark score. Compare the flagships from OpenAI, Anthropic, Google, Meta, and more.

Models by Category.Ranked.

Best AI Models for Coding

Best AI Models for Reasoning

Best AI Models for Math

Best AI Models for Knowledge

Best AI Models for Vision

Best Open Source AI Models

AI Models Under $1/1M Tokens

AI Models Under $5/1M Tokens

AI Models with 1M+ Context

Best Small AI Models

Flagship AI Models

Models by Category.
Ranked.