Models by Category.
Ranked.
Every model category we track · one leaderboard per task. Pick the skill you care about and see which models dominate it.
Best AI Models for Coding
AI models ranked by coding benchmarks. Compare HumanEval+, SWE-bench Verified, Aider Polyglot, and more across all providers.
Best AI Models for Reasoning
AI models ranked by reasoning benchmarks. Compare GPQA Diamond, ARC-AGI, BBH, and other reasoning tests across all providers.
Best AI Models for Math
AI models ranked by math benchmarks. Compare MATH-500, GSM8K, and competition-level math scores across all providers.
Best AI Models for Knowledge
AI models ranked by knowledge benchmarks. Compare MMLU-Pro, GPQA Diamond, SimpleQA, and other knowledge tests.
Best AI Models for Vision
AI models ranked by vision and multimodal benchmarks. Compare MMMU, VideoMME, and visual reasoning scores.
Best Open Source AI Models
Open-source AI models ranked by benchmark score. Compare Llama, Mistral, DeepSeek, Qwen, and other open-weight models.
AI Models Under $1/1M Tokens
Cheapest AI models ranked by score. All models with input pricing under $1 per million tokens, sorted by benchmark performance.
AI Models Under $5/1M Tokens
AI models under $5 per million tokens ranked by benchmark score. The sweet spot of price and performance.
AI Models with 1M+ Context
AI models with 1 million+ token context windows ranked by score. Compare Gemini, Claude, and other long-context models.
Best Small AI Models
Small AI models (under 10B parameters) ranked by benchmark score. Lightweight models you can run locally.
Flagship AI Models
The best AI model from each provider, ranked by benchmark score. Compare the flagships from OpenAI, Anthropic, Google, Meta, and more.