LIVE · Tracking 994 AI models from 267 providers.
BenchGecko

BenchGecko is the data layer of the AI economy. Thousands of models with cross-provider pricing and daily price history. Hundreds of companies with valuations, funding timelines, and revenue estimates. Benchmark scores, developer adoption signals, agent leaderboards, and a changelog that captures every price drop, every launch, every deprecation as it happens. If it moved in AI today, it's already on BenchGecko.

© 2026 BenchGecko·The AI Economy, Tracked
Models · Leaderboard
Updated 8d ago · 994 models · 267 providers · 128 benchmarks

Every AI Model · Tracked

The most complete list of AI models you can actually use · 994 models, 267 providers, 128 benchmarks · all scored, priced, and ranked in one place.

  • Benchmark scored
  • Cross-provider pricing
  • Context · parameters · licensing
  • Refreshed daily
  • Total Models: 994
  • Providers: 267
  • Top Model: GPT-5.5 Pro (87.8%)
  • Cheapest Capable: gpt-oss-20b ($0.03/M)

Top 10 Overall

Ranked by average benchmark score · min 3 benchmarks

Rank · Model · Provider · Avg · Benchmarks
#1 GOLD · GPT-5.5 Pro · OpenAI · 87.8% · 3
#2 SILVER · GPT-5.5 · OpenAI · 85.0% · 6
#3 BRONZE · GPT-5 Chat · OpenAI · 81.9% · 7
#4 · Claude Mythos Preview · Anthropic · 81.8% · 14
#5 · Qwen3.5 397B A17B · Alibaba Qwen · 78.4% · 11
#6 · DeepSeek V3.2 Speciale · DeepSeek · 78.2% · 9
#7 · Claude Instant · Anthropic · 78.0% · 4
#8 · Step 3.5 Flash · stepfun · 76.9% · 10
#9 · DeepSeek-V2 (MoE-236B, May 2024) · DeepSeek · 76.5% · 7
#10 · MiMo-V2-Flash · xiaomi · 73.3% · 11

What's moving

New releases, coverage, price leaders, and ELO champions

# · Model · Provider · Category · Released
1 · Granite 4.1 8B · ibm-granite · LLM · Apr 30, 2026
2 · Grok 4.3 · xAI · Multimodal · Apr 30, 2026
3 · Nemotron 3 Nano Omni (free) · NVIDIA · Multimodal · Apr 28, 2026
4 · Owl Alpha · openrouter · LLM · Apr 28, 2026
5 · Qwen3.5 Plus 2026-04-20 · Alibaba Qwen · Multimodal · Apr 27, 2026
6 · Qwen3.6 27B · Alibaba Qwen · Multimodal · Apr 27, 2026
7 · Qwen3.6 35B A3B · Alibaba Qwen · Multimodal · Apr 27, 2026
8 · Qwen3.6 Flash · Alibaba Qwen · Multimodal · Apr 27, 2026
9 · Qwen3.6 Max Preview · Alibaba Qwen · LLM · Apr 27, 2026
10 · DeepSeek V4 Flash · DeepSeek · LLM · Apr 24, 2026

All Models

# · Model · Provider · Category · Context · In $/M · Out $/M · Avg · Benchmarks · ELO
1 · Palmyra X5 · writer · LLM · 1.0M · $0.60 · $6.00 · 69.7% · 5 · —


Top 5 by category

Specialist leaders across every modality we track

LLMs

#1 · GPT-5.5 Pro · 87.8%
#2 · GPT-5.5 · 85.0%
#3 · Claude Mythos Preview · 81.8%
#4 · DeepSeek V3.2 Speciale · 78.2%
#5 · Claude Instant · 78.0%

Multimodal

#1 · GPT-5 Chat · 81.9%
#2 · Qwen3.5 397B A17B · 78.4%
#3 · Gemini 2.5 Pro Preview 05-06 · 76.9%
#4 · GPT-5.1-Codex-Max · 72.0%
#5 · o4 Mini High · 72.0%

Decision shortcuts

Jump from the leaderboard into comparison, provider, pricing, and benchmark paths

  • GPT-5.5 Pro vs Qwen3.5 · Frontier comparison
  • Qwen3.5 vs DeepSeek V3.2 · Open model comparison
  • OpenAI model lineup · Provider profile
  • Anthropic Claude lineup · Provider profile
  • Coding model pricing · Cost by use case
  • Reasoning model pricing · Cost by use case
  • SWE-bench Verified results · Coding benchmark
  • GPQA Diamond results · Reasoning benchmark

Frequently asked

Quick answers, sourced from our data

How many AI models does BenchGecko track?

BenchGecko currently tracks 994 AI models across 267 providers, each scored against up to 128 benchmarks. New models are added continuously and the full dataset refreshes daily.

What is the best AI model right now?

"Best" depends on the task. For general reasoning we rank by average score across 3+ benchmarks; for coding we surface ELO and SWE-bench specifically; for cost/performance we expose a "cheapest capable" metric. Use the filter bar and column sort to define your own winner, or pick a category from the mini tables above.

How is the average score calculated?

Average score is the arithmetic mean of a model's normalized benchmark scores, computed only when the model has at least one public benchmark result. Models with fewer than 3 benchmarks are excluded from the podium to avoid single-score outliers.
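
As a rough illustration of the rule above, here is a minimal Python sketch. It assumes scores are already normalized to a 0-100 scale (the normalization step itself is not described here), and the function names and sample models are hypothetical, not BenchGecko's actual code or data.

```python
from statistics import mean

# Podium threshold stated on the page: "min 3 benchmarks".
MIN_PODIUM_BENCHMARKS = 3


def average_score(normalized_scores):
    """Arithmetic mean of normalized scores, or None when there are none."""
    if not normalized_scores:
        return None
    return mean(normalized_scores)


def podium(models, top_n=10):
    """Rank models that have enough benchmarks by average score, descending.

    models: dict mapping a model name to its list of normalized scores.
    """
    eligible = {
        name: average_score(scores)
        for name, scores in models.items()
        if len(scores) >= MIN_PODIUM_BENCHMARKS
    }
    return sorted(eligible.items(), key=lambda item: item[1], reverse=True)[:top_n]


# Hypothetical sample data, for illustration only.
models = {
    "model-a": [90.0, 80.0, 70.0],
    "model-b": [95.0],  # has an average, but too few benchmarks for the podium
    "model-c": [60.0, 70.0, 80.0, 90.0],
}
ranking = podium(models)
```

In this sketch "model-b" still gets an average score of 95.0, but it is left out of the ranking because it has only one benchmark, which is the single-score outlier case the threshold is meant to avoid.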

Where does BenchGecko get this data?

Model metadata and pricing come from OpenRouter's public API. Benchmarks are pulled from Epoch AI (CC-BY) and SWE-bench's public leaderboards. ELO ratings come from LMArena. Everything is re-normalized and cross-linked daily. See the methodology page for full provenance.

Can I use this data in my article or product?

Yes. All BenchGecko data is licensed CC BY 4.0 — attribution required. Use the "Cite this page" button above for ready-made APA, MLA, AP Style, BibTeX, and HTML embed snippets. The free API tier requires a backlink to benchgecko.ai.

How often does the data refresh?

Core model, pricing and benchmark data refreshes every 24 hours. Live status and pricing alerts can fire more frequently when upstream sources change. The "Live" pill on this page lights up when the last refresh was less than an hour ago.
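
The "Live" pill rule can be sketched as a simple time comparison. This is an assumption about how such a check might look, not the site's implementation; `is_live` and `LIVE_WINDOW` are hypothetical names, and timestamps are assumed to be timezone-aware.

```python
from datetime import datetime, timedelta, timezone

# "less than an hour ago", per the answer above.
LIVE_WINDOW = timedelta(hours=1)


def is_live(last_refresh, now=None):
    """Return True when the last refresh happened less than one hour before now."""
    if now is None:
        now = datetime.now(timezone.utc)
    age = now - last_refresh
    # A refresh timestamp in the future is treated as not live.
    return timedelta(0) <= age < LIVE_WINDOW


# Fixed reference time so the example is reproducible.
reference = datetime(2026, 5, 1, 12, 0, tzinfo=timezone.utc)
fresh = is_live(reference - timedelta(minutes=30), now=reference)
stale = is_live(reference - timedelta(days=8), now=reference)
```

Here `fresh` is True and `stale` is False, so a page last updated 8 days ago would not show the pill.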

Which models are open source?

Toggle the "Open Source" filter to see only models with openly available weights. The filter is deliberately permissive: weights-available licenses such as Llama, Qwen, DeepSeek, and Mistral Open count as OSS even when the weights carry usage restrictions.

How do I compare two models side by side?

Check the boxes next to any 2-4 models in the leaderboard above, then click "Compare →" in the pink action bar. You can also navigate directly to /compare/[modelA]-vs-[modelB] for shareable comparisons.
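
For scripted links, the path pattern above can be assembled with a small helper. This is a sketch under assumptions: the exact slug format for each model is not specified here, so the slugs below are hypothetical placeholders, and `compare_path` and `can_compare` are illustrative names.

```python
def can_compare(selected):
    """The leaderboard allows comparing 2 to 4 distinct models at once."""
    return 2 <= len(set(selected)) <= 4


def compare_path(model_a, model_b):
    """Build a /compare/[modelA]-vs-[modelB] path from two model slugs."""
    if not model_a or not model_b:
        raise ValueError("both model slugs are required")
    if model_a == model_b:
        raise ValueError("pick two different models")
    return f"/compare/{model_a}-vs-{model_b}"


# Hypothetical slugs, for illustration only.
path = compare_path("model-a", "model-b")
```

With these placeholder slugs, `path` is "/compare/model-a-vs-model-b". Look up the real slug for each model on its own page before sharing a link.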

See also

Related model, benchmark, provider, and pricing indexes

  • Compare models · Side-by-side with charts
  • Benchmarks · Every eval we track
  • Providers · Where to run models
  • Pricing · Every model, every provider
  • AI agents · SWE-bench leaderboard
  • MCP servers · Tools for LLMs
  • AI economy · Valuations & funding
  • Methodology · How we score