LIVETracking 994 AI models from 267 providers.
BenchGeckoBeta
AI
Providers
Economy
Mindshare
Infra
Learn
Labs
Developers
Charts·Build live AI market viewsOpen chartsBuild your own chart
Home·Models
BenchGecko

BenchGecko is the data layer of the AI economy. Thousands of models with cross-provider pricing and daily price history. Hundreds of companies with valuations, funding timelines, and revenue estimates. Benchmark scores, developer adoption signals, agent leaderboards, and a changelog that captures every price drop, every launch, every deprecation as it happens. If it moved in AI today, it's already on BenchGecko.

@BenchGecko

Rankings

  • Leaderboard
  • All Models
  • AI Agents
  • Benchmarks
  • Compare Models
  • MCP Servers
  • Skills
  • Pricing

Top Models

  • Qwen3 VL 235B A22B Instruct
  • DeepSeek V3.1 Terminus
  • Qwen3 VL 235B A22B Thinking
  • Trinity Large Preview (free)
  • GLM 4.5 Air
  • GLM 4.7 Flash
  • MiniMax M1
  • GLM 4.5V
  • Gemma 3 12B
  • Nova 2 Lite

Providers

  • AI Research
  • AI21 Labs
  • AI4Bharat
  • aion-labs
  • alfredpros
  • Alibaba
  • Alimama Creative
  • Allen Dou
  • All Providers →

Benchmarks

  • Aider · Code Editing
  • Aider polyglot
  • ANLI
  • APEX-Agents
  • ARC AI2
  • ARC-AGI
  • ARC-AGI-2
  • Artificial Analysis · Agentic Index
  • All Benchmarks →

Resources

  • API Documentation
  • Methodology
  • Research
  • Gecko Mindshare
  • News
  • Changelog
  • Countries
  • Model Status
  • Brand Assets
  • Developers
  • Charts
  • Learn
  • Economy
  • Labs
  • Gecko Tests
© 2026 BenchGecko·The AI Economy, Tracked
Privacy·Terms·Press
Models · LeaderboardUpdated 19d ago · 994 models · 267 providers · 128 benchmarks

Every AI Model · Tracked

The most complete list of AI models you can actually use · 994 models, 267 providers, 128 benchmarks · all scored, priced, and ranked in one place.

Benchmark scoredCross-provider pricingContext · parameters · licensingRefreshed daily
Compare side by sideShowing 58 of 994
Total Models
994
Providers
267
Top Model
GPT-5.5 Pro
87.8%
Cheapest Capable
gpt-oss-20b
$0.03/M
PodiumMoversLeaderboardFAQ

Top 10 Overall

Ranked by average benchmark score · min 3 benchmarks

#1 GOLD3 benchmarks
OpenAI logo
GPT-5.5 Pro
OpenAI
87.8%
#2 SILVER6 benchmarks
OpenAI logo
GPT-5.5
OpenAI
85.0%
#3 BRONZE7 benchmarks
OpenAI logo
GPT-5 Chat
OpenAI
81.9%
#4Anthropic logo
Claude Mythos Preview
Anthropic
81.8%14 benchmarks
#5Alibaba Qwen logo
Qwen3.5 397B A17B
Alibaba Qwen
78.4%11 benchmarks
#6DeepSeek logo
DeepSeek V3.2 Speciale
DeepSeek
78.2%9 benchmarks
#7Anthropic logo
Claude Instant
Anthropic
78.0%4 benchmarks
#8stepfun logo
Step 3.5 Flash
stepfun
76.9%10 benchmarks
#9DeepSeek logo
DeepSeek-V2 (MoE-236B, May 2024)
DeepSeek
76.5%7 benchmarks
#10xiaomi logo
MiMo-V2-Flash
xiaomi
73.3%11 benchmarks

What's moving

New releases, coverage, price leaders, and ELO champions

#ModelProviderCategoryReleased
1ibm-granite logoGranite 4.1 8Bibm-graniteLLMApr 30, 2026
2xAI logoGrok 4.3xAIMultimodalApr 30, 2026
3NVIDIA logoNemotron 3 Nano Omni (free)NVIDIAMultimodalApr 28, 2026
4openrouter logoOwl AlphaopenrouterLLMApr 28, 2026
5Alibaba Qwen logoQwen3.5 Plus 2026-04-20Alibaba QwenMultimodalApr 27, 2026
6Alibaba Qwen logoQwen3.6 27BAlibaba QwenMultimodalApr 27, 2026
7Alibaba Qwen logoQwen3.6 35B A3BAlibaba QwenMultimodalApr 27, 2026
8Alibaba Qwen logoQwen3.6 FlashAlibaba QwenMultimodalApr 27, 2026
9Alibaba Qwen logoQwen3.6 Max PreviewAlibaba QwenLLMApr 27, 2026
Provider
Filter
58 results

All Models

#ModelProviderCategoryContextIn $/MOut $/MAvgBenchmarksELO
1Anthropic logoClaude Mythos PreviewAnthropicLLM1.0MTBDTBD81.8%14—
2Google DeepMind logoGemini 2.5 Pro Preview 05-06Google DeepMindMultimodal1.0M$1.25$10.0076.9%1—
3Alibaba Qwen logoQwen3.6 PlusOSSAlibaba QwenMultimodal1.0M$0.33$1.9570.9%11—
4writer logoPalmyra X5writerLLM1.0M$0.60$6.0069.7%5—
5OpenAI logoGPT-5.4 ProOpenAIMultimodal1.1M$30.00$180.0066.7%8—
6Google DeepMind logoGemini 2.0 Flash LiteGoogle DeepMindMultimodal1.0M$0.07$0.3064.2%5—
7Google DeepMind logoGemini 3.1 Pro PreviewGoogle DeepMindMultimodal1.0M$2.00$12.0060.6%23—
8Google DeepMind logoGemini 2.5 Flash LiteGoogle DeepMindMultimodal1.0M$0.10$0.4059.1%8—
9OpenAI logoGPT-5.4OpenAIMultimodal1.1M$2.50$15.0059.0%16—
10xiaomi logoMiMo-V2-ProxiaomiLLM1.0M$1.00$3.0058.1%13—
11Anthropic logoClaude Opus 4.6AnthropicMultimodal1.0M$5.00$25.0057.5%19—
12Google DeepMind logoGemini 2.5 ProGoogle DeepMindMultimodal1.0M$1.25$10.0056.2%42—
13Google DeepMind logoGemini 2.5 Pro Preview 06-05Google DeepMindMultimodal1.0M$1.25$10.0050.9%4—
14xAI logoGrok 4 FastxAIMultimodal2.0M$0.20$0.5050.4%6—
15Google DeepMind logoGemini 3 Flash PreviewGoogle DeepMindMultimodal1.0M$0.50$3.0049.1%24—
16Google DeepMind logoGemini 2.0 FlashGoogle DeepMindMultimodal1.0M$0.10$0.4048.0%20—
17Anthropic logoClaude Sonnet 4.6AnthropicMultimodal1.0M$3.00$15.0047.6%18—
18Anthropic logoClaude Sonnet 4AnthropicMultimodal1.0M$3.00$15.0044.6%27—
19OpenAI logoGPT-4.1 MiniOpenAIMultimodal1.0M$0.40$1.6044.5%16—
20Anthropic logoClaude Opus 4.6 (Fast)AnthropicMultimodal1.0M$30.00$150.0043.4%12—
21OpenAI logoGPT-4.1OpenAIMultimodal1.0M$2.00$8.0043.3%22—
22Anthropic logoClaude Sonnet 4.5AnthropicMultimodal1.0M$3.00$15.0042.1%21—
23Google DeepMind logoGemini 2.5 FlashGoogle DeepMindMultimodal1.0M$0.30$2.5040.0%25—
24OpenAI logoGPT-4.1 NanoOpenAIMultimodal1.0M$0.10$0.4035.2%14—
25Meta logoLlama 4 MaverickOSSMetaMultimodal1.0M$0.15$0.6028.0%17—
26openrouter logoAuto RouteropenrouterImage Generation2.0M$-1000000.00$-1000000.00—0—
27Anthropic logoClaude Opus 4.7AnthropicMultimodal1.0M$5.00$25.00—0—
28DeepSeek logoDeepSeek V4 FlashDeepSeekLLM1.0M$0.14$0.28—0—
29DeepSeek logoDeepSeek V4 ProDeepSeekLLM1.0M$0.43$0.87—0—
30Google DeepMind logoGemini 2.5 Flash Lite Preview 09-2025Google DeepMindMultimodal1.0M$0.10$0.40—0—
31Google DeepMind logoGemini 3.1 Flash Lite PreviewGoogle DeepMindMultimodal1.0M$0.25$1.50—5—
32Google DeepMind logoGemini 3.1 Pro Preview Custom ToolsGoogle DeepMindMultimodal1.0M$2.00$12.00—0—
33xAI logoGrok 4.1 FastxAIMultimodal2.0M$0.20$0.50—3—
34xAI logoGrok 4.20xAIMultimodal2.0M$1.25$2.50—0—
35xAI logoGrok 4.20 BetaxAIMultimodal2.0M$2.00$6.00—0—
36xAI logoGrok 4.20 Multi-AgentxAIMultimodal2.0M$2.00$6.00—0—
37xAI logoGrok 4.20 Multi-Agent BetaxAIMultimodal2.0M$2.00$6.00—0—
38xAI logoGrok 4.3xAIMultimodal1.0M$1.25$2.50—0—
39Google DeepMind logoLyria 3 Clip PreviewGoogle DeepMindLLM1.0MFreeFree—0—
40Google DeepMind logoLyria 3 Pro PreviewGoogle DeepMindLLM1.0MFreeFree—0—
41xiaomi logoMiMo-V2.5xiaomiMultimodal1.0M$0.40$2.00—0—
42xiaomi logoMiMo-V2.5-ProxiaomiLLM1.0M$1.00$3.00—0—
43minimax logoMiniMax M1minimaxLLM1.0M$0.40$2.20—1—
44minimax logoMiniMax-01OSSminimaxMultimodal1.0M$0.20$1.10—0—
45Amazon logoNova 2 LiteAmazonMultimodal1.0M$0.30$2.50—1—
46Amazon logoNova Premier 1.0AmazonMultimodal1.0M$2.50$12.50—0—
47openrouter logoOwl AlphaopenrouterLLM1.0MFreeFree—0—
48Alibaba Qwen logoQwen Plus 0728OSSAlibaba QwenLLM1.0M$0.26$0.78—0—
49Alibaba Qwen logoQwen Plus 0728 (thinking)OSSAlibaba QwenLLM1.0M$0.26$0.78—0—
50Alibaba Qwen logoQwen-PlusOSSAlibaba QwenLLM1.0M$0.26$0.78—0—
51Alibaba Qwen logoQwen3 Coder FlashOSSAlibaba QwenLLM1.0M$0.20$0.97—0—
52Alibaba Qwen logoQwen3 Coder PlusOSSAlibaba QwenLLM1.0M$0.65$3.25—0—
53Alibaba Qwen logoQwen3.5 Plus 2026-02-15OSSAlibaba QwenMultimodal1.0M$0.26$1.56—0—
54Alibaba Qwen logoQwen3.5 Plus 2026-04-20Alibaba QwenMultimodal1.0M$0.40$2.40—0—
55Alibaba Qwen logoQwen3.5-FlashOSSAlibaba QwenMultimodal1.0M$0.07$0.26—2—
56Alibaba Qwen logoQwen3.6 FlashAlibaba QwenMultimodal1.0M$0.25$1.50—0—
57Alibaba Qwen logoQwen3.6 Plus (free)Alibaba QwenMultimodal1.0MFreeFree—0—
58Alibaba Qwen logoQwen3.6 Plus Preview (free)OSSAlibaba QwenLLM1.0MFreeFree—0—

58 rows · click column headers to sort · pick up to 4 models to compare

Top 5 by category

Specialist leaders across every modality we track

LLMs

See all →
#1OpenAI logoGPT-5.5 Pro87.8%#2OpenAI logoGPT-5.585.0%#3Anthropic logoClaude Mythos Preview81.8%#4DeepSeek logoDeepSeek V3.2 Speciale78.2%#5Anthropic logoClaude Instant78.0%

Multimodal

See all →
#1OpenAI logoGPT-5 Chat81.9%#2Alibaba Qwen logoQwen3.5 397B A17B78.4%#3Google DeepMind logoGemini 2.5 Pro Preview 05-0676.9%#4OpenAI logoGPT-5.1-Codex-Max72.0%#5OpenAI logoo4 Mini High72.0%

Decision shortcuts

Jump from the leaderboard into comparison, provider, pricing, and benchmark paths

GPT-5.5 Pro vs Qwen3.5
Frontier comparison
Qwen3.5 vs DeepSeek V3.2
Open model comparison
OpenAI model lineup
Provider profile
Anthropic Claude lineup
Provider profile
Coding model pricing
Cost by use case
Reasoning model pricing
Cost by use case
SWE-bench Verified results
Coding benchmark
GPQA Diamond results
Reasoning benchmark

Frequently asked

Quick answers, sourced from our data

How many AI models does BenchGecko track?

BenchGecko currently tracks 994 AI models across 267 providers, each scored against up to 128 benchmarks. New models are added continuously and the full dataset refreshes daily.

What is the best AI model right now?

"Best" depends on the task. For general reasoning we rank by average score across 3+ benchmarks; for coding we surface ELO and SWE-bench specifically; for cost/performance we expose a "cheapest capable" metric. Use the filter bar and column sort to define your own winner, or pick a category from the mini tables below.

How is the average score calculated?

Average score is the arithmetic mean of a model's normalized benchmark scores, computed only when the model has at least one public benchmark result. Models with fewer than 3 benchmarks are excluded from the podium to avoid single-score outliers.

Where does BenchGecko get this data?

Model metadata and pricing come from OpenRouter's public API. Benchmarks are pulled from Epoch AI (CC-BY) and SWE-bench's public leaderboards. ELO ratings come from LMArena. Everything is re-normalized and cross-linked daily. See the methodology page for full provenance.

Can I use this data in my article or product?

Yes. All BenchGecko data is licensed CC BY 4.0 — attribution required. Use the "Cite this page" button above for ready-made APA, MLA, AP Style, BibTeX, and HTML embed snippets. The free API tier requires a backlink to benchgecko.ai.

How often does the data refresh?

Core model, pricing and benchmark data refreshes every 24 hours. Live status and pricing alerts can fire more frequently when upstream sources change. The "Live" pill on this page lights up when the last refresh was less than an hour ago.

Which models are open source?

Toggle the "Open Source" filter to see only models with OSS weights available. We currently count permissively — weights-available licenses like Llama, Qwen, DeepSeek, and Mistral Open count as OSS for this filter even when the weights come with usage restrictions.

How do I compare two models side by side?

Check the boxes next to any 2-4 models in the leaderboard above, then click "Compare →" in the pink action bar. You can also navigate directly to /compare/[modelA]-vs-[modelB] for shareable comparisons.

See also

Related model, benchmark, provider, and pricing indexes

Compare models
Side-by-side with charts
Benchmarks
Every eval we track
Providers
Where to run models
Pricing
Every model, every provider
AI agents
SWE-bench leaderboard
MCP servers
Tools for LLMs
AI economy
Valuations & funding
Methodology
How we score