LIVETracking 994 AI models from 267 providers.
BenchGeckoBeta
AI
Providers
Economy
Mindshare
Infra
Learn
Labs
Developers
Gecko Tests·powered by GeckoBench · AI Bias, Censorship, IQ & PoliticsView Gecko TestsBuild your own chart
Home·Models
BenchGecko

BenchGecko is the data layer of the AI economy. Thousands of models with cross-provider pricing and daily price history. Hundreds of companies with valuations, funding timelines, and revenue estimates. Benchmark scores, developer adoption signals, agent leaderboards, and a changelog that captures every price drop, every launch, every deprecation as it happens. If it moved in AI today, it's already on BenchGecko.

@BenchGecko

Rankings

  • Leaderboard
  • All Models
  • AI Agents
  • Benchmarks
  • Compare Models
  • MCP Servers
  • Skills
  • Pricing

Top Models

  • Qwen3 VL 235B A22B Instruct
  • DeepSeek V3.1 Terminus
  • Qwen3 VL 235B A22B Thinking
  • Trinity Large Preview (free)
  • GLM 4.5 Air
  • GLM 4.7 Flash
  • MiniMax M1
  • GLM 4.5V
  • Gemma 3 12B
  • Nova 2 Lite

Providers

  • AI Research
  • AI21 Labs
  • AI4Bharat
  • aion-labs
  • alfredpros
  • Alibaba
  • Alimama Creative
  • Allen Dou
  • All Providers →

Benchmarks

  • Aider · Code Editing
  • Aider polyglot
  • ANLI
  • APEX-Agents
  • ARC AI2
  • ARC-AGI
  • ARC-AGI-2
  • Artificial Analysis · Agentic Index
  • All Benchmarks →

Resources

  • API Documentation
  • Methodology
  • Research
  • Gecko Mindshare
  • News
  • Changelog
  • Countries
  • Model Status
  • Brand Assets
  • Developers
  • Charts
  • Learn
  • Economy
  • Labs
  • Gecko Tests
© 2026 BenchGecko·The AI Economy, Tracked
Privacy·Terms·Press
Models · LeaderboardUpdated 8d ago · 994 models · 267 providers · 128 benchmarks

Every AI Model · Tracked

The most complete list of AI models you can actually use · 994 models, 267 providers, 128 benchmarks · all scored, priced, and ranked in one place.

Benchmark scoredCross-provider pricingContext · parameters · licensingRefreshed daily
Compare side by sideShowing 49 of 994
Total Models
994
Providers
267
Top Model
GPT-5.5 Pro
87.8%
Cheapest Capable
gpt-oss-20b
$0.03/M
PodiumMoversLeaderboardFAQ

Top 10 Overall

Ranked by average benchmark score · min 3 benchmarks

#1 GOLD3 benchmarks
OpenAI logo
GPT-5.5 Pro
OpenAI
87.8%
#2 SILVER6 benchmarks
OpenAI logo
GPT-5.5
OpenAI
85.0%
#3 BRONZE7 benchmarks
OpenAI logo
GPT-5 Chat
OpenAI
81.9%
#4Anthropic logo
Claude Mythos Preview
Anthropic
81.8%14 benchmarks
#5Alibaba Qwen logo
Qwen3.5 397B A17B
Alibaba Qwen
78.4%11 benchmarks
#6DeepSeek logo
DeepSeek V3.2 Speciale
DeepSeek
78.2%9 benchmarks
#7Anthropic logo
Claude Instant
Anthropic
78.0%4 benchmarks
#8stepfun logo
Step 3.5 Flash
stepfun
76.9%10 benchmarks
#9DeepSeek logo
DeepSeek-V2 (MoE-236B, May 2024)
DeepSeek
76.5%7 benchmarks
#10xiaomi logo
MiMo-V2-Flash
xiaomi
73.3%11 benchmarks

What's moving

New releases, coverage, price leaders, and ELO champions

#ModelProviderCategoryReleased
1ibm-granite logoGranite 4.1 8Bibm-graniteLLMApr 30, 2026
2xAI logoGrok 4.3xAIMultimodalApr 30, 2026
3NVIDIA logoNemotron 3 Nano Omni (free)NVIDIAMultimodalApr 28, 2026
4openrouter logoOwl AlphaopenrouterLLMApr 28, 2026
5Alibaba Qwen logoQwen3.5 Plus 2026-04-20Alibaba QwenMultimodalApr 27, 2026
6Alibaba Qwen logoQwen3.6 27BAlibaba QwenMultimodalApr 27, 2026
7Alibaba Qwen logoQwen3.6 35B A3BAlibaba QwenMultimodalApr 27, 2026
8Alibaba Qwen logoQwen3.6 FlashAlibaba QwenMultimodalApr 27, 2026
9Alibaba Qwen logoQwen3.6 Max PreviewAlibaba QwenLLMApr 27, 2026
10DeepSeek logoDeepSeek V4 FlashDeepSeekLLMApr 24, 2026
Provider
Filter
49 results

All Models

#ModelProviderCategoryContextIn $/MOut $/MAvgBenchmarksELO
1Google DeepMind logoGemini 2.5 Pro Preview 05-06Google DeepMindMultimodal1.0M$1.25$10.0076.9%1—
2Google DeepMind logoGemini 2.0 Flash LiteGoogle DeepMindMultimodal1.0M$0.07$0.3064.2%5—
3Google DeepMind logoGemma 4 31BOSSGoogle DeepMindMultimodal262K$0.13$0.3861.6%8—
4Google DeepMind logoGemini 3.1 Pro PreviewGoogle DeepMindMultimodal1.0M$2.00$12.0060.6%23—
5Google DeepMind logoGemini 3 ProGoogle DeepMindLLM—TBDTBD60.5%28—
6Google DeepMind logoGemini 2.5 Flash LiteGoogle DeepMindMultimodal1.0M$0.10$0.4059.1%8—
7Google DeepMind logoGemini 2.5 ProGoogle DeepMindMultimodal1.0M$1.25$10.0056.2%42—
8Google DeepMind logoGemini 2.0 ProGoogle DeepMindLLM—TBDTBD53.7%4—
9Google DeepMind logoGemini 2.5 Pro Preview 06-05Google DeepMindMultimodal1.0M$1.25$10.0050.9%4—
10Google DeepMind logoGemini 3 Flash PreviewGoogle DeepMindMultimodal1.0M$0.50$3.0049.1%24—
11Google DeepMind logoGemini 2.0 FlashGoogle DeepMindMultimodal1.0M$0.10$0.4048.0%20—
12Google DeepMind logoGemini 1.5 Flash (May 2024)Google DeepMindLLM—TBDTBD47.4%17—
13Google DeepMind logoGemma 3 27BOSSGoogle DeepMindMultimodal131K$0.08$0.1642.2%14—
14Google DeepMind logoGemma 3 27B (free)OSSGoogle DeepMindMultimodal131KFreeFree42.2%7—
15Google DeepMind logoGemini 1.5 Pro (Feb 2024)Google DeepMindLLM—TBDTBD41.3%20—
16Google DeepMind logoGemini 2.5 FlashGoogle DeepMindMultimodal1.0M$0.30$2.5040.0%25—
17Google DeepMind logoGemini 2.0 Flash Thinking (Jan 2025)Google DeepMindLLM—TBDTBD37.7%7—
18Google DeepMind logoGemma 2 2b ItOSSGoogle DeepMindLLM—TBDTBD36.4%12—
19Google DeepMind logoGemma 2 9BOSSGoogle DeepMindLLM8K$0.03$0.0936.0%13—
20Google DeepMind logoGemma 2 27BOSSGoogle DeepMindLLM8K$0.65$0.6532.9%11—
21Google DeepMind logoGemma 2BOSSGoogle DeepMindLLM—TBDTBD29.1%16—
22Google DeepMind logoGemini 1.0 ProGoogle DeepMindLLM—TBDTBD21.1%4—
23Google DeepMind logoGemma 2 2bOSSGoogle DeepMindLLM—TBDTBD10.4%6—
24Google logoBert Base CasedOSSGoogleLLM—TBDTBD—0—
25Google logoBert Base Multilingual CasedOSSGoogleLLM—TBDTBD—0—
26Google logoBert Base Multilingual UncasedOSSGoogleLLM—TBDTBD—0—
27Google logoBert Base UncasedOSSGoogleLLM—TBDTBD—0—
28Google DeepMind logoGemini 2.5 Flash Lite Preview 09-2025Google DeepMindMultimodal1.0M$0.10$0.40—0—
29Google DeepMind logoGemini 3.1 Flash Lite PreviewGoogle DeepMindMultimodal1.0M$0.25$1.50—5—
30Google DeepMind logoGemini 3.1 Pro Preview Custom ToolsGoogle DeepMindMultimodal1.0M$2.00$12.00—0—
31Google DeepMind logoGemma 3 12BOSSGoogle DeepMindMultimodal131K$0.04$0.13—1—
32Google DeepMind logoGemma 3 12B (free)OSSGoogle DeepMindMultimodal33KFreeFree—0—
33Google DeepMind logoGemma 3 1b ItOSSGoogle DeepMindLLM—TBDTBD—0—
34Google DeepMind logoGemma 3 4BOSSGoogle DeepMindMultimodal131K$0.04$0.08—1—
35Google DeepMind logoGemma 3 4B (free)OSSGoogle DeepMindMultimodal33KFreeFree—0—
36Google DeepMind logoGemma 3n 2B (free)OSSGoogle DeepMindLLM8KFreeFree—0—
37Google DeepMind logoGemma 3n 4BOSSGoogle DeepMindLLM33K$0.06$0.12—1—
38Google DeepMind logoGemma 3n 4B (free)OSSGoogle DeepMindLLM8KFreeFree—0—
39Google DeepMind logoGemma 3n E2B ItOSSGoogle DeepMindLLM—TBDTBD—0—
40Google DeepMind logoGemma 4 26B A4B OSSGoogle DeepMindMultimodal262K$0.06$0.33—0—
41Google DeepMind logoGemma 4 26B A4B (free)OSSGoogle DeepMindMultimodal262KFreeFree—4—
42Google DeepMind logoGemma 4 31B (free)OSSGoogle DeepMindMultimodal262KFreeFree—4—
43Google DeepMind logoLyria 3 Clip PreviewGoogle DeepMindLLM1.0MFreeFree—0—
44Google DeepMind logoLyria 3 Pro PreviewGoogle DeepMindLLM1.0MFreeFree—0—
45Google DeepMind logoNano Banana (Gemini 2.5 Flash Image)Google DeepMindImage Generation33K$0.30$2.50—0—
46Google DeepMind logoNano Banana 2 (Gemini 3.1 Flash Image Preview)Google DeepMindImage Generation66K$0.50$3.00—0—
47Google DeepMind logoNano Banana Pro (Gemini 3 Pro Image Preview)Google DeepMindImage Generation66K$2.00$12.00—0—
48Google DeepMind logoVit Base Patch16 224OSSGoogle DeepMindLLM—TBDTBD—0—
49Google DeepMind logoVit Base Patch16 224 In21kOSSGoogle DeepMindLLM—TBDTBD—0—

49 rows · click column headers to sort · pick up to 4 models to compare

Top 5 by category

Specialist leaders across every modality we track

LLMs

See all →
#1OpenAI logoGPT-5.5 Pro87.8%#2OpenAI logoGPT-5.585.0%#3Anthropic logoClaude Mythos Preview81.8%#4DeepSeek logoDeepSeek V3.2 Speciale78.2%#5Anthropic logoClaude Instant78.0%

Multimodal

See all →
#1OpenAI logoGPT-5 Chat81.9%#2Alibaba Qwen logoQwen3.5 397B A17B78.4%#3Google DeepMind logoGemini 2.5 Pro Preview 05-0676.9%#4OpenAI logoGPT-5.1-Codex-Max72.0%#5OpenAI logoo4 Mini High72.0%

Decision shortcuts

Jump from the leaderboard into comparison, provider, pricing, and benchmark paths

GPT-5.5 Pro vs Qwen3.5
Frontier comparison
Qwen3.5 vs DeepSeek V3.2
Open model comparison
OpenAI model lineup
Provider profile
Anthropic Claude lineup
Provider profile
Coding model pricing
Cost by use case
Reasoning model pricing
Cost by use case
SWE-bench Verified results
Coding benchmark
GPQA Diamond results
Reasoning benchmark

Frequently asked

Quick answers, sourced from our data

How many AI models does BenchGecko track?

BenchGecko currently tracks 994 AI models across 267 providers, each scored against up to 128 benchmarks. New models are added continuously and the full dataset refreshes daily.

What is the best AI model right now?

"Best" depends on the task. For general reasoning we rank by average score across 3+ benchmarks; for coding we surface ELO and SWE-bench specifically; for cost/performance we expose a "cheapest capable" metric. Use the filter bar and column sort to define your own winner, or pick a category from the mini tables below.

How is the average score calculated?

Average score is the arithmetic mean of a model's normalized benchmark scores, computed only when the model has at least one public benchmark result. Models with fewer than 3 benchmarks are excluded from the podium to avoid single-score outliers.

Where does BenchGecko get this data?

Model metadata and pricing come from OpenRouter's public API. Benchmarks are pulled from Epoch AI (CC-BY) and SWE-bench's public leaderboards. ELO ratings come from LMArena. Everything is re-normalized and cross-linked daily. See the methodology page for full provenance.

Can I use this data in my article or product?

Yes. All BenchGecko data is licensed CC BY 4.0 — attribution required. Use the "Cite this page" button above for ready-made APA, MLA, AP Style, BibTeX, and HTML embed snippets. The free API tier requires a backlink to benchgecko.ai.

How often does the data refresh?

Core model, pricing and benchmark data refreshes every 24 hours. Live status and pricing alerts can fire more frequently when upstream sources change. The "Live" pill on this page lights up when the last refresh was less than an hour ago.

Which models are open source?

Toggle the "Open Source" filter to see only models with OSS weights available. We currently count permissively — weights-available licenses like Llama, Qwen, DeepSeek, and Mistral Open count as OSS for this filter even when the weights come with usage restrictions.

How do I compare two models side by side?

Check the boxes next to any 2-4 models in the leaderboard above, then click "Compare →" in the pink action bar. You can also navigate directly to /compare/[modelA]-vs-[modelB] for shareable comparisons.

See also

Related model, benchmark, provider, and pricing indexes

Compare models
Side-by-side with charts
Benchmarks
Every eval we track
Providers
Where to run models
Pricing
Every model, every provider
AI agents
SWE-bench leaderboard
MCP servers
Tools for LLMs
AI economy
Valuations & funding
Methodology
How we score