LIVE
Apr 7 · Claude Mythos Preview · Anthropic's most capable model arrives
Mar 31 · GPT-5.4 Nano launched on OpenAI
Mar 31 · GPT-5.4 Mini joins the OpenAI lineup
Mar 30 · Claude Opus 4.5 input price dropped · $5.00 per 1M tokens
Mar 30 · Mistral Small 4 available via Mistral AI
Mar 29 · Gemini 2.5 Pro scores 94.1% on MMLU
Mar 29 · Grok 4.20 Multi-Agent Beta enters agent rankings
Mar 28 · DeepSeek V3.2 output price dropped · $0.38 per 1M tokens
Mar 28 · 7 new MCP servers added in dev-tools category
Mar 27 · Claude Sonnet 4.6 released by Anthropic
Mar 27 · Claude Opus 4.6 released by Anthropic
Mar 26 · OTIS Mock AIME 2024-2025 benchmark added
Mar 26 · Claude Opus 4.1 pricing increased · $15/$75 per 1M tokens
Mar 25 · Grok 4.20 Beta launched by xAI
Mar 25 · Inception added as a tracked provider
Mar 24 · DeepSeek R1 0528 posted 87.2% on GPQA Diamond
Mar 24 · 3 new MCP servers in AI/ML category
Mar 23 · GPT-4o Audio Preview marked as deprecated
Mar 23 · Mistral Medium 3.1 input price cut to $0.40 per 1M tokens
Mar 22 · DeepSeek V3.2 Speciale released
Mar 22 · WeirdML benchmark now tracked on BenchGecko
Mar 20 · Nemotron 3 Super (120B) launched by NVIDIA
Mar 20 · Gemini 2.5 Flash Lite priced at $0.10/$0.40 per 1M tokens
Mar 18 · Mistral Large 3 2512 released by Mistral AI
Mar 18 · Grok Code Fast 1 added to agent rankings
Mar 16 · Claude Sonnet 4.5 scores 91.7% on MMLU
Mar 16 · 12 new MCP servers added across 5 categories
Mar 14 · GPT-5.4 Pro launched · OpenAI's new flagship
Mar 14 · GPT-5.4 standard tier released by OpenAI
Mar 12 · Grok 3 Mini marked as deprecated by xAI
Mar 12 · Llama 3.3 Nemotron Super 49B pricing dropped
Mar 10 · Liquid added as a tracked provider
Mar 10 · MiniMax M2.7 released by MiniMax
Mar 8 · Grok 4 posted 89.4% on GPQA Diamond
Mar 8 · LAMBADA benchmark scores now tracked
Mar 5 · Gemini 2.5 Flash output price reduced to $2.50 per 1M tokens
Mar 5 · Mercury 2 launched by Inception
Mar 3 · Qwen3.5-Flash released by Alibaba Qwen
Mar 3 · 5 new MCP servers added · finance and auth categories

The AI Economy, Tracked.

Pulse 20 · healthy
Bubble 278% · agitated
GPT-5.5 Pro +4.0
Open Source 16.2%

The Pulse
healthy
7d · +3 pts
Bubble Index · components
Valuation Premium · healthy · +2.1
Funding Acceleration · healthy · +1.5
Concentration Risk · healthy · 0
Revenue Quality · healthy · +1.4
Capex Gap · healthy · +0.3
Biggest move · Valuation Premium rose 2.1 pts
AI Bubble Index
Healthy · Frothy · Overheated · Bubble
Updated May 14

Cross signals

Full matrix
# · Provider · Model · Score · Price · Context · Benchmarks
1 · OpenAI · GPT-5.5 Pro · 99.9 · $30.00 · 400K · 3
2 · Anthropic · Claude Mythos Preview · 99.8 · - · 1000K · 14
3 · Alibaba Qwen · Qwen3.5 397B A17B · 96.3 · $0.39 · 262K · 11
4 · DeepSeek · DeepSeek V3.2 Speciale · 95.2 · $0.40 · 164K · 9
5 · OpenAI · GPT-5.4 Pro · 93.0 · $30.00 · 1050K · 8
6 · OpenAI · GPT-5.1-Codex-Max · 91.2 · $1.25 · 400K · 8
7 · Google DeepMind · Gemini 3.1 Pro Preview · 90.0 · $2.00 · 1049K · 23
8 · stepfun · Step 3.5 Flash · 89.5 · $0.10 · 262K · 10
9 · OpenAI · GPT-5 Chat · 89.0 · $1.25 · 128K · 7
10 · Alibaba Qwen · Qwen3.6 Plus · 88.7 · $0.33 · 1000K · 11
11 · DeepSeek · DeepSeek R1 Distill Qwen 14B · 88.3 · - · - · 11
12 · HA · Qwen2.5 72B Instruct Abliterated · 87.5 · - · - · 6
13 · z-ai · GLM 5.1 · 87.0 · $1.05 · 203K · 12
14 · OpenAI · GPT-5.2-Codex · 85.4 · $1.75 · 400K · 9
15 · Anthropic · Claude Instant · 84.6 · - · - · 4
16 · DeepSeek · DeepSeek-V2 (MoE-236B, May 2024) · 84.4 · - · - · 7
17 · OpenAI · GPT-5.4 · 83.4 · $2.50 · 1050K · 16
18 · Anthropic · Claude Opus 4.6 (Fast) · 83.3 · $30.00 · 1000K · 12
19 · OpenAI · GPT-5.1-Codex · 82.8 · $1.25 · 400K · 8
20 · xiaomi · MiMo-V2-Flash · 81.7 · $0.09 · 262K · 11
Full methodology
How often is BenchGecko data updated?

Model and benchmark data is refreshed daily from primary sources. Prices are pulled from each provider's API on rotation. Buzz signals are aggregated weekly. The Pulse recalculates at 00:00 UTC.

What is The Pulse?

A composite 0-100 score for the health of the AI economy. It combines the inverse of the Bubble Index, benchmark velocity, price compression, attention diversity, and supply-chain pressure into a single number. Lower is healthier.

How are benchmark scores normalized?

Each benchmark is min-max normalized across the full set of evaluated models. The podium averages each model's normalized scores over 3+ benchmarks so that no single test carries excessive weight.
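
The normalization described above can be sketched roughly as follows. This is a minimal illustration, not BenchGecko's actual code: the function names, the input layout, the 0-1 output scale, the handling of a benchmark where every model ties, and the sample data are all assumptions made for the example.

```python
from statistics import mean

MIN_BENCHMARKS = 3  # the text says "3+ benchmarks" per model


def min_max_normalize(raw):
    """Scale {model: raw_score} to the 0..1 range across all evaluated models."""
    lo, hi = min(raw.values()), max(raw.values())
    if hi == lo:
        # Assumption: a benchmark where every model ties carries no signal.
        return {model: 0.0 for model in raw}
    return {model: (score - lo) / (hi - lo) for model, score in raw.items()}


def podium_scores(results, min_benchmarks=MIN_BENCHMARKS):
    """Average normalized scores per model.

    results: {benchmark_name: {model_name: raw_score}}
    Models with fewer than `min_benchmarks` results are left out.
    """
    per_model = {}
    for raw in results.values():
        for model, norm in min_max_normalize(raw).items():
            per_model.setdefault(model, []).append(norm)
    return {
        model: mean(norms)
        for model, norms in per_model.items()
        if len(norms) >= min_benchmarks
    }


# Hypothetical data: model_z has only two results, so it is excluded.
results = {
    "bench_a": {"model_x": 90, "model_y": 70, "model_z": 50},
    "bench_b": {"model_x": 8, "model_y": 6, "model_z": 4},
    "bench_c": {"model_x": 40, "model_y": 20},
}
scores = podium_scores(results)
```

Because each benchmark is rescaled before averaging, a test reported on a 0-100 scale and one reported on a 0-10 scale contribute equally to the mean.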

Where does the pricing data come from?

Directly from provider APIs: OpenRouter, OpenAI, Anthropic, Google, xAI, DeepSeek, Mistral, and others. Each snapshot is stored with source attribution on the model's detail page.

Can I cite BenchGecko data?

Yes. Every page has a Share and Cite bar with APA, MLA, BibTeX, Chicago, and plain-text formats. Attribution is required on the free API tier and encouraged everywhere.

Sources · OpenRouter · Epoch AI · SWE-bench · MCP Registry · Chatbot Arena · HuggingFace · LiveBench · Artificial Analysis · SEAL · Aider
Updated 2h ago · 10+ reference sources · zero editorial content