Beta
Compare · ModelsLive · 2 picked · head to head

GPT-5 Chat vs Grok 3 Mini Beta

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

GPT-5 Chat wins 6 of 7 shared benchmarks. Leads in coding · arena · knowledge.

Category leads
coding·GPT-5 Chatarena·GPT-5 Chatknowledge·GPT-5 Chatlanguage·Grok 3 Mini Betamath·GPT-5 Chatreasoning·GPT-5 Chat
Hype vs Reality
GPT-5 Chat
#1 by perf·#1 by attention
DESERVED
Grok 3 Mini Beta
#28 by perf·no signal
QUIET
Best value
11.1x better value than GPT-5 Chat
GPT-5 Chat
14.6 pts/$
$5.63/M
Grok 3 Mini Beta
162.0 pts/$
$0.40/M
Vendor risk
OpenAI logo
OpenAI
$840.0B·Tier 1
Medium risk
xAI logo
xAI
$250.0B·Tier 1
Medium risk
Head to head
GPT-5 ChatGrok 3 Mini Beta
Aider polyglot
GPT-5 Chat leads by +38.7
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
GPT-5 Chat
88.0
Grok 3 Mini Beta
49.3
Chatbot Arena Elo · Overall
GPT-5 Chat leads by +68.7
GPT-5 Chat
1426.0
Grok 3 Mini Beta
1357.4
HELM · GPQA
GPT-5 Chat leads by +11.6
GPT-5 Chat
79.1
Grok 3 Mini Beta
67.5
HELM · IFEval
Grok 3 Mini Beta leads by +7.6
GPT-5 Chat
87.5
Grok 3 Mini Beta
95.1
HELM · MMLU-Pro
GPT-5 Chat leads by +6.4
GPT-5 Chat
86.3
Grok 3 Mini Beta
79.9
HELM · Omni-MATH
GPT-5 Chat leads by +32.9
GPT-5 Chat
64.7
Grok 3 Mini Beta
31.8
HELM · WildBench
GPT-5 Chat leads by +20.6
GPT-5 Chat
85.7
Grok 3 Mini Beta
65.1
Full benchmark table
BenchmarkGPT-5 ChatGrok 3 Mini Beta
Aider polyglot
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
88.049.3
Chatbot Arena Elo · Overall
1426.01357.4
HELM · GPQA
79.167.5
HELM · IFEval
87.595.1
HELM · MMLU-Pro
86.379.9
HELM · Omni-MATH
64.731.8
HELM · WildBench
85.765.1
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
OpenAI logoGPT-5 Chat$1.25$10.00128K tokens (~64 books)$34.38
xAI logoGrok 3 Mini Beta$0.30$0.50131K tokens (~66 books)$3.50