Compare · ModelsLive · 2 picked · head to head
GPT-5 Chat vs Grok 3 Mini Beta
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
GPT-5 Chat wins on 6/7 benchmarks
GPT-5 Chat wins 6 of 7 shared benchmarks. Leads in coding · arena · knowledge.
Category leads
coding·GPT-5 Chatarena·GPT-5 Chatknowledge·GPT-5 Chatlanguage·Grok 3 Mini Betamath·GPT-5 Chatreasoning·GPT-5 Chat
Hype vs Reality
Attention vs performance
GPT-5 Chat
#1 by perf·#1 by attention
Grok 3 Mini Beta
#28 by perf·no signal
Best value
Grok 3 Mini Beta
11.1x better value than GPT-5 Chat
GPT-5 Chat
14.6 pts/$
$5.63/M
Grok 3 Mini Beta
162.0 pts/$
$0.40/M
Vendor risk
Who is behind the model
OpenAI
$840.0B·Tier 1
xAI
$250.0B·Tier 1
Head to head
7 benchmarks · 2 models
GPT-5 ChatGrok 3 Mini Beta
Aider polyglot
GPT-5 Chat leads by +38.7
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
GPT-5 Chat
88.0
Grok 3 Mini Beta
49.3
Chatbot Arena Elo · Overall
GPT-5 Chat leads by +68.7
GPT-5 Chat
1426.0
Grok 3 Mini Beta
1357.4
HELM · GPQA
GPT-5 Chat leads by +11.6
GPT-5 Chat
79.1
Grok 3 Mini Beta
67.5
HELM · IFEval
Grok 3 Mini Beta leads by +7.6
GPT-5 Chat
87.5
Grok 3 Mini Beta
95.1
HELM · MMLU-Pro
GPT-5 Chat leads by +6.4
GPT-5 Chat
86.3
Grok 3 Mini Beta
79.9
HELM · Omni-MATH
GPT-5 Chat leads by +32.9
GPT-5 Chat
64.7
Grok 3 Mini Beta
31.8
HELM · WildBench
GPT-5 Chat leads by +20.6
GPT-5 Chat
85.7
Grok 3 Mini Beta
65.1
Full benchmark table
| Benchmark | GPT-5 Chat | Grok 3 Mini Beta |
|---|---|---|
Aider polyglot Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework. | 88.0 | 49.3 |
Chatbot Arena Elo · Overall | 1426.0 | 1357.4 |
HELM · GPQA | 79.1 | 67.5 |
HELM · IFEval | 87.5 | 95.1 |
HELM · MMLU-Pro | 86.3 | 79.9 |
HELM · Omni-MATH | 64.7 | 31.8 |
HELM · WildBench | 85.7 | 65.1 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $1.25 | $10.00 | 128K tokens (~64 books) | $34.38 | |
| $0.30 | $0.50 | 131K tokens (~66 books) | $3.50 |