Compare · ModelsLive · 2 picked · head to head
Grok 4 Fast vs Gemini 2.5 Pro
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Grok 4 Fast wins on 3/5 benchmarks
Grok 4 Fast wins 3 of 5 shared benchmarks. Leads in reasoning · knowledge.
Category leads
reasoning·Grok 4 Fastknowledge·Grok 4 Fastcoding·Gemini 2.5 Pro
Hype vs Reality
Attention vs performance
Grok 4 Fast
#95 by perf·no signal
Gemini 2.5 Pro
#61 by perf·no signal
Best value
Grok 4 Fast
14.4x better value than Gemini 2.5 Pro
Grok 4 Fast
144.0 pts/$
$0.35/M
Gemini 2.5 Pro
10.0 pts/$
$5.63/M
Vendor risk
Who is behind the model
xAI
$250.0B·Tier 1
Google DeepMind
$4.00T·Tier 1
Head to head
5 benchmarks · 2 models
Grok 4 FastGemini 2.5 Pro
ARC-AGI
Grok 4 Fast leads by +7.5
ARC-AGI · the original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern recognition tasks without memorization.
Grok 4 Fast
48.5
Gemini 2.5 Pro
41.0
ARC-AGI-2
Grok 4 Fast leads by +0.4
ARC-AGI-2 · the second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data.
Grok 4 Fast
5.3
Gemini 2.5 Pro
4.9
Fiction.LiveBench
Grok 4 Fast leads by +2.7
Fiction.LiveBench · a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.
Grok 4 Fast
94.4
Gemini 2.5 Pro
91.7
Lech Mazur Writing
Gemini 2.5 Pro leads by +4.9
Lech Mazur Writing · evaluates creative writing ability, assessing prose quality, narrative coherence, and stylistic sophistication.
Grok 4 Fast
81.1
Gemini 2.5 Pro
86.0
WeirdML
Gemini 2.5 Pro leads by +11.2
WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.
Grok 4 Fast
42.9
Gemini 2.5 Pro
54.0
Full benchmark table
| Benchmark | Grok 4 Fast | Gemini 2.5 Pro |
|---|---|---|
ARC-AGI ARC-AGI · the original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern recognition tasks without memorization. | 48.5 | 41.0 |
ARC-AGI-2 ARC-AGI-2 · the second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data. | 5.3 | 4.9 |
Fiction.LiveBench Fiction.LiveBench · a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination. | 94.4 | 91.7 |
Lech Mazur Writing Lech Mazur Writing · evaluates creative writing ability, assessing prose quality, narrative coherence, and stylistic sophistication. | 81.1 | 86.0 |
WeirdML WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns. | 42.9 | 54.0 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.20 | $0.50 | 2.0M tokens (~1,000 books) | $2.75 | |
| $1.25 | $10.00 | 1.0M tokens (~524 books) | $34.38 |