
Grok 4 Fast vs Gemini 2.5 Pro

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Grok 4 Fast wins 3 of 5 shared benchmarks, leading in reasoning and knowledge.

Category leads
reasoning · Grok 4 Fast
knowledge · Grok 4 Fast
coding · Gemini 2.5 Pro
Hype vs Reality
Grok 4 Fast · #95 by perf · no signal · QUIET
Gemini 2.5 Pro · #61 by perf · no signal · QUIET
Best value
Grok 4 Fast offers 14.4x better value than Gemini 2.5 Pro.
Grok 4 Fast · 144.0 pts/$ · $0.35/M
Gemini 2.5 Pro · 10.0 pts/$ · $5.63/M
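The $/M figures above match the simple average of each model's input and output price from the pricing table, and 14.4x is the ratio of the two pts/$ values. A minimal sketch of that arithmetic follows. The averaging rule is an inference from the numbers, not something the page states, and `blended_price` is a hypothetical helper name. How the page derives the points behind pts/$ is not stated, so the sketch takes 144.0 and 10.0 as given.

```python
from decimal import Decimal, ROUND_HALF_UP


def blended_price(input_per_m: str, output_per_m: str) -> Decimal:
    """Average input and output $/1M tokens (assumed rule), rounded to cents."""
    avg = (Decimal(input_per_m) + Decimal(output_per_m)) / 2
    return avg.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# Input/output prices are taken from the pricing table on this page.
grok_price = blended_price("0.20", "0.50")
gemini_price = blended_price("1.25", "10.00")

# pts/$ values are copied as shown; their derivation is not given on the page.
value_ratio = Decimal("144.0") / Decimal("10.0")

print(grok_price, gemini_price, value_ratio)
```

Decimal with half-up rounding is used so that 5.625 displays as 5.63, as on the page. Python's built-in `round` rounds an exact half to the nearest even digit instead.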
Vendor risk
xAI · $250.0B · Tier 1 · Medium risk
Google DeepMind · $4.00T · Tier 1 · Low risk
Head to head
Grok 4 Fast · Gemini 2.5 Pro
ARC-AGI
Grok 4 Fast leads by +7.5
ARC-AGI · the original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern recognition tasks without memorization.
Grok 4 Fast
48.5
Gemini 2.5 Pro
41.0
ARC-AGI-2
Grok 4 Fast leads by +0.4
ARC-AGI-2 · the second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data.
Grok 4 Fast
5.3
Gemini 2.5 Pro
4.9
Fiction.LiveBench
Grok 4 Fast leads by +2.7
Fiction.LiveBench · a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.
Grok 4 Fast
94.4
Gemini 2.5 Pro
91.7
Lech Mazur Writing
Gemini 2.5 Pro leads by +4.9
Lech Mazur Writing · evaluates creative writing ability, assessing prose quality, narrative coherence, and stylistic sophistication.
Grok 4 Fast
81.1
Gemini 2.5 Pro
86.0
WeirdML
Gemini 2.5 Pro leads by +11.2
WeirdML · tests models on unusual and adversarial machine learning tasks that require creative problem-solving beyond standard patterns.
Grok 4 Fast
42.9
Gemini 2.5 Pro
54.0
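The margins and the 3-of-5 win count can be recomputed from the displayed scores. The sketch below is one way to do that, with `summarize` as a hypothetical helper name. From the rounded scores shown here, the WeirdML gap works out to 11.1 rather than the +11.2 displayed. One possible explanation, which is an assumption, is that the page computes margins from unrounded scores.

```python
# Scores copied from the head-to-head section: (Grok 4 Fast, Gemini 2.5 Pro).
SCORES = {
    "ARC-AGI": (48.5, 41.0),
    "ARC-AGI-2": (5.3, 4.9),
    "Fiction.LiveBench": (94.4, 91.7),
    "Lech Mazur Writing": (81.1, 86.0),
    "WeirdML": (42.9, 54.0),
}


def summarize(scores):
    """Return per-benchmark (leader, margin) and the win count per model."""
    leads = {}
    wins = {"Grok 4 Fast": 0, "Gemini 2.5 Pro": 0}
    for name, (grok, gemini) in scores.items():
        # A tie would be credited to Gemini 2.5 Pro here; none occur in this data.
        leader = "Grok 4 Fast" if grok > gemini else "Gemini 2.5 Pro"
        leads[name] = (leader, round(abs(grok - gemini), 1))
        wins[leader] += 1
    return leads, wins


leads, wins = summarize(SCORES)
for name, (leader, margin) in leads.items():
    print(f"{name}: {leader} leads by +{margin}")
print(wins)
```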
Full benchmark table
Benchmark · Grok 4 Fast · Gemini 2.5 Pro
ARC-AGI · 48.5 · 41.0
ARC-AGI-2 · 5.3 · 4.9
Fiction.LiveBench · 94.4 · 91.7
Lech Mazur Writing · 81.1 · 86.0
WeirdML · 42.9 · 54.0
Pricing · per 1M tokens · projected $/mo at 10M tokens
Model · Input · Output · Context · Projected $/mo
Grok 4 Fast · $0.20 · $0.50 · 2.0M tokens (~1,000 books) · $2.75
Gemini 2.5 Pro · $1.25 · $10.00 · 1.0M tokens (~524 books) · $34.38
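The page does not say how the 10M tokens are split between input and output. Both projected figures are reproduced by a 75% input / 25% output split, so the sketch below assumes that split. The split and the helper name `projected_monthly_cost` are assumptions, not something the page documents.

```python
from decimal import Decimal, ROUND_HALF_UP


def projected_monthly_cost(input_per_m: str, output_per_m: str,
                           total_m_tokens: str = "10",
                           input_share: str = "0.75") -> Decimal:
    """Monthly cost in dollars for a token volume given in millions.

    input_share is the assumed fraction of tokens billed at the input rate;
    the remainder is billed at the output rate.
    """
    total = Decimal(total_m_tokens)
    share = Decimal(input_share)
    cost = (total * share * Decimal(input_per_m)
            + total * (1 - share) * Decimal(output_per_m))
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


grok_monthly = projected_monthly_cost("0.20", "0.50")
gemini_monthly = projected_monthly_cost("1.25", "10.00")
print(grok_monthly, gemini_monthly)
```

Changing `input_share` shows how sensitive the gap is to workload shape: output-heavy usage widens it, since Gemini 2.5 Pro's output price is 20x Grok 4 Fast's while its input price is 6.25x.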