Claude Mythos Preview vs Gemini 3.1 Pro Preview

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Claude Mythos Preview wins 3 of 3 shared benchmarks and leads in both categories: knowledge and coding.

Category leads
knowledge · Claude Mythos Preview
coding · Claude Mythos Preview
Hype vs Reality
Claude Mythos Preview · #2 by performance · #2 by attention · DESERVED
Gemini 3.1 Pro Preview · #36 by performance · no attention signal · QUIET
Best value
Claude Mythos Preview · no price
Gemini 3.1 Pro Preview · 8.7 pts/$ · $7.00/M
Vendor risk
Anthropic · $380.0B · Tier 1 · Medium risk
Google DeepMind · $4.00T · Tier 1 · Low risk
Head to head
Claude Mythos Preview · Gemini 3.1 Pro Preview
GPQA diamond
Claude Mythos Preview leads by +2.4
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
Claude Mythos Preview · 94.5
Gemini 3.1 Pro Preview · 92.1
SWE-Bench verified
Claude Mythos Preview leads by +18.3
Claude Mythos Preview · 93.9
Gemini 3.1 Pro Preview · 75.6
Terminal Bench
Claude Mythos Preview leads by +3.6
Terminal Bench · tests the ability to accomplish real-world tasks using terminal commands, evaluating shell scripting and CLI tool proficiency.
Claude Mythos Preview · 82.0
Gemini 3.1 Pro Preview · 78.4
Full benchmark table
Benchmark · Claude Mythos Preview · Gemini 3.1 Pro Preview
GPQA diamond · 94.5 · 92.1
SWE-Bench verified · 93.9 · 75.6
Terminal Bench · 82.0 · 78.4
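The "leads by" margins and the 3-of-3 win count can be recomputed from the scores above. The following is a minimal sketch, not the page's own method: the function names are hypothetical, and rounding to one decimal is an assumption chosen to match how the page displays margins.

```python
# Scores copied from the benchmark table:
# (Claude Mythos Preview, Gemini 3.1 Pro Preview).
SCORES = {
    "GPQA diamond": (94.5, 92.1),
    "SWE-Bench verified": (93.9, 75.6),
    "Terminal Bench": (82.0, 78.4),
}


def lead_margins(scores):
    """Return {benchmark: margin}; positive means the first model leads."""
    # Rounding to one decimal matches the page and hides float noise.
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


def count_wins(margins):
    """Count benchmarks where the first model has a positive margin."""
    return sum(1 for m in margins.values() if m > 0)


margins = lead_margins(SCORES)
for name, margin in margins.items():
    print(f"{name}: {margin:+.1f}")
print(f"wins: {count_wins(margins)} of {len(margins)}")
```

With the listed scores this reproduces the margins shown on the page (+2.4, +18.3, +3.6) and a count of 3 of 3.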
Pricing · per 1M tokens · projected $/mo at 10M tokens
Model · Input · Output · Context · Projected $/mo
Claude Mythos Preview · no price · no price · 1.0M tokens (~500 books) · no price
Gemini 3.1 Pro Preview · $2.00 · $12.00 · 1.0M tokens (~524 books) · $45.00
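The page does not say how the 10M monthly tokens are split between input and output. As a rough sketch, a split of 75% input and 25% output is one assumption that reproduces the listed $45.00 for Gemini 3.1 Pro Preview. Likewise, a plain average of the input and output prices matches the $7.00/M shown under Best value. Both function names and the `input_share` parameter are hypothetical, and the site's actual formula may differ.

```python
def projected_monthly_cost(input_price, output_price, total_tokens_m, input_share):
    """Estimate monthly cost in dollars.

    input_price / output_price: dollars per 1M tokens.
    total_tokens_m: monthly volume in millions of tokens.
    input_share: fraction of tokens that are input (an assumption, not from the page).
    """
    input_m = total_tokens_m * input_share
    output_m = total_tokens_m - input_m
    return input_m * input_price + output_m * output_price


def simple_blended_price(input_price, output_price):
    """Unweighted average of input and output price per 1M tokens."""
    return (input_price + output_price) / 2


# Gemini 3.1 Pro Preview prices from the pricing table.
cost = projected_monthly_cost(2.00, 12.00, total_tokens_m=10, input_share=0.75)
blended = simple_blended_price(2.00, 12.00)
print(f"projected: ${cost:.2f}/mo, blended: ${blended:.2f}/M")
```

Changing `input_share` shows how sensitive the projection is to the workload mix: an even 50/50 split gives $70.00 for the same 10M tokens.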