Beta
Compare · ModelsLive · 2 picked · head to head

Gemini 3.1 Pro Preview vs Claude Mythos Preview

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Claude Mythos Preview wins 3 of 3 shared benchmarks. Leads in knowledge · coding.

Category leads
knowledge·Claude Mythos Previewcoding·Claude Mythos Preview
Hype vs Reality
Gemini 3.1 Pro Preview
#36 by perf·no signal
QUIET
Claude Mythos Preview
#2 by perf·#2 by attention
DESERVED
Best value
Gemini 3.1 Pro Preview
8.7 pts/$
$7.00/M
Claude Mythos Preview
no price
Vendor risk
Google DeepMind logo
Google DeepMind
$4.00T·Tier 1
Low risk
Anthropic logo
Anthropic
$380.0B·Tier 1
Medium risk
Head to head
Gemini 3.1 Pro PreviewClaude Mythos Preview
GPQA diamond
Claude Mythos Preview leads by +2.4
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
Gemini 3.1 Pro Preview
92.1
Claude Mythos Preview
94.5
SWE-Bench verified
Claude Mythos Preview leads by +18.3
Gemini 3.1 Pro Preview
75.6
Claude Mythos Preview
93.9
Terminal Bench
Claude Mythos Preview leads by +3.6
Terminal Bench · tests the ability to accomplish real-world tasks using terminal commands, evaluating shell scripting and CLI tool proficiency.
Gemini 3.1 Pro Preview
78.4
Claude Mythos Preview
82.0
Full benchmark table
BenchmarkGemini 3.1 Pro PreviewClaude Mythos Preview
GPQA diamond
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
92.194.5
SWE-Bench verified
75.693.9
Terminal Bench
Terminal Bench · tests the ability to accomplish real-world tasks using terminal commands, evaluating shell scripting and CLI tool proficiency.
78.482.0
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
Google DeepMind logoGemini 3.1 Pro Preview$2.00$12.001.0M tokens (~524 books)$45.00
Anthropic logoClaude Mythos Preview1.0M tokens (~500 books)