
Claude Mythos Preview vs Kimi K2 Thinking

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Claude Mythos Preview wins 2 of 2 shared benchmarks. Leads in knowledge · coding.

Category leads
knowledge · Claude Mythos Preview
coding · Claude Mythos Preview
Hype vs Reality
Claude Mythos Preview
#2 by perf·#2 by attention
DESERVED
Kimi K2 Thinking
#77 by perf·no signal
QUIET
Best value
Claude Mythos Preview
no price
Kimi K2 Thinking
34.4 pts/$
$1.55/M
Vendor risk
Anthropic
$380.0B·Tier 1
Medium risk
moonshotai
private · undisclosed
Unknown
Head to head
Claude Mythos Preview · Kimi K2 Thinking
GPQA diamond
Claude Mythos Preview leads by +15.5
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
Claude Mythos Preview
94.5
Kimi K2 Thinking
79.0
Terminal Bench
Claude Mythos Preview leads by +46.3
Terminal Bench · tests the ability to accomplish real-world tasks using terminal commands, evaluating shell scripting and CLI tool proficiency.
Claude Mythos Preview
82.0
Kimi K2 Thinking
35.7
Full benchmark table
Benchmark · Claude Mythos Preview · Kimi K2 Thinking
GPQA diamond · 94.5 · 79.0
Terminal Bench · 82.0 · 35.7
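The lead margins and the "2 of 2" win count can be reproduced from the two shared benchmark scores. The sketch below assumes a lead is the plain difference between the two scores, which matches the +15.5 and +46.3 figures shown; the function name `head_to_head` and the data layout are illustrative, not part of the page.

```python
# Scores copied from the benchmark table above.
SCORES = {
    "GPQA diamond": {"Claude Mythos Preview": 94.5, "Kimi K2 Thinking": 79.0},
    "Terminal Bench": {"Claude Mythos Preview": 82.0, "Kimi K2 Thinking": 35.7},
}


def head_to_head(scores, model_a, model_b):
    """Return (benchmark, leader, margin) rows for benchmarks both models share.

    Assumption: the margin is the simple score difference, rounded to one decimal.
    """
    rows = []
    for benchmark, by_model in scores.items():
        if model_a not in by_model or model_b not in by_model:
            continue  # only shared benchmarks count
        diff = round(by_model[model_a] - by_model[model_b], 1)
        if diff > 0:
            leader = model_a
        elif diff < 0:
            leader = model_b
        else:
            leader = None  # tie
        rows.append((benchmark, leader, abs(diff)))
    return rows


rows = head_to_head(SCORES, "Claude Mythos Preview", "Kimi K2 Thinking")
wins = sum(1 for _, leader, _ in rows if leader == "Claude Mythos Preview")
for benchmark, leader, margin in rows:
    print(f"{benchmark}: {leader} leads by +{margin}")
print(f"Claude Mythos Preview wins {wins} of {len(rows)} shared benchmarks")
```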
Pricing · per 1M tokens · projected $/mo at 10M tokens
Model · Input · Output · Context · Projected $/mo
Claude Mythos Preview · no price · no price · 1.0M tokens (~500 books) · no price
Kimi K2 Thinking · $0.60 · $2.50 · 262K tokens (~131 books) · $10.75
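The page does not state how the projected monthly figure is derived. As a hedged sketch, a 10M-token month split 75% input and 25% output reproduces the $10.75 shown for Kimi K2 Thinking, and a simple average of the input and output prices gives the $1.55/M shown under "Best value". Both the split and the averaging are assumptions inferred from the arithmetic, and `projected_monthly_cost` is a hypothetical helper, not the site's formula.

```python
def projected_monthly_cost(input_price, output_price, total_tokens_m, input_share):
    """Estimate monthly spend in dollars.

    input_price, output_price: dollars per 1M tokens.
    total_tokens_m: monthly volume in millions of tokens.
    input_share: fraction of tokens that are input (assumed, not from the page).
    """
    input_tokens_m = total_tokens_m * input_share
    output_tokens_m = total_tokens_m * (1 - input_share)
    return round(input_tokens_m * input_price + output_tokens_m * output_price, 2)


# Kimi K2 Thinking prices from the table: $0.60 input, $2.50 output per 1M tokens.
kimi_monthly = projected_monthly_cost(0.60, 2.50, total_tokens_m=10, input_share=0.75)

# Simple (unweighted) average of input and output price per 1M tokens.
kimi_average_price = round((0.60 + 2.50) / 2, 2)

print(f"Projected $/mo at 10M tokens: ${kimi_monthly:.2f}")
print(f"Average price: ${kimi_average_price:.2f}/M")
```

A different input share changes the projection noticeably, so treat the 75/25 split as a placeholder and substitute your own traffic mix.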