Claude Opus 4.6 (Fast) vs GPT-5 vs GPT-5 Pro
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
GPT-5 Pro wins 5 of 8 shared benchmarks
GPT-5 Pro takes 5 of the 8 benchmarks in this comparison, with its leads concentrated in reasoning and math.
Category leads
knowledge · Claude Opus 4.6 (Fast)
reasoning · GPT-5 Pro
math · GPT-5 Pro
coding · GPT-5
Hype vs Reality
Attention vs performance
Claude Opus 4.6 (Fast) · #122 by performance · no attention signal
GPT-5 · #74 by performance · no attention signal
GPT-5 Pro · #124 by performance · no attention signal
Best value
GPT-5 · 15.1x better value than GPT-5 Pro

| Model | Value (pts/$) | Blended $/M |
|---|---|---|
| Claude Opus 4.6 (Fast) | 0.5 | $90.00 |
| GPT-5 | 9.7 | $5.63 |
| GPT-5 Pro | 0.6 | $67.50 |
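The pts/$ figure divides an average benchmark score by a blended token price, and the blended $/M column works out exactly to the simple mean of each model's input and output price. A minimal sketch of that arithmetic, with placeholder average scores (the page does not publish them) chosen only to reproduce the displayed ratios:

```python
# Value metric sketch: pts/$ = average benchmark score / blended $/1M tokens.
# Blended price assumed to be the simple mean of input and output price,
# which matches the $/M figures shown. The avg_score values are placeholders.
models = {
    "Claude Opus 4.6 (Fast)": {"input": 30.00, "output": 150.00, "avg_score": 45.0},
    "GPT-5":                  {"input": 1.25,  "output": 10.00,  "avg_score": 54.6},
    "GPT-5 Pro":              {"input": 15.00, "output": 120.00, "avg_score": 43.3},
}

for name, m in models.items():
    blended = (m["input"] + m["output"]) / 2   # $/1M tokens
    value = m["avg_score"] / blended           # benchmark points per dollar
    print(f"{name}: ${blended:.2f}/M -> {value:.1f} pts/$")
```

Under these placeholder inputs, the 15.1x headline is simply GPT-5's pts/$ divided by GPT-5 Pro's before rounding.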
Vendor risk
Who is behind the model
Anthropic (Claude Opus 4.6 (Fast)) · $380.0B valuation · Tier 1
OpenAI (GPT-5, GPT-5 Pro) · $840.0B valuation · Tier 1
Head to head
8 benchmarks · 3 models
Professional Reasoning · Finance
Claude Opus 4.6 (Fast) leads by +2.0
Claude Opus 4.6 (Fast) 53.3 · GPT-5 51.3 · GPT-5 Pro 51.1
Professional Reasoning · Legal
Claude Opus 4.6 (Fast) leads by +2.4
Claude Opus 4.6 (Fast) 52.3 · GPT-5 49.0 · GPT-5 Pro 49.9
ARC-AGI
GPT-5 Pro leads by +4.5
The original Abstraction and Reasoning Corpus, testing whether AI can solve novel visual pattern-recognition tasks without memorization.
GPT-5 65.7 · GPT-5 Pro 70.2
ARC-AGI-2
GPT-5 Pro leads by +8.4
The second iteration of the Abstraction and Reasoning Corpus, testing novel pattern recognition and abstract reasoning without prior training data.
GPT-5 9.9 · GPT-5 Pro 18.3
FrontierMath Tier 4 (Jul 2025)
GPT-5 Pro leads by +2.1
The most challenging tier of frontier mathematics, containing problems that push the absolute limits of AI mathematical reasoning.
GPT-5 12.5 · GPT-5 Pro 14.6
HLE
GPT-5 Pro leads by +6.6
Humanity's Last Exam · a reasoning benchmark designed to be the hardest public evaluation of AI. Questions span mathematics, physics, philosophy, and logic, curated to be at or beyond the frontier of human expert capability. Tested with and without tool augmentation; Claude Opus 4.7 scores 46.9% without tools and 54.7% with tools, making this one of the few benchmarks where the top score is below 60%.
GPT-5 21.6 · GPT-5 Pro 28.2
SimpleBench
GPT-5 Pro leads by +5.9
Tests fundamental reasoning with straightforward problems designed to expose gaps in basic logical and spatial thinking.
GPT-5 48.0 · GPT-5 Pro 53.9
WeirdML
GPT-5 leads by +0.3
Tests models on unusual and adversarial machine-learning tasks that require creative problem-solving beyond standard patterns.
GPT-5 60.7 · GPT-5 Pro 60.4
Full benchmark table
| Benchmark | Claude Opus 4.6 (Fast) | GPT-5 | GPT-5 Pro |
|---|---|---|---|
| Professional Reasoning · Finance | 53.3 | 51.3 | 51.1 |
| Professional Reasoning · Legal | 52.3 | 49.0 | 49.9 |
| ARC-AGI | — | 65.7 | 70.2 |
| ARC-AGI-2 | — | 9.9 | 18.3 |
| FrontierMath Tier 4 (Jul 2025) | — | 12.5 | 14.6 |
| HLE | — | 21.6 | 28.2 |
| SimpleBench | — | 48.0 | 53.9 |
| WeirdML | — | 60.7 | 60.4 |
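The 5-of-8 headline above is a straight tally of per-benchmark winners from this table. A small sketch of the count, where a model with no score on a benchmark (shown as —) simply doesn't compete on it:

```python
# Tally per-benchmark winners from the full benchmark table.
# A model missing from a row was not evaluated on that benchmark.
scores = {
    "Professional Reasoning · Finance": {"Claude Opus 4.6 (Fast)": 53.3, "GPT-5": 51.3, "GPT-5 Pro": 51.1},
    "Professional Reasoning · Legal":   {"Claude Opus 4.6 (Fast)": 52.3, "GPT-5": 49.0, "GPT-5 Pro": 49.9},
    "ARC-AGI":                          {"GPT-5": 65.7, "GPT-5 Pro": 70.2},
    "ARC-AGI-2":                        {"GPT-5": 9.9,  "GPT-5 Pro": 18.3},
    "FrontierMath Tier 4 (Jul 2025)":   {"GPT-5": 12.5, "GPT-5 Pro": 14.6},
    "HLE":                              {"GPT-5": 21.6, "GPT-5 Pro": 28.2},
    "SimpleBench":                      {"GPT-5": 48.0, "GPT-5 Pro": 53.9},
    "WeirdML":                          {"GPT-5": 60.7, "GPT-5 Pro": 60.4},
}

wins: dict[str, int] = {}
for bench, row in scores.items():
    winner = max(row, key=row.get)  # highest score among evaluated models
    wins[winner] = wins.get(winner, 0) + 1

print(wins)  # {'Claude Opus 4.6 (Fast)': 2, 'GPT-5 Pro': 5, 'GPT-5': 1}
```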
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| Claude Opus 4.6 (Fast) | $30.00 | $150.00 | 1.0M tokens (~500 books) | $600.00 |
| GPT-5 | $1.25 | $10.00 | 400K tokens (~200 books) | $34.38 |
| GPT-5 Pro | $15.00 | $120.00 | 400K tokens (~200 books) | $412.50 |
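The projected $/mo column is consistent with a 10M-token month split 3:1 between input and output tokens; that split is inferred from the numbers above rather than stated on the page. A sketch under that assumption:

```python
# Projected monthly cost at 10M tokens/month, assuming a 3:1 input:output
# split (an inference: it reproduces the table's $/mo figures exactly).
PRICING = {  # $ per 1M tokens: (input, output)
    "Claude Opus 4.6 (Fast)": (30.00, 150.00),
    "GPT-5": (1.25, 10.00),
    "GPT-5 Pro": (15.00, 120.00),
}

TOTAL_M = 10.0      # 10M tokens per month
INPUT_SHARE = 0.75  # 3:1 input:output

for model, (in_price, out_price) in PRICING.items():
    cost = TOTAL_M * (INPUT_SHARE * in_price + (1 - INPUT_SHARE) * out_price)
    print(f"{model}: ${cost:.2f}/mo")
# Claude Opus 4.6 (Fast): $600.00/mo
# GPT-5: $34.38/mo
# GPT-5 Pro: $412.50/mo
```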