Claude Opus 4.6 vs o3

Side by side. Every metric. Every benchmark.

Anthropic: average score 57.5, leads on 11/12 benchmarks
OpenAI: average score 55.2, leads on 1/12 benchmarks
| Type | Claude Opus 4.6 | o3 |
| --- | --- | --- |
| Provider | Anthropic | OpenAI |
| Average score | 57.5 | 55.2 |
| Input price | $5.00 | $2.00 |
| Output price | $25.00 | $8.00 |
| Context window | 1.0M tokens (~500 books) | 200K tokens (~100 books) |
| Released | 2026-02-04 | 2025-04-16 |
| Open source | Proprietary | Proprietary |
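
To make the price gap concrete, here is a minimal sketch that estimates the cost of a single request from the input and output prices listed above. It assumes the prices are in USD per 1 million tokens, which the page does not state; `PRICES` and `request_cost` are hypothetical names introduced only for this illustration.

```python
# Input and output prices copied from the comparison table.
# ASSUMPTION: prices are USD per 1,000,000 tokens (the unit is not given on the page).
PRICES = {
    "Claude Opus 4.6": (5.00, 25.00),
    "o3": (2.00, 8.00),
}

TOKENS_PER_PRICE_UNIT = 1_000_000  # assumed pricing unit


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under the assumed per-million pricing."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Example workload: 100,000 input tokens and 10,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 100_000, 10_000):.2f}")
```

Under that assumption, the example workload costs about $0.75 on Claude Opus 4.6 and about $0.28 on o3.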

12 benchmarks · Claude Opus 4.6: 11, o3: 1

| Benchmark | Category | Claude Opus 4.6 | o3 |
| --- | --- | --- | --- |
| ARC-AGI | reasoning | 94.0 | 60.8 |
| ARC-AGI-2 | reasoning | 69.2 | 6.5 |
| FrontierMath-2025-02-28-Private | math | 40.7 | 18.7 |
| FrontierMath-Tier-4-2025-07-01-Private | math | 22.9 | 2.1 |
| GPQA diamond | knowledge | 87.4 | 75.8 |
| GSO-Bench | coding | 33.3 | 8.8 |
| HLE | knowledge | 31.1 | 16.3 |
| OTIS Mock AIME 2024-2025 | math | 94.4 | 83.9 |
| SimpleBench | reasoning | 61.1 | 43.7 |
| SimpleQA Verified | knowledge | 46.5 | 53.0 |
| SWE-Bench verified | coding | 78.7 | 62.3 |
| WeirdML | coding | 77.9 | 52.4 |
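
The 11-to-1 split reported above can be checked directly from the benchmark scores. The sketch below assumes that a higher score is better on every benchmark and that equal scores would count as a tie; `SCORES` and `tally_wins` are hypothetical names used only here. It does not try to reproduce the 57.5 and 55.2 average scores, because the page does not say which results feed them.

```python
# Scores copied from the benchmark table: (benchmark, Claude Opus 4.6, o3).
SCORES = [
    ("ARC-AGI", 94.0, 60.8),
    ("ARC-AGI-2", 69.2, 6.5),
    ("FrontierMath-2025-02-28-Private", 40.7, 18.7),
    ("FrontierMath-Tier-4-2025-07-01-Private", 22.9, 2.1),
    ("GPQA diamond", 87.4, 75.8),
    ("GSO-Bench", 33.3, 8.8),
    ("HLE", 31.1, 16.3),
    ("OTIS Mock AIME 2024-2025", 94.4, 83.9),
    ("SimpleBench", 61.1, 43.7),
    ("SimpleQA Verified", 46.5, 53.0),
    ("SWE-Bench verified", 78.7, 62.3),
    ("WeirdML", 77.9, 52.4),
]


def tally_wins(rows):
    """Count head-to-head wins per model.

    ASSUMPTION: a higher score is better on every benchmark; equal scores are ties.
    """
    wins = {"Claude Opus 4.6": 0, "o3": 0, "tie": 0}
    for _benchmark, claude_score, o3_score in rows:
        if claude_score > o3_score:
            wins["Claude Opus 4.6"] += 1
        elif o3_score > claude_score:
            wins["o3"] += 1
        else:
            wins["tie"] += 1
    return wins


if __name__ == "__main__":
    print(tally_wins(SCORES))
```

On these numbers the only benchmark where o3 scores higher is SimpleQA Verified (53.0 vs 46.5).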