Beta
Home/Comparer/Grok 4 vs o3

Grok 4 vs o3

Côte à côte. Chaque métrique. Chaque benchmark.

xAI
54.8
score moyen
9/19
benchmarks
OpenAI logoo3Gagnant
OpenAI
55.2
score moyen
9/19
benchmarks
TypeGrok 4o3
ProviderxAI logoxAIOpenAI logoOpenAI
score moyen54.855.2
Prix d'entrée$3.00$2.00
Prix de sortie$15.00$8.00
Fenêtre de contexte256K tokens (~128 books)200K tokens (~100 books)
Sorti le2025-07-092025-04-16
Code source ouvertProprietaryProprietary

19 benchmarks · Grok 4: 9, o3: 9

BenchmarkCatégorieGrok 4o3
Aider polyglotcoding79.681.3
ARC-AGIreasoning66.760.8
ARC-AGI-2reasoning16.06.5
DeepResearch Benchknowledge47.946.6
Fiction.LiveBenchknowledge94.488.9
FrontierMath-2025-02-28-Privatemath19.718.7
FrontierMath-Tier-4-2025-07-01-Privatemath2.12.1
GeoBenchknowledge45.074.0
GPQA diamondknowledge82.775.8
HELM — GPQAknowledge72.675.3
HELM — IFEvallanguage94.986.9
HELM — MMLU-Proknowledge85.185.9
HELM — Omni-MATHmath60.371.4
HELM — WildBenchreasoning79.786.1
Lech Mazur Writingknowledge80.783.9
OTIS Mock AIME 2024-2025math84.083.9
SimpleBenchreasoning52.643.7
SimpleQA Verifiedknowledge47.953.0
WeirdMLcoding45.752.4