o3 vs Grok 4

Side by side. Every metric. Every benchmark.

o3 (OpenAI): winner
Average score: 55.2
Benchmarks won: 9/19

Grok 4 (xAI)
Average score: 54.8
Benchmarks won: 9/19
Attribute | o3 | Grok 4
Provider | OpenAI | xAI
Average score | 55.2 | 54.8
Input price | $2.00 | $3.00
Output price | $8.00 | $15.00
Context window | 200K tokens (~100 books) | 256K tokens (~128 books)
Released | 2025-04-16 | 2025-07-09
Open source | No (proprietary) | No (proprietary)
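
The input and output prices above can be turned into a rough per-request cost. The sketch below is illustrative only: it assumes the listed prices are in USD per 1 million tokens (the table does not state the unit), and `estimate_cost` and the example token counts are hypothetical, not something the comparison page provides.

```python
# Listed prices copied from the comparison table (USD).
# ASSUMPTION: prices are quoted per 1,000,000 tokens; the page does not state the unit.
PRICES = {
    "o3": {"input": 2.00, "output": 8.00},
    "Grok 4": {"input": 3.00, "output": 15.00},
}
TOKENS_PER_PRICE_UNIT = 1_000_000  # assumed billing unit


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 100,000 input tokens and 20,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${estimate_cost(model, 100_000, 20_000):.2f}")
```

Under that assumption, the output price dominates for generation-heavy workloads, which is where the gap between the two models is widest ($8.00 vs $15.00).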

19 benchmarks · o3: 9 wins, Grok 4: 9 wins, 1 tie

Benchmark | Category | o3 | Grok 4
Aider polyglot | coding | 81.3 | 79.6
ARC-AGI | reasoning | 60.8 | 66.7
ARC-AGI-2 | reasoning | 6.5 | 16.0
DeepResearch Bench | knowledge | 46.6 | 47.9
Fiction.LiveBench | knowledge | 88.9 | 94.4
FrontierMath-2025-02-28-Private | math | 18.7 | 19.7
FrontierMath-Tier-4-2025-07-01-Private | math | 2.1 | 2.1
GeoBench | knowledge | 74.0 | 45.0
GPQA diamond | knowledge | 75.8 | 82.7
HELM - GPQA | knowledge | 75.3 | 72.6
HELM - IFEval | language | 86.9 | 94.9
HELM - MMLU-Pro | knowledge | 85.9 | 85.1
HELM - Omni-MATH | math | 71.4 | 60.3
HELM - WildBench | reasoning | 86.1 | 79.7
Lech Mazur Writing | knowledge | 83.9 | 80.7
OTIS Mock AIME 2024-2025 | math | 83.9 | 84.0
SimpleBench | reasoning | 43.7 | 52.6
SimpleQA Verified | knowledge | 53.0 | 47.9
WeirdML | coding | 52.4 | 45.7
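
The 9/9 split can be rechecked directly from the table. A minimal sketch, assuming a higher score is better on every benchmark and that equal scores count as a tie; `SCORES` and `tally` are illustrative names, with values copied from the rows above.

```python
# (o3 score, Grok 4 score) per benchmark, copied from the table above.
SCORES = {
    "Aider polyglot": (81.3, 79.6),
    "ARC-AGI": (60.8, 66.7),
    "ARC-AGI-2": (6.5, 16.0),
    "DeepResearch Bench": (46.6, 47.9),
    "Fiction.LiveBench": (88.9, 94.4),
    "FrontierMath-2025-02-28-Private": (18.7, 19.7),
    "FrontierMath-Tier-4-2025-07-01-Private": (2.1, 2.1),
    "GeoBench": (74.0, 45.0),
    "GPQA diamond": (75.8, 82.7),
    "HELM - GPQA": (75.3, 72.6),
    "HELM - IFEval": (86.9, 94.9),
    "HELM - MMLU-Pro": (85.9, 85.1),
    "HELM - Omni-MATH": (71.4, 60.3),
    "HELM - WildBench": (86.1, 79.7),
    "Lech Mazur Writing": (83.9, 80.7),
    "OTIS Mock AIME 2024-2025": (83.9, 84.0),
    "SimpleBench": (43.7, 52.6),
    "SimpleQA Verified": (53.0, 47.9),
    "WeirdML": (52.4, 45.7),
}


def tally(scores):
    """Count wins per model and list tied benchmarks.

    ASSUMPTION: a higher score is better on every benchmark.
    """
    o3_wins = sum(1 for o3, grok in scores.values() if o3 > grok)
    grok_wins = sum(1 for o3, grok in scores.values() if grok > o3)
    ties = [name for name, (o3, grok) in scores.items() if o3 == grok]
    return o3_wins, grok_wins, ties


if __name__ == "__main__":
    o3_wins, grok_wins, ties = tally(SCORES)
    print(f"o3: {o3_wins}, Grok 4: {grok_wins}, ties: {ties}")
```

Running the tally on these rows gives 9 wins each and one tie (FrontierMath-Tier-4-2025-07-01-Private, 2.1 vs 2.1), which accounts for all 19 benchmarks.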