Beta
Home/Comparar/o3 vs Grok 4

o3 vs Grok 4

Lado a lado. Cada métrica. Cada benchmark.

OpenAI logoo3Ganador
OpenAI
55.2
puntuación promedio
9/19
benchmarks
xAI
54.8
puntuación promedio
9/19
benchmarks
Tipoo3Grok 4
ProviderOpenAI logoOpenAIxAI logoxAI
puntuación promedio55.254.8
Precio de entrada$2.00$3.00
Precio de salida$8.00$15.00
Ventana de contexto200K tokens (~100 books)256K tokens (~128 books)
Publicado el2025-04-162025-07-09
Código abiertoProprietaryProprietary

19 benchmarks · o3: 9, Grok 4: 9

BenchmarkCategoríao3Grok 4
Aider polyglotcoding81.379.6
ARC-AGIreasoning60.866.7
ARC-AGI-2reasoning6.516.0
DeepResearch Benchknowledge46.647.9
Fiction.LiveBenchknowledge88.994.4
FrontierMath-2025-02-28-Privatemath18.719.7
FrontierMath-Tier-4-2025-07-01-Privatemath2.12.1
GeoBenchknowledge74.045.0
GPQA diamondknowledge75.882.7
HELM — GPQAknowledge75.372.6
HELM — IFEvallanguage86.994.9
HELM — MMLU-Proknowledge85.985.1
HELM — Omni-MATHmath71.460.3
HELM — WildBenchreasoning86.179.7
Lech Mazur Writingknowledge83.980.7
OTIS Mock AIME 2024-2025math83.984.0
SimpleBenchreasoning43.752.6
SimpleQA Verifiedknowledge53.047.9
WeirdMLcoding52.445.7