Beta
Home/Comparar/Grok 4 vs o3

Grok 4 vs o3

Lado a lado. Cada métrica. Cada benchmark.

xAI
54.8
puntuación promedio
9/19
benchmarks
OpenAI logoo3Ganador
OpenAI
55.2
puntuación promedio
9/19
benchmarks
TipoGrok 4o3
ProviderxAI logoxAIOpenAI logoOpenAI
puntuación promedio54.855.2
Precio de entrada$3.00$2.00
Precio de salida$15.00$8.00
Ventana de contexto256K tokens (~128 books)200K tokens (~100 books)
Publicado el2025-07-092025-04-16
Código abiertoProprietaryProprietary

19 benchmarks · Grok 4: 9, o3: 9

BenchmarkCategoríaGrok 4o3
Aider polyglotcoding79.681.3
ARC-AGIreasoning66.760.8
ARC-AGI-2reasoning16.06.5
DeepResearch Benchknowledge47.946.6
Fiction.LiveBenchknowledge94.488.9
FrontierMath-2025-02-28-Privatemath19.718.7
FrontierMath-Tier-4-2025-07-01-Privatemath2.12.1
GeoBenchknowledge45.074.0
GPQA diamondknowledge82.775.8
HELM — GPQAknowledge72.675.3
HELM — IFEvallanguage94.986.9
HELM — MMLU-Proknowledge85.185.9
HELM — Omni-MATHmath60.371.4
HELM — WildBenchreasoning79.786.1
Lech Mazur Writingknowledge80.783.9
OTIS Mock AIME 2024-2025math84.083.9
SimpleBenchreasoning52.643.7
SimpleQA Verifiedknowledge47.953.0
WeirdMLcoding45.752.4