o4 Mini vs o3

Side by side. Every metric. Every benchmark.

o4 Mini (OpenAI)
Average score: 53.2
Benchmarks: 7/24

o3 (OpenAI), winner
Average score: 55.2
Benchmarks: 17/24
| Attribute | o4 Mini | o3 |
| --- | --- | --- |
| Provider | OpenAI | OpenAI |
| Average score | 53.2 | 55.2 |
| Input price | $1.10 | $2.00 |
| Output price | $4.40 | $8.00 |
| Context window | 200K tokens (~100 books) | 200K tokens (~100 books) |
| Released | 2025-04-16 | 2025-04-16 |
| License | Proprietary | Proprietary |
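
The listed prices can be turned into a rough per-request cost estimate. The sketch below assumes the prices are US dollars per one million tokens; the page does not state the unit, so treat that as an assumption. The helper name `estimate_cost` and the token counts are hypothetical and only serve the illustration.

```python
# Prices as listed in the comparison table above.
# ASSUMPTION: USD per 1,000,000 tokens; the page does not state the unit.
PRICES = {
    "o4 Mini": {"input": 1.10, "output": 4.40},
    "o3": {"input": 2.00, "output": 8.00},
}
TOKENS_PER_PRICE_UNIT = 1_000_000  # assumed pricing unit


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${estimate_cost(model, 10_000, 2_000):.4f}")
```

Whatever the unit turns out to be, o3 costs about 1.8 times as much as o4 Mini for any mix of input and output tokens, because both of its prices are higher by the same factor (2.00 / 1.10 = 8.00 / 4.40 ≈ 1.82).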

24 benchmarks · o4 Mini: 7, o3: 17

| Benchmark | Category | o4 Mini | o3 |
| --- | --- | --- | --- |
| Aider polyglot | coding | 72.0 | 81.3 |
| ARC-AGI | reasoning | 58.7 | 60.8 |
| ARC-AGI-2 | reasoning | 6.1 | 6.5 |
| CadEval | coding | 62.0 | 74.0 |
| Fiction.LiveBench | knowledge | 77.8 | 88.9 |
| FrontierMath-2025-02-28-Private | math | 24.8 | 18.7 |
| FrontierMath-Tier-4-2025-07-01-Private | math | 6.3 | 2.1 |
| GeoBench | knowledge | 64.0 | 74.0 |
| GPQA diamond | knowledge | 72.8 | 75.8 |
| GSO-Bench | coding | 3.6 | 8.8 |
| HELM - GPQA | knowledge | 73.5 | 75.3 |
| HELM - IFEval | language | 92.9 | 86.9 |
| HELM - MMLU-Pro | knowledge | 82.0 | 85.9 |
| HELM - Omni-MATH | math | 72.0 | 71.4 |
| HELM - WildBench | reasoning | 85.4 | 86.1 |
| HLE | knowledge | 13.9 | 16.3 |
| Lech Mazur Writing | knowledge | 75.0 | 83.9 |
| MATH level 5 | math | 97.8 | 97.8 |
| OTIS Mock AIME 2024-2025 | math | 81.7 | 83.9 |
| SimpleBench | reasoning | 26.4 | 43.7 |
| SimpleQA Verified | knowledge | 23.9 | 53.0 |
| SWE-Bench Verified (Bash Only) | coding | 45.0 | 58.4 |
| VPCT | knowledge | 36.3 | 28.0 |
| WeirdML | coding | 52.6 | 52.4 |
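
The win counts can be cross-checked directly from the table. The following is a minimal sketch that compares the displayed (rounded) scores row by row and tallies wins overall and per category; the function names `winner`, `tally`, and `tally_by_category` are hypothetical, and the page's own counting rule is not documented.

```python
from collections import Counter, defaultdict

# (benchmark, category, o4 Mini score, o3 score), copied from the table above.
SCORES = [
    ("Aider polyglot", "coding", 72.0, 81.3),
    ("ARC-AGI", "reasoning", 58.7, 60.8),
    ("ARC-AGI-2", "reasoning", 6.1, 6.5),
    ("CadEval", "coding", 62.0, 74.0),
    ("Fiction.LiveBench", "knowledge", 77.8, 88.9),
    ("FrontierMath-2025-02-28-Private", "math", 24.8, 18.7),
    ("FrontierMath-Tier-4-2025-07-01-Private", "math", 6.3, 2.1),
    ("GeoBench", "knowledge", 64.0, 74.0),
    ("GPQA diamond", "knowledge", 72.8, 75.8),
    ("GSO-Bench", "coding", 3.6, 8.8),
    ("HELM - GPQA", "knowledge", 73.5, 75.3),
    ("HELM - IFEval", "language", 92.9, 86.9),
    ("HELM - MMLU-Pro", "knowledge", 82.0, 85.9),
    ("HELM - Omni-MATH", "math", 72.0, 71.4),
    ("HELM - WildBench", "reasoning", 85.4, 86.1),
    ("HLE", "knowledge", 13.9, 16.3),
    ("Lech Mazur Writing", "knowledge", 75.0, 83.9),
    ("MATH level 5", "math", 97.8, 97.8),
    ("OTIS Mock AIME 2024-2025", "math", 81.7, 83.9),
    ("SimpleBench", "reasoning", 26.4, 43.7),
    ("SimpleQA Verified", "knowledge", 23.9, 53.0),
    ("SWE-Bench Verified (Bash Only)", "coding", 45.0, 58.4),
    ("VPCT", "knowledge", 36.3, 28.0),
    ("WeirdML", "coding", 52.6, 52.4),
]


def winner(o4_mini: float, o3: float) -> str:
    """Name the model with the higher displayed score, or 'tie' if equal."""
    if o4_mini > o3:
        return "o4 Mini"
    if o3 > o4_mini:
        return "o3"
    return "tie"


def tally(rows):
    """Count wins per model (and ties) across all rows."""
    return Counter(winner(a, b) for _name, _category, a, b in rows)


def tally_by_category(rows):
    """Count wins per model (and ties) within each benchmark category."""
    by_category = defaultdict(Counter)
    for _name, category, a, b in rows:
        by_category[category][winner(a, b)] += 1
    return by_category


if __name__ == "__main__":
    print(dict(tally(SCORES)))
    for category, counts in sorted(tally_by_category(SCORES).items()):
        print(category, dict(counts))
```

On the displayed values this yields 17 wins for o3, 6 for o4 Mini, and one tie (MATH level 5, 97.8 each). The page's 7/17 split presumably credits that tie to o4 Mini or compares unrounded scores; that reading is an assumption. By category, o4 Mini's wins are concentrated in math (3 of 5), while o3 leads every reasoning benchmark and most coding and knowledge ones.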