Beta
Home/Comparer/o4 Mini vs o3

o4 Mini vs o3

Côte à côte. Chaque métrique. Chaque benchmark.

OpenAI
53.2
score moyen
7/24
benchmarks
OpenAI logoo3Gagnant
OpenAI
55.2
score moyen
17/24
benchmarks
Typeo4 Minio3
ProviderOpenAI logoOpenAIOpenAI logoOpenAI
score moyen53.255.2
Prix d'entrée$1.10$2.00
Prix de sortie$4.40$8.00
Fenêtre de contexte200K tokens (~100 books)200K tokens (~100 books)
Sorti le2025-04-162025-04-16
Code source ouvertProprietaryProprietary

24 benchmarks · o4 Mini: 7, o3: 17

BenchmarkCatégorieo4 Minio3
Aider polyglotcoding72.081.3
ARC-AGIreasoning58.760.8
ARC-AGI-2reasoning6.16.5
CadEvalcoding62.074.0
Fiction.LiveBenchknowledge77.888.9
FrontierMath-2025-02-28-Privatemath24.818.7
FrontierMath-Tier-4-2025-07-01-Privatemath6.32.1
GeoBenchknowledge64.074.0
GPQA diamondknowledge72.875.8
GSO-Benchcoding3.68.8
HELM — GPQAknowledge73.575.3
HELM — IFEvallanguage92.986.9
HELM — MMLU-Proknowledge82.085.9
HELM — Omni-MATHmath72.071.4
HELM — WildBenchreasoning85.486.1
HLEknowledge13.916.3
Lech Mazur Writingknowledge75.083.9
MATH level 5math97.897.8
OTIS Mock AIME 2024-2025math81.783.9
SimpleBenchreasoning26.443.7
SimpleQA Verifiedknowledge23.953.0
SWE-Bench Verified (Bash Only)coding45.058.4
VPCTknowledge36.328.0
WeirdMLcoding52.652.4