Beta
Rangliste/o4 Mini
OpenAI logo

o4 Mini

von OpenAI · Veroeffentlicht 2025-04-16

53.2
Durchschn. Score
$1.10/1M
Eingabepreis
$4.40/1M
Ausgabepreis
200K tokens (~100 books)
Kontextfenster
multimodal
Typ

Tested on 26 benchmarks with 53.2% average. Top scores: MATH level 5 (97.8%), HELM — IFEval (92.9%), HELM — WildBench (85.4%).

BenchmarkKategorieScoreBar
MATH level 5math97.8
HELM — IFEvallanguage92.9
HELM — WildBenchreasoning85.4
HELM — MMLU-Proknowledge82.0
OTIS Mock AIME 2024-2025math81.7
Fiction.LiveBenchknowledge77.8
Lech Mazur Writingknowledge75.0
HELM — GPQAknowledge73.5
GPQA diamondknowledge72.8
Aider polyglotcoding72.0
HELM — Omni-MATHmath72.0
GeoBenchknowledge64.0
CadEvalcoding62.0
ARC-AGIreasoning58.7
WeirdMLcoding52.6
VISTAknowledge51.8
SWE-Bench Verified (Bash Only)coding45.0
VPCTknowledge36.3
SimpleBenchreasoning26.4
Chess Puzzlesknowledge26.0
FrontierMath-2025-02-28-Privatemath24.8
SimpleQA Verifiedknowledge23.9
HLEknowledge13.9
FrontierMath-Tier-4-2025-07-01-Privatemath6.3
ARC-AGI-2reasoning6.1
GSO-Benchcoding3.6