베타
리더보드/o4 Mini
OpenAI logo

o4 Mini

제공 OpenAI · 출시일 2025-04-16

53.2
평균 점수
$1.10/1M
입력 가격
$4.40/1M
출력 가격
200K tokens (~100 books)
컨텍스트 윈도우
multimodal
유형

Tested on 26 benchmarks with 53.2% average. Top scores: MATH level 5 (97.8%), HELM — IFEval (92.9%), HELM — WildBench (85.4%).

벤치마크카테고리점수Bar
MATH level 5math97.8
HELM — IFEvallanguage92.9
HELM — WildBenchreasoning85.4
HELM — MMLU-Proknowledge82.0
OTIS Mock AIME 2024-2025math81.7
Fiction.LiveBenchknowledge77.8
Lech Mazur Writingknowledge75.0
HELM — GPQAknowledge73.5
GPQA diamondknowledge72.8
Aider polyglotcoding72.0
HELM — Omni-MATHmath72.0
GeoBenchknowledge64.0
CadEvalcoding62.0
ARC-AGIreasoning58.7
WeirdMLcoding52.6
VISTAknowledge51.8
SWE-Bench Verified (Bash Only)coding45.0
VPCTknowledge36.3
SimpleBenchreasoning26.4
Chess Puzzlesknowledge26.0
FrontierMath-2025-02-28-Privatemath24.8
SimpleQA Verifiedknowledge23.9
HLEknowledge13.9
FrontierMath-Tier-4-2025-07-01-Privatemath6.3
ARC-AGI-2reasoning6.1
GSO-Benchcoding3.6