GPT-4.1

Provided by OpenAI · Release date 2025-04-14

Average score: 43.3
Input price: $2.00 per 1M tokens
Output price: $8.00 per 1M tokens
Context window: 1.0M tokens (~524 books)
Type: multimodal
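
The listed prices can be turned into a rough per-request cost estimate. The sketch below is illustrative only: it assumes both prices are in USD per 1M tokens and that billing is a simple linear function of token counts. The function name `estimate_cost_usd` is hypothetical and not part of any API.

```python
# Prices as listed on this page, assumed to be USD per 1M tokens.
INPUT_USD_PER_M = 2.00
OUTPUT_USD_PER_M = 8.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token pricing."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt that produces a 2,000-token reply.
    print(f"${estimate_cost_usd(10_000, 2_000):.3f}")
```

Under these assumptions, output tokens cost four times as much as input tokens, so long replies dominate the bill well before long prompts do.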

Tested on 22 benchmarks, with an average score of 43.3%. Top scores: HELM — WildBench (85.4%), HELM — IFEval (83.8%), and MATH level 5 (83.0%).

Benchmark                               Category   Score (%)
HELM — WildBench                        reasoning  85.4
HELM — IFEval                           language   83.8
MATH level 5                            math       83.0
HELM — MMLU-Pro                         knowledge  81.1
GeoBench                                knowledge  72.0
HELM — GPQA                             knowledge  65.9
Fiction.LiveBench                       knowledge  63.9
GPQA diamond                            knowledge  55.9
Aider polyglot                          coding     52.4
SWE-Bench verified                      coding     48.5
HELM — Omni-MATH                        math       47.1
CadEval                                 coding     42.0
SWE-Bench Verified (Bash Only)          coding     39.6
WeirdML                                 coding     39.0
OTIS Mock AIME 2024-2025                math       38.3
DeepResearch Bench                      knowledge  29.3
SimpleBench                             reasoning  12.4
FrontierMath-2025-02-28-Private         math       5.5
ARC-AGI                                 reasoning  5.5
HLE                                     knowledge  0.6
ARC-AGI-2                               reasoning  0.4
FrontierMath-Tier-4-2025-07-01-Private  math       0.1
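
The page does not say how the 43.3 average is computed. As a check, the sketch below assumes it is the unweighted arithmetic mean of the 22 scores listed above, and also groups the scores by category. The helper names `overall_average` and `category_averages` are made up for this illustration.

```python
from collections import defaultdict
from statistics import mean

# (benchmark, category, score) rows copied from the table above.
ROWS = [
    ("HELM — WildBench", "reasoning", 85.4),
    ("HELM — IFEval", "language", 83.8),
    ("MATH level 5", "math", 83.0),
    ("HELM — MMLU-Pro", "knowledge", 81.1),
    ("GeoBench", "knowledge", 72.0),
    ("HELM — GPQA", "knowledge", 65.9),
    ("Fiction.LiveBench", "knowledge", 63.9),
    ("GPQA diamond", "knowledge", 55.9),
    ("Aider polyglot", "coding", 52.4),
    ("SWE-Bench verified", "coding", 48.5),
    ("HELM — Omni-MATH", "math", 47.1),
    ("CadEval", "coding", 42.0),
    ("SWE-Bench Verified (Bash Only)", "coding", 39.6),
    ("WeirdML", "coding", 39.0),
    ("OTIS Mock AIME 2024-2025", "math", 38.3),
    ("DeepResearch Bench", "knowledge", 29.3),
    ("SimpleBench", "reasoning", 12.4),
    ("FrontierMath-2025-02-28-Private", "math", 5.5),
    ("ARC-AGI", "reasoning", 5.5),
    ("HLE", "knowledge", 0.6),
    ("ARC-AGI-2", "reasoning", 0.4),
    ("FrontierMath-Tier-4-2025-07-01-Private", "math", 0.1),
]


def overall_average(rows):
    """Unweighted mean of all scores, rounded to one decimal place."""
    return round(mean(score for _, _, score in rows), 1)


def category_averages(rows):
    """Unweighted mean score per category, rounded to one decimal place."""
    by_category = defaultdict(list)
    for _, category, score in rows:
        by_category[category].append(score)
    return {cat: round(mean(scores), 1) for cat, scores in by_category.items()}


if __name__ == "__main__":
    print(len(ROWS), overall_average(ROWS))
    for category, avg in sorted(category_averages(ROWS).items()):
        print(category, avg)
```

The unweighted mean reproduces the reported 43.3, which is consistent with (but does not prove) the assumption that every benchmark counts equally regardless of category or difficulty.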