GPT-4.1

Developer: OpenAI · Release date: 2025-04-14

Average score: 43.3%
Input price: $2.00 per 1M tokens
Output price: $8.00 per 1M tokens
Context window: 1.0M tokens (~524 books)
Type: multimodal
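
The listed per-token prices can be turned into a rough per-request cost. The sketch below assumes flat pricing at the two rates shown on this page, with no discounts or other fees; the helper name `estimate_cost` and the example token counts are hypothetical and chosen only for illustration.

```python
# Prices as listed on this page, in US dollars per 1M tokens.
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 8.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars (hypothetical helper).

    Assumes flat pricing: every input token is billed at the input rate
    and every output token at the output rate.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = (input_tokens / 1_000_000) * INPUT_PRICE_PER_M
    output_cost = (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Example: a 20,000-token prompt with a 1,000-token reply.
cost = estimate_cost(20_000, 1_000)
print(f"${cost:.4f}")  # prints $0.0480
```

Because the output rate is four times the input rate, long replies tend to dominate the bill even when prompts are much larger.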

Tested on 22 benchmarks, with an average score of 43.3%. Top scores: HELM — WildBench (85.4%), HELM — IFEval (83.8%), and MATH level 5 (83.0%).

Benchmark                               Category   Score (%)
HELM — WildBench                        reasoning  85.4
HELM — IFEval                           language   83.8
MATH level 5                            math       83.0
HELM — MMLU-Pro                         knowledge  81.1
GeoBench                                knowledge  72.0
HELM — GPQA                             knowledge  65.9
Fiction.LiveBench                       knowledge  63.9
GPQA diamond                            knowledge  55.9
Aider polyglot                          coding     52.4
SWE-Bench verified                      coding     48.5
HELM — Omni-MATH                        math       47.1
CadEval                                 coding     42.0
SWE-Bench Verified (Bash Only)          coding     39.6
WeirdML                                 coding     39.0
OTIS Mock AIME 2024-2025                math       38.3
DeepResearch Bench                      knowledge  29.3
SimpleBench                             reasoning  12.4
FrontierMath-2025-02-28-Private         math       5.5
ARC-AGI                                 reasoning  5.5
HLE                                     knowledge  0.6
ARC-AGI-2                               reasoning  0.4
FrontierMath-Tier-4-2025-07-01-Private  math       0.1
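
The reported average can be checked against the table. The sketch below assumes the page's figure is an unweighted mean over all 22 listed scores, which is not stated on the page but matches the numbers; the variable names are hypothetical. It also groups scores by category as a rough view of where the model is strongest.

```python
from collections import defaultdict
from statistics import mean

# (benchmark, category, score in %) copied from the table above.
RESULTS = [
    ("HELM — WildBench", "reasoning", 85.4),
    ("HELM — IFEval", "language", 83.8),
    ("MATH level 5", "math", 83.0),
    ("HELM — MMLU-Pro", "knowledge", 81.1),
    ("GeoBench", "knowledge", 72.0),
    ("HELM — GPQA", "knowledge", 65.9),
    ("Fiction.LiveBench", "knowledge", 63.9),
    ("GPQA diamond", "knowledge", 55.9),
    ("Aider polyglot", "coding", 52.4),
    ("SWE-Bench verified", "coding", 48.5),
    ("HELM — Omni-MATH", "math", 47.1),
    ("CadEval", "coding", 42.0),
    ("SWE-Bench Verified (Bash Only)", "coding", 39.6),
    ("WeirdML", "coding", 39.0),
    ("OTIS Mock AIME 2024-2025", "math", 38.3),
    ("DeepResearch Bench", "knowledge", 29.3),
    ("SimpleBench", "reasoning", 12.4),
    ("FrontierMath-2025-02-28-Private", "math", 5.5),
    ("ARC-AGI", "reasoning", 5.5),
    ("HLE", "knowledge", 0.6),
    ("ARC-AGI-2", "reasoning", 0.4),
    ("FrontierMath-Tier-4-2025-07-01-Private", "math", 0.1),
]

# Unweighted mean over every benchmark (assumed to be how the page averages).
overall = mean(score for _, _, score in RESULTS)
print(f"{len(RESULTS)} benchmarks, average {overall:.1f}%")  # 22 benchmarks, average 43.3%

# Per-category means; categories with few benchmarks are noisy.
by_category = defaultdict(list)
for _, category, score in RESULTS:
    by_category[category].append(score)

category_means = {cat: mean(scores) for cat, scores in by_category.items()}
for cat, avg in sorted(category_means.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{cat:<10} n={len(by_category[cat]):<2} mean={avg:.1f}%")
```

The per-category means should be read with care: the language category contains a single benchmark, and the spread within reasoning and math is very wide.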