
GPT-4o-mini (2024-07-18)

Developer: OpenAI · Release date: 2024-07-18

Average score: 43.2
Input price: $0.15 per 1M tokens
Output price: $0.60 per 1M tokens
Context window: 128K tokens (~64 books)
Type: multimodal
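The listed prices can be turned into a rough per-request cost estimate. The sketch below assumes billing is strictly linear in token count at the two rates shown above; the helper name `estimate_cost_usd` is hypothetical, and real invoices may differ (for example through discounts or pricing changes not shown on this page).

```python
# Prices as listed on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: linear cost estimate from the listed rates."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example: 100,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(100_000, 2_000)
print(f"${cost:.4f}")  # $0.0162
```

At these rates, input tokens dominate the cost of long-context requests, while output tokens cost four times as much per token.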

Tested on 20 benchmarks, with an average score of 43.2%. Top scores: Chatbot Arena Elo — Overall (1317.2, an Elo rating rather than a percentage), GSM8K (91.3%), HELM — WildBench (79.1%).

Benchmark                      Category     Score
Chatbot Arena Elo — Overall    arena        1317.2
GSM8K                          math         91.3
HELM — WildBench               reasoning    79.1
HELM — IFEval                  language     78.2
PIQA                           knowledge    77.4
MMLU                           knowledge    75.7
Lech Mazur Writing             knowledge    67.2
GeoBench                       knowledge    64.0
HELM — MMLU-Pro                knowledge    60.3
VideoMME                       multimodal   53.1
MATH level 5                   math         52.6
HELM — GPQA                    knowledge    36.8
HELM — Omni-MATH               math         28.0
Balrog                         knowledge    17.4
GPQA diamond                   knowledge    17.0
WeirdML                        coding       11.8
OTIS Mock AIME 2024-2025       math         6.8
Aider polyglot                 coding       3.6
VPCT                           knowledge    1.0
ARC-AGI-2                      reasoning    0.1
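The page does not say how the 43.2 average is computed. A quick check using only the scores listed here suggests it is the mean of the 19 percentage-scale benchmarks, with the Chatbot Arena Elo rating left out. This is an inference from the numbers, not a documented method, and the variable names are illustrative.

```python
# Percentage-scale scores copied from the benchmark list (19 entries).
pct_scores = {
    "GSM8K": 91.3, "HELM WildBench": 79.1, "HELM IFEval": 78.2,
    "PIQA": 77.4, "MMLU": 75.7, "Lech Mazur Writing": 67.2,
    "GeoBench": 64.0, "HELM MMLU-Pro": 60.3, "VideoMME": 53.1,
    "MATH level 5": 52.6, "HELM GPQA": 36.8, "HELM Omni-MATH": 28.0,
    "Balrog": 17.4, "GPQA diamond": 17.0, "WeirdML": 11.8,
    "OTIS Mock AIME 2024-2025": 6.8, "Aider polyglot": 3.6,
    "VPCT": 1.0, "ARC-AGI-2": 0.1,
}
# The Elo rating is on a different scale, so it is kept separate.
arena_elo = 1317.2

# Mean over the 19 percentage-scale benchmarks only.
mean_pct = sum(pct_scores.values()) / len(pct_scores)

# Mean over all 20 entries, treating the Elo rating as if it were a score.
mean_with_elo = (sum(pct_scores.values()) + arena_elo) / (len(pct_scores) + 1)

print(round(mean_pct, 1))       # 43.2
print(round(mean_with_elo, 1))  # 106.9
```

Only the first figure matches the reported average, which is consistent with the Elo rating being excluded from it.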