GPT-4o-mini (2024-07-18)

By OpenAI · Released 2024-07-18

Average score: 43.2
Input price: $0.15 per 1M tokens
Output price: $0.60 per 1M tokens
Context window: 128K tokens
Type: multimodal
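As a rough illustration of how the listed per-token prices translate into a per-request cost, the sketch below applies the two rates to a hypothetical request. The helper name `estimate_cost_usd` and the example token counts are assumptions made for illustration; only the two prices come from the listing.

```python
# Prices per 1M tokens, taken from the listing above.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars (hypothetical helper)."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Assumed example: a 10,000-token prompt with a 1,000-token reply.
cost = estimate_cost_usd(10_000, 1_000)
print(f"${cost:.4f}")  # $0.0021
```

Output tokens cost four times as much as input tokens at these rates, so long replies dominate the bill sooner than long prompts do.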

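Tested on 20 benchmarks. The 43.2 average matches the mean of the 19 benchmarks scored on a 0-100 scale; the Chatbot Arena Elo rating (1317.2) is on a different scale and appears to be excluded from it. Top scores: Chatbot Arena Elo - Overall (1317.2 rating), GSM8K (91.3%), HELM - WildBench (79.1%).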

Benchmark                      Category     Score
Chatbot Arena Elo - Overall    arena        1317.2 (Elo rating)
GSM8K                          math         91.3
HELM - WildBench               reasoning    79.1
HELM - IFEval                  language     78.2
PIQA                           knowledge    77.4
MMLU                           knowledge    75.7
Lech Mazur Writing             knowledge    67.2
GeoBench                       knowledge    64.0
HELM - MMLU-Pro                knowledge    60.3
VideoMME                       multimodal   53.1
MATH level 5                   math         52.6
HELM - GPQA                    knowledge    36.8
HELM - Omni-MATH               math         28.0
Balrog                         knowledge    17.4
GPQA diamond                   knowledge    17.0
WeirdML                        coding       11.8
OTIS Mock AIME 2024-2025       math         6.8
Aider polyglot                 coding       3.6
VPCT                           knowledge    1.0
ARC-AGI-2                      reasoning    0.1
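
The headline average can be checked against the per-benchmark scores. The sketch below assumes the page averages only the 0-100 scores and leaves out the Arena Elo rating; that assumption is not stated on the page, but it reproduces the reported 43.2. The per-category means are an extra illustration and do not appear on the page.

```python
from collections import defaultdict
from statistics import mean

# (benchmark, category, score) rows copied from the listing above.
ROWS = [
    ("Chatbot Arena Elo - Overall", "arena", 1317.2),
    ("GSM8K", "math", 91.3),
    ("HELM - WildBench", "reasoning", 79.1),
    ("HELM - IFEval", "language", 78.2),
    ("PIQA", "knowledge", 77.4),
    ("MMLU", "knowledge", 75.7),
    ("Lech Mazur Writing", "knowledge", 67.2),
    ("GeoBench", "knowledge", 64.0),
    ("HELM - MMLU-Pro", "knowledge", 60.3),
    ("VideoMME", "multimodal", 53.1),
    ("MATH level 5", "math", 52.6),
    ("HELM - GPQA", "knowledge", 36.8),
    ("HELM - Omni-MATH", "math", 28.0),
    ("Balrog", "knowledge", 17.4),
    ("GPQA diamond", "knowledge", 17.0),
    ("WeirdML", "coding", 11.8),
    ("OTIS Mock AIME 2024-2025", "math", 6.8),
    ("Aider polyglot", "coding", 3.6),
    ("VPCT", "knowledge", 1.0),
    ("ARC-AGI-2", "reasoning", 0.1),
]

# Assumption: Elo is a rating, not a 0-100 score, so it is left out.
percent_rows = [row for row in ROWS if row[1] != "arena"]

overall = mean(score for _, _, score in percent_rows)
print(round(overall, 1))  # 43.2

# Group the remaining scores by category and report each mean.
by_category = defaultdict(list)
for _, category, score in percent_rows:
    by_category[category].append(score)

for category, scores in sorted(by_category.items()):
    print(f"{category:<10} n={len(scores):<2} mean={mean(scores):.1f}")
```

Including the Elo rating would pull the mean above 100, which is why mixing the two scales in one average is not meaningful.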