GPT-5.2

By OpenAI · Released 2025-12-10

Average score: 54.0
Input price: $1.75/1M tokens
Output price: $14.00/1M tokens
Context window: 400K tokens (~200 books)
Type: multimodal
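
As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. It assumes the prices are quoted per one million tokens and billed linearly; the function name `estimate_cost_usd` and the token counts in the example are hypothetical, not taken from the page.

```python
# Prices as listed on the page, assumed to be USD per 1,000,000 tokens.
INPUT_PRICE_PER_M = 1.75
OUTPUT_PRICE_PER_M = 14.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under simple linear per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost + output_cost


# Hypothetical request: 100,000 input tokens and 5,000 output tokens.
example_cost = estimate_cost_usd(100_000, 5_000)
print(f"${example_cost:.3f}")
```

Output tokens cost eight times as much as input tokens at these prices, so for long generations the output side can dominate the bill even when the prompt is much larger.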

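Tested on 20 benchmarks. The 54.0% average is consistent with the mean of the 18 percentage-scored benchmarks; the two Chatbot Arena entries are Elo ratings, not percentages. Top scores: Chatbot Arena Elo — Overall (1439.5 Elo), Chatbot Arena Elo — Coding (1403.1 Elo), OTIS Mock AIME 2024-2025 (96.1%).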

Benchmark                                 Category    Score (%; Elo for arena rows)
Chatbot Arena Elo — Overall               arena       1439.5
Chatbot Arena Elo — Coding                arena       1403.1
OTIS Mock AIME 2024-2025                  math        96.1
GPQA diamond                              knowledge   88.5
ARC-AGI                                   reasoning   86.2
VPCT                                      knowledge   76.0
SWE-Bench Verified                        coding      73.8
WeirdML                                   coding      72.2
SWE-Bench Verified (Bash Only)            coding      71.8
Terminal Bench                            coding      64.9
ARC-AGI-2                                 reasoning   52.9
Chess Puzzles                             knowledge   49.0
FrontierMath-2025-02-28-Private           math        40.7
SimpleQA Verified                         knowledge   38.9
SimpleBench                               reasoning   35.0
APEX-Agents                               agentic     34.3
GSO-Bench                                 coding      27.4
HLE                                       knowledge   24.2
PostTrainBench                            knowledge   21.4
FrontierMath-Tier-4-2025-07-01-Private    math        18.8
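
The reported 54.0 average can be checked against the scores listed above. The sketch below assumes the average is a plain unweighted mean; the page does not state its method, so the exclusion of the two arena rows is an inference from the arithmetic, not a documented rule.

```python
from statistics import mean

# (benchmark, category, score) copied from the table above.
# Arena rows are Elo ratings; every other row is a percentage.
RESULTS = [
    ("Chatbot Arena Elo — Overall", "arena", 1439.5),
    ("Chatbot Arena Elo — Coding", "arena", 1403.1),
    ("OTIS Mock AIME 2024-2025", "math", 96.1),
    ("GPQA diamond", "knowledge", 88.5),
    ("ARC-AGI", "reasoning", 86.2),
    ("VPCT", "knowledge", 76.0),
    ("SWE-Bench verified", "coding", 73.8),
    ("WeirdML", "coding", 72.2),
    ("SWE-Bench Verified (Bash Only)", "coding", 71.8),
    ("Terminal Bench", "coding", 64.9),
    ("ARC-AGI-2", "reasoning", 52.9),
    ("Chess Puzzles", "knowledge", 49.0),
    ("FrontierMath-2025-02-28-Private", "math", 40.7),
    ("SimpleQA Verified", "knowledge", 38.9),
    ("SimpleBench", "reasoning", 35.0),
    ("APEX-Agents", "agentic", 34.3),
    ("GSO-Bench", "coding", 27.4),
    ("HLE", "knowledge", 24.2),
    ("PostTrainBench", "knowledge", 21.4),
    ("FrontierMath-Tier-4-2025-07-01-Private", "math", 18.8),
]

# Assumption: only percentage-scored rows enter the average.
percent_scores = [score for _, category, score in RESULTS if category != "arena"]
average_percent = round(mean(percent_scores), 1)

# For contrast: a naive mean over all 20 rows mixes Elo ratings with percentages.
average_all_rows = round(mean(score for _, _, score in RESULTS), 1)

print(len(percent_scores), average_percent, average_all_rows)
```

The mean over the 18 percentage rows rounds to 54.0, matching the headline figure, while the mean over all 20 rows does not, which suggests the Elo ratings are left out of the average.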