测试版
OpenAI logo

o1

来自 OpenAI · 发布于 2024-12-17

56.4
平均分
$15.00/1M
输入价格
$60.00/1M
输出价格
200K tokens (~100 books)
上下文窗口
multimodal
类型

Tested on 14 benchmarks with 56.4% average. Top scores: MATH level 5 (94.7%), Aider — Code Editing (84.2%), Fiction.LiveBench (83.3%).

基准测试类别分数Bar
MATH level 5math94.7
Aider — Code Editingcoding84.2
Fiction.LiveBenchknowledge83.3
GeoBenchknowledge80.0
OTIS Mock AIME 2024-2025math73.3
Lech Mazur Writingknowledge70.2
GPQA diamondknowledge69.0
Aider polyglotcoding61.7
CadEvalcoding56.0
WeirdMLcoding43.8
ARC-AGIreasoning30.7
SimpleBenchreasoning28.1
FrontierMath-2025-02-28-Privatemath9.3
VPCTknowledge5.5