测试版
排行榜/Grok 3
xAI logo

Grok 3

来自 xAI · 发布于 2025-06-10

38.4
平均分
$3.00/1M
输入价格
$15.00/1M
输出价格
131K tokens (~66 books)
上下文窗口
text
类型

Tested on 13 benchmarks with 38.4% average. Top scores: MATH level 5 (88.8%), Lech Mazur Writing (76.4%), GPQA diamond (67.7%).

基准测试类别分数Bar
MATH level 5math88.8
Lech Mazur Writingknowledge76.4
GPQA diamondknowledge67.7
Fiction.LiveBenchknowledge58.3
OTIS Mock AIME 2024-2025math55.5
Aider polyglotcoding53.3
WeirdMLcoding37.2
Balrogknowledge29.5
SimpleBenchreasoning23.3
ARC-AGIreasoning5.5
FrontierMath-2025-02-28-Privatemath3.8
FrontierMath-Tier-4-2025-07-01-Privatemath0.1
ARC-AGI-2reasoning0.1