测试版
排行榜/Grok 3 Beta
xAI logo

Grok 3 Beta

来自 xAI · 发布于 2025-04-09

69.5
平均分
$3.00/1M
输入价格
$15.00/1M
输出价格
131K tokens (~66 books)
上下文窗口
text
类型

Tested on 6 benchmarks with 69.5% average. Top scores: HELM — IFEval (88.4%), HELM — WildBench (84.9%), HELM — MMLU-Pro (78.8%).

基准测试类别分数Bar
HELM — IFEvallanguage88.4
HELM — WildBenchreasoning84.9
HELM — MMLU-Proknowledge78.8
HELM — GPQAknowledge65.0
Aider polyglotcoding53.3
HELM — Omni-MATHmath46.4