LIVE268社のプ��バイダーから976のAIモデルを追跡中。

BenchGeckoベータ

モデル976·プロバイダー268·ベンチマーク128·企業71·エージェント165·トップQwen3 VL 235B A22B Instruct · 1415.8%·更新1時間前·データポイント2,902·MCPサーバー4,923

ランキング/Grok 3 Beta

Grok 3 Beta

開発元 xAI · リリース日 2025-04-09

69.5

平均スコア

$3.00/1M

入力料金

$15.00/1M

出力料金

131K tokens (~66 books)

コンテキストウィンドウ

text

タイプ

Tested on 6 benchmarks with 69.5% average. Top scores: HELM — IFEval (88.4%), HELM — WildBench (84.9%), HELM — MMLU-Pro (78.8%).

ベンチマークスコア

ベンチマーク	カテゴリ	スコア	Bar
HELM — IFEval	language	88.4
HELM — WildBench	reasoning	84.9
HELM — MMLU-Pro	knowledge	78.8
HELM — GPQA	knowledge	65.0
Aider polyglot	coding	53.3
HELM — Omni-MATH	math	46.4

類似モデル

xAI Grok 3 タイムライン

$3.00/M in131Kctx13 benchmarks

Grok 3 BetaApr 2025

$3.00/M in131Kctx6 benchmarks

Grok 3 MiniJun 2025

$0.30/M in(-2.70)131Kctx11 benchmarks

Grok 3 Mini BetaApr 2025

$0.30/M in131Kctx7 benchmarks