Grok 3 Beta
by xAI · Released 2025-04-09
69.5
Average score
$3.00/1M
Input price
$15.00/1M
Output price
131K tokens (~66 books)
Context window
text
Type
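The listed prices are per 1M tokens, so the cost of a single request scales linearly with its input and output token counts. A minimal sketch, assuming flat per-token billing with no caching discounts or pricing tiers (the card does not say either way); `request_cost` and the token counts are hypothetical, chosen only for illustration:

```python
# Prices in USD per 1M tokens, as listed on this card.
INPUT_PRICE_PER_M = 3.00
OUTPUT_PRICE_PER_M = 15.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat linear pricing; real billing may differ.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Illustrative request: 100K input tokens and 2K output tokens.
cost = request_cost(100_000, 2_000)
print(f"${cost:.2f}")  # prints "$0.33"
```

Output tokens cost five times as much as input tokens at these rates, so long completions can outweigh a much larger prompt in the total.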
Tested on 6 benchmarks, with an average score of 69.5%. Top scores: HELM — IFEval (88.4%), HELM — WildBench (84.9%), HELM — MMLU-Pro (78.8%).
Benchmark scores
| Benchmark | Category | Score (%) |
|---|---|---|
| HELM — IFEval | language | 88.4 |
| HELM — WildBench | reasoning | 84.9 |
| HELM — MMLU-Pro | knowledge | 78.8 |
| HELM — GPQA | knowledge | 65.0 |
| Aider polyglot | coding | 53.3 |
| HELM — Omni-MATH | math | 46.4 |
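
The 69.5 average appears to be the unweighted mean of the six scores in the table. A minimal sketch that checks this, assuming equal weighting (the page does not state how the average is computed); the benchmark keys drop the "HELM" prefix for brevity:

```python
# Scores copied from the table above ("HELM" prefix omitted from the keys).
scores = {
    "IFEval": 88.4,
    "WildBench": 84.9,
    "MMLU-Pro": 78.8,
    "GPQA": 65.0,
    "Aider polyglot": 53.3,
    "Omni-MATH": 46.4,
}

# Unweighted mean across all benchmarks (assumed weighting).
average = sum(scores.values()) / len(scores)
print(round(average, 1))  # prints 69.5

# Three highest-scoring benchmarks, best first.
top3 = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:3]
print([name for name, _ in top3])  # prints ['IFEval', 'WildBench', 'MMLU-Pro']
```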