M
Kimi K2 Thinking
C贸digo abiertopor moonshotai 路 Publicado el 2025-11-06
38.1
puntuaci贸n promedio
$0.47/1M
Precio de entrada
$2.00/1M
Precio de salida
131K tokens (~66 books)
Ventana de contexto
text
Tipo
Tested on 10 benchmarks with 38.1% average. Top scores: OTIS Mock AIME 2024-2025 (83.0%), GPQA diamond (79.0%), SWE-Bench Verified (Bash Only) (63.4%).
Puntuaciones de benchmark
| Benchmark | Categor铆a | Puntuaci贸n | Bar |
|---|---|---|---|
| OTIS Mock AIME 2024-2025 | math | 83.0 | |
| GPQA diamond | knowledge | 79.0 | |
| SWE-Bench Verified (Bash Only) | coding | 63.4 | |
| WeirdML | coding | 42.8 | |
| Terminal Bench | coding | 35.7 | |
| SimpleQA Verified | knowledge | 31.6 | |
| FrontierMath-2025-02-28-Private | math | 21.4 | |
| Chess Puzzles | knowledge | 20.0 | |
| APEX-Agents | agentic | 4.0 | |
| FrontierMath-Tier-4-2025-07-01-Private | math | 0.1 |