DeepSeek-V2 (MoE-236B, May 2024)
por DeepSeek 路 Publicado el 2024-01-01
76.5
puntuaci贸n promedio
N/A
Precio de entrada
N/A
Precio de salida
N/A
Ventana de contexto
text
Tipo
Tested on 7 benchmarks with 76.5% average. Top scores: ARC AI2 (89.6%), HellaSwag (82.8%), TriviaQA (80.0%).
Puntuaciones de benchmark
| Benchmark | Categor铆a | Puntuaci贸n | Bar |
|---|---|---|---|
| ARC AI2 | knowledge | 89.6 | |
| HellaSwag | knowledge | 82.8 | |
| TriviaQA | knowledge | 80.0 | |
| Winogrande | knowledge | 72.6 | |
| BBH | reasoning | 71.7 | |
| MMLU | knowledge | 71.2 | |
| PIQA | knowledge | 67.8 |