Qwen2.5 32B Instruct
Código abiertopor Alibaba · Publicado el 2024-09-17
43.2
puntuación promedio
N/A
Precio de entrada
N/A
Precio de salida
N/A
Ventana de contexto
text-generation
Tipo
Tested on 7 benchmarks with 43.2% average. Top scores: IFEval (83.5%), MATH Level 5 (62.5%), BBH (HuggingFace) (56.5%).
Puntuaciones de benchmark
| Benchmark | Categoría | Puntuación | Bar |
|---|---|---|---|
| IFEval | language | 83.5 | |
| MATH Level 5 | math | 62.5 | |
| BBH (HuggingFace) | general | 56.5 | |
| MMLU-PRO | knowledge | 51.9 | |
| PropensityBench | safety | 22.9 | |
| MUSR | reasoning | 13.5 | |
| GPQA | knowledge | 11.7 |