Phi 4
Open Source by Microsoft · Released 2025-01-10
43.2
Average Score
$0.07/1M
Input Price
$0.14/1M
Output Price
16K tokens (roughly 12,000 English words)
Context Window
text
Type
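The per-token prices listed above can be turned into a per-request estimate with simple arithmetic. The following is a minimal sketch, assuming that billing is linear in token count and that the listed prices are in USD per one million tokens; the helper name `estimate_cost` and the example token counts are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Prices copied from the listing above, assumed to be USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.07")
OUTPUT_PRICE_PER_M = Decimal("0.14")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing with no minimum charge or discounts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Illustrative request: 12,000 prompt tokens and 1,000 completion tokens.
    print(estimate_cost(12_000, 1_000))
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding noise. Actual charges depend on the hosting provider and may differ from this estimate.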
Tested on 16 benchmarks, with an average score of 43.2%. Top scores: Chatbot Arena Elo — Overall (1255.4, an Elo rating rather than a percentage), MMLU (79.7%), IFEval (68.8%).
Benchmark Scores
| Benchmark | Category | Score |
|---|---|---|
| Chatbot Arena Elo — Overall | arena | 1255.4 |
| MMLU | knowledge | 79.7 |
| IFEval | language | 68.8 |
| MATH level 5 | math | 64.9 |
| Lech Mazur Writing | knowledge | 62.6 |
| BBH (HuggingFace) | general | 55.3 |
| MATH Level 5 | math | 50.0 |
| MMLU-PRO | knowledge | 48.6 |
| GPQA diamond | knowledge | 41.4 |
| OTIS Mock AIME 2024-2025 | math | 13.7 |
| Balrog | knowledge | 11.6 |
| GPQA | knowledge | 11.5 |
| Artificial Analysis — Coding Index | speed | 11.2 |
| Artificial Analysis — Quality Index | speed | 10.4 |
| MUSR | reasoning | 10.1 |
| Artificial Analysis — Agentic Index | speed | 0.0 |