phi-3-mini 3.8B
Open Sourcevon Microsoft · Veroeffentlicht 2024-01-01
61.0
Durchschn. Score
N/A
Eingabepreis
N/A
Ausgabepreis
N/A
Kontextfenster
text
Typ
Tested on 8 benchmarks with 61.0% average. Top scores: OpenBookQA (84.0%), ARC AI2 (79.9%), HellaSwag (68.9%).
Benchmark-Ergebnisse
| Benchmark | Kategorie | Score | Bar |
|---|---|---|---|
| OpenBookQA | knowledge | 84.0 | |
| ARC AI2 | knowledge | 79.9 | |
| HellaSwag | knowledge | 68.9 | |
| TriviaQA | knowledge | 64.0 | |
| BBH | reasoning | 62.3 | |
| MMLU | knowledge | 58.4 | |
| Winogrande | knowledge | 41.6 | |
| ANLI | knowledge | 29.2 |
Aehnliche Modelle
OpenAI
61.2
Alibaba Qwen
60.7
Google DeepMind
60.6
Alibaba
60.6