phi-3-medium 14B
Open Sourceby Microsoft 路 Released 2024-01-01
58.6
avg score
N/A
Input Price
N/A
Output Price
N/A
Context Window
text
Type
Tested on 10 benchmarks with 58.6% average. Top scores: ARC AI2 (88.8%), OpenBookQA (83.2%), HellaSwag (76.5%).
Benchmark Scores
| Benchmark | Category | Score | Bar |
|---|---|---|---|
| ARC AI2 | knowledge | 88.8 | |
| OpenBookQA | knowledge | 83.2 | |
| HellaSwag | knowledge | 76.5 | |
| BBH | reasoning | 75.2 | |
| TriviaQA | knowledge | 73.9 | |
| MMLU | knowledge | 70.7 | |
| Winogrande | knowledge | 63.0 | |
| ANLI | knowledge | 33.7 | |
| MATH level 5 | math | 17.6 | |
| GPQA diamond | knowledge | 3.5 |