
Phi-1.5

by Microsoft · Released Jan 2024

Open Source
Average score: 15.6
Rank: #214
Better than 8% of all models
Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text
License: Open Source
Benchmarks: 11 tested
Data updated today
About

Phi-1.5 was tested on 11 benchmarks, with a 16.3% average. Top scores: Winogrande (46.8%), HellaSwag (30.1%), ARC AI2 (25.9%).

Capabilities
reasoning: 3.4 (#170 globally)
math: 1.8 (#207 globally)
knowledge: 20.8 (#198 globally)
general: 7.5 (#57 globally)
language: 20.3 (#140 globally)
Benchmark Scores
Tested on 11 benchmarks · Ranked across 5 categories
[Chart: Score Distribution (all 233 models), score axis from 0 to 100, with a marker at this model's position]
MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

3.4
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

1.8
Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

46.8
HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

30.1
ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

25.9
Score legend: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
BenchGecko API: phi-1-5
Specifications
  • Type: text
  • Context: N/A
  • Released: Jan 2024
  • License: Open Source
  • Status: benchmark-only
Available On
Microsoft: TBD
Phi-1.5 is an open-source text AI model by Microsoft, released in January 2024. It has an average benchmark score of 15.6.
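The page reports Rank #214 among 233 models alongside "Better than 8% of all models". How the site derives that percentage is not stated. The sketch below assumes it is the share of models ranked below this one, which happens to reproduce the published figure. The function name `percent_better_than` is hypothetical and not part of any BenchGecko API.

```python
def percent_better_than(rank: int, total: int) -> float:
    """Percentage of models ranked strictly below `rank` (1 = best).

    Assumed formula, not confirmed by the source page:
        100 * (total - rank) / total
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank) / total


if __name__ == "__main__":
    # Values taken from the page: Rank #214 out of 233 models.
    share = percent_better_than(214, 233)
    print(f"Better than {round(share)}% of all models")
```

Under this assumption, 19 of the 233 models rank lower, which rounds to the 8% shown on the page.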