How much does Phi-1.5 cost?

Phi-1.5 is open source and can be self-hosted. No per-token API pricing is listed for it: input and output prices are both marked TBD.

What benchmarks has Phi-1.5 been tested on?

Phi-1.5 has been evaluated on 11 benchmarks. Top scores: Winogrande: 46.8, HellaSwag: 30.1, ARC AI2: 25.9.

How does Phi-1.5 compare to RedPajama-INCITE-7B-Base?

Phi-1.5 has an average score of 17.5, while RedPajama-INCITE-7B-Base scores 17.7, so RedPajama-INCITE-7B-Base slightly outperforms Phi-1.5 overall.


Phi-1.5

Name: Phi-1.5
Author: Microsoft
Released: Jan 2024

Open Source
Average score: 17.5
Rank: #252
Better than 8% of all models
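
The "better than 8%" figure is consistent with the rank shown here: the score distribution on this page covers 274 models, and rank #252 leaves 22 models below it. A minimal sketch of that arithmetic follows. It assumes, since the page does not state its formula, that the percentage is the share of models ranked lower, rounded to the nearest whole percent. The function name `percent_better_than` is hypothetical.

```python
def percent_better_than(rank: int, total_models: int) -> float:
    """Return the share of models ranked below `rank`, as a percentage.

    Assumption: rank 1 is the best model and ranks are unique.
    """
    if not 1 <= rank <= total_models:
        raise ValueError("rank must be between 1 and total_models")
    models_below = total_models - rank
    return 100.0 * models_below / total_models


# Values taken from this page: rank #252 among 274 models.
share = percent_better_than(rank=252, total_models=274)
print(round(share))  # 8
```

Under this reading, 22 of 274 models score lower, which rounds to the 8% shown.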

Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text
License: Open Source

Benchmarks: 11 tested


About

Tested on 11 benchmarks with 16.3% average. Top scores: Winogrande (46.8%), HellaSwag (30.1%), ARC AI2 (25.9%).

Capabilities

reasoning: 3.4 (#196 globally)
math: 1.8 (#240 globally)
knowledge: 20.8 (#234 globally)
general: 7.5 (#61 globally)
language: 20.3 (#160 globally)

Benchmark Scores


Tested on 11 benchmarks · Ranked across 5 categories

[Figure: Score Distribution (all 274 models). Horizontal axis from 0 to 100, with a marker showing this model's position.]

reasoning

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

Score: 3.4

math

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

Score: 1.8

knowledge

Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

Score: 46.8

HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

Score: 30.1

ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

Score: 25.9

Quick compare: vs RedPajama-INCITE-7B-Base, vs DeepSeek Coder 6.7B, vs Cerebras-GPT-13B

Score legend: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
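
The legend above can be read as a simple threshold lookup. The sketch below assumes that lower bounds are inclusive, since the legend does not say which tier a score of exactly 70 or 85 falls into, and that scores lie on a 0 to 100 scale. `score_tier` is a hypothetical helper for illustration, not part of any BenchGecko API.

```python
def score_tier(score: float) -> str:
    """Map a 0-100 benchmark score to the legend's tier label.

    Assumption: each tier includes its lower bound (85 -> Excellent, 70 -> Good).
    """
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Top scores listed on this page for Phi-1.5.
top_scores = [("Winogrande", 46.8), ("HellaSwag", 30.1), ("ARC AI2", 25.9)]
for name, score in top_scores:
    print(f"{name}: {score} -> {score_tier(score)}")
```

Under this reading, all three of Phi-1.5's top scores fall in the Below tier.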

Similar Models

RedPajama-INCITE-7B-Base


BenchGecko API: phi-1-5

Specifications

Type: text
Context: N/A
Released: Jan 2024
License: Open Source
Status: benchmark-only

Available On

Microsoft: TBD

Frequently Asked Questions

What is Phi-1.5?

Phi-1.5 is an open-source text AI model by Microsoft, released in January 2024. It has an average benchmark score of 17.5.

Benchmarks

Winogrande HellaSwag ARC AI2 IFEval MMLU
