How much does Phi 3 Mini 4k Instruct cost?

Phi 3 Mini 4k Instruct is open source and can be self-hosted.

What benchmarks has Phi 3 Mini 4k Instruct been tested on?

Phi 3 Mini 4k Instruct has been evaluated on 7 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1127.2, IFEval: 54.8, BBH (HuggingFace): 36.6.

Is Phi 3 Mini 4k Instruct open source?

Yes, Phi 3 Mini 4k Instruct is open source.

How does Phi 3 Mini 4k Instruct compare to Magnum v4 72B?

Phi 3 Mini 4k Instruct has an average score of 51.1 while Magnum v4 72B scores 51.2. Magnum v4 72B slightly outperforms Phi 3 Mini 4k Instruct overall. See full comparison →

Home/Models/Phi 3 Mini 4k Instruct

Phi 3 Mini 4k Instruct

Name: Phi 3 Mini 4k Instruct
Author: Microsoft

by Microsoft · Released Apr 2024

Open Source

51.1

avg score

Rank #102

Compare

Better than 56% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

7 tested

Data updated today

About

Microsoft text generation model. 744K downloads on HuggingFace.

Tested on 7 benchmarks with 27.6% average. Top scores: Chatbot Arena Elo — Overall (1127.2%), IFEval (54.8%), BBH (HuggingFace) (36.6%).

Capabilities

reasoning

13.1

#119 globally

math

16.4

#179 globally

knowledge

22.3

#189 globally

language

54.8

#100 globally

general

36.6

#22 globally

Benchmark Scores

Compare All

Tested on 7 benchmarks · Ranked across 6 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

13.1—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

16.4—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

33.6—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

11.0—

Quick compare:

vs Magnum v4 72B

vs Claude Sonnet 4.5

vs Phi 3.5 Mini Instruct

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Microsoft Phi 3

Phi 3 Mini 4k InstructApr 2024

27.6

N/AN/Actx7 benchmarks

Phi 3.5 Mini InstructAug 2024

28.2+0.6

N/AN/Actx6 benchmarks

Phi 3.5 Vision InstructAug 2024

N/AN/Actx

phi-3-medium 14BJan 2024

58.6+58.6

N/AN/Actx10 benchmarks

phi-3-mini 3.8BJan 2024

61.0+2.4

N/AN/Actx8 benchmarks

phi-3-small 7.4BJan 2024

67.4+6.4

N/AN/Actx8 benchmarks

See the full Phi 3 family →

Similar Models

Phi 3.5 Mini Instruct

Microsoft

51.5TBD

Links

Info

Research

Documentation

Community

Source Code

BenchGecko API

microsoft-phi-3-mini-4k-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedApr 2024
LicenseOpen Source
StatusActive

Available On

MicrosoftTBD

Frequently Asked Questions

Phi 3 Mini 4k Instruct is an open-source text-generation AI model by Microsoft, released in April 2024. It has an average benchmark score of 51.1.

Benchmarks

Chatbot Arena Elo — Overall IFEval BBH (HuggingFace)MMLU-PRO MATH Level 5

Microsoft · Provider All Models Compare Models

Phi 3 Mini 4k Instruct

Frequently Asked Questions

Related Models

Benchmarks

Related Pages