Compare · ModelsLive · 2 picked · head to head
Phi 3 Mini 4k Instruct vs Phi 3.5 Mini Instruct
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Phi 3.5 Mini Instruct wins on 4/6 benchmarks
Phi 3.5 Mini Instruct wins 4 of 6 shared benchmarks. Leads in general · knowledge · language.
Category leads
general·Phi 3.5 Mini Instructknowledge·Phi 3.5 Mini Instructlanguage·Phi 3.5 Mini Instructmath·Phi 3.5 Mini Instructreasoning·Phi 3 Mini 4k Instruct
Hype vs Reality
Attention vs performance
Phi 3 Mini 4k Instruct
#196 by perf·no signal
Phi 3.5 Mini Instruct
#192 by perf·no signal
Best value
Pricing unknown
Phi 3 Mini 4k Instruct
—
no price
Phi 3.5 Mini Instruct
—
no price
Vendor risk
Who is behind the model
Microsoft
$3.00T·Big Tech
Microsoft
$3.00T·Big Tech
Head to head
6 benchmarks · 2 models
Phi 3 Mini 4k InstructPhi 3.5 Mini Instruct
BBH (HuggingFace)
Phi 3.5 Mini Instruct leads by +0.2
Phi 3 Mini 4k Instruct
36.6
Phi 3.5 Mini Instruct
36.8
GPQA
Phi 3.5 Mini Instruct leads by +1.0
Phi 3 Mini 4k Instruct
11.0
Phi 3.5 Mini Instruct
12.0
IFEval
Phi 3.5 Mini Instruct leads by +3.0
Phi 3 Mini 4k Instruct
54.8
Phi 3.5 Mini Instruct
57.8
MATH Level 5
Phi 3.5 Mini Instruct leads by +3.3
Phi 3 Mini 4k Instruct
16.4
Phi 3.5 Mini Instruct
19.6
MMLU-PRO
Phi 3 Mini 4k Instruct leads by +0.7
Phi 3 Mini 4k Instruct
33.6
Phi 3.5 Mini Instruct
32.9
MUSR
Phi 3 Mini 4k Instruct leads by +3.0
Phi 3 Mini 4k Instruct
13.1
Phi 3.5 Mini Instruct
10.1
Full benchmark table
| Benchmark | Phi 3 Mini 4k Instruct | Phi 3.5 Mini Instruct |
|---|---|---|
BBH (HuggingFace) | 36.6 | 36.8 |
GPQA | 11.0 | 12.0 |
IFEval | 54.8 | 57.8 |
MATH Level 5 | 16.4 | 19.6 |
MMLU-PRO | 33.6 | 32.9 |
MUSR | 13.1 | 10.1 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| — | — | — | — | |
| — | — | — | — |