Compare · ModelsLive · 2 picked · head to head
Falcon-180B vs Phi 4 Mini Instruct
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Phi 4 Mini Instruct wins on 5/6 benchmarks
Phi 4 Mini Instruct wins 5 of 6 shared benchmarks. Leads in general · knowledge · language.
Category leads
general·Phi 4 Mini Instructknowledge·Phi 4 Mini Instructlanguage·Phi 4 Mini Instructmath·Phi 4 Mini Instructreasoning·Falcon-180B
Hype vs Reality
Attention vs performance
Falcon-180B
#119 by perf·no signal
Phi 4 Mini Instruct
#187 by perf·no signal
Vendor risk
Who is behind the model
TII
private · undisclosed
Microsoft
$3.00T·Big Tech
Head to head
6 benchmarks · 2 models
Falcon-180BPhi 4 Mini Instruct
BBH (HuggingFace)
Phi 4 Mini Instruct leads by +16.8
Falcon-180B
21.9
Phi 4 Mini Instruct
38.7
GPQA
Phi 4 Mini Instruct leads by +5.1
Falcon-180B
2.8
Phi 4 Mini Instruct
7.9
IFEval
Phi 4 Mini Instruct leads by +41.2
Falcon-180B
32.6
Phi 4 Mini Instruct
73.8
MATH Level 5
Phi 4 Mini Instruct leads by +14.2
Falcon-180B
2.8
Phi 4 Mini Instruct
17.0
MMLU-PRO
Phi 4 Mini Instruct leads by +17.1
Falcon-180B
15.4
Phi 4 Mini Instruct
32.6
MUSR
Falcon-180B leads by +1.1
Falcon-180B
7.5
Phi 4 Mini Instruct
6.5
Full benchmark table
| Benchmark | Falcon-180B | Phi 4 Mini Instruct |
|---|---|---|
BBH (HuggingFace) | 21.9 | 38.7 |
GPQA | 2.8 | 7.9 |
IFEval | 32.6 | 73.8 |
MATH Level 5 | 2.8 | 17.0 |
MMLU-PRO | 15.4 | 32.6 |
MUSR | 7.5 | 6.5 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| — | — | — | — | |
| — | — | — | — |