Falcon 2 11B vs Qwen2.5 72B Instruct
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Qwen2.5 72B Instruct wins 3 of 3 shared benchmarks and leads in knowledge.
Category leads
knowledge · Qwen2.5 72B Instruct
Hype vs Reality
Attention vs performance
Falcon 2 11B
#52 by perf · no signal
Qwen2.5 72B Instruct
#82 by perf · no signal
Best value
Qwen2.5 72B Instruct
Falcon 2 11B
—
no price
Qwen2.5 72B Instruct
140.0 pts/$
$0.38/M
Vendor risk
Who is behind the model
TII
private · undisclosed
Alibaba (Qwen)
$293.0B · Tier 1
Head to head
3 benchmarks · 2 models
Falcon 2 11B · Qwen2.5 72B Instruct
HellaSwag
Qwen2.5 72B Instruct leads by +2.5
HellaSwag · tests commonsense reasoning by asking models to predict the most plausible continuation of everyday scenarios.
Falcon 2 11B
77.2
Qwen2.5 72B Instruct
79.7
MMLU
Qwen2.5 72B Instruct leads by +35.9
Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge.
Falcon 2 11B
44.5
Qwen2.5 72B Instruct
80.4
WinoGrande
Qwen2.5 72B Instruct leads by +8.0
WinoGrande · large-scale commonsense reasoning benchmark where models must resolve ambiguous pronouns in carefully constructed sentence pairs.
Falcon 2 11B
56.6
Qwen2.5 72B Instruct
64.6
Full benchmark table
| Benchmark | Falcon 2 11B | Qwen2.5 72B Instruct |
|---|---|---|
| HellaSwag | 77.2 | 79.7 |
| MMLU | 44.5 | 80.4 |
| WinoGrande | 56.6 | 64.6 |
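The per-benchmark leads and the 3-of-3 win count can be recomputed from the table. The following is a minimal sketch, not the page's own method: the `scores` dictionary and the `summarize` helper are hypothetical names introduced here, and the margin is assumed to be a simple difference between the two scores, rounded to one decimal.

```python
# Scores copied from the benchmark table above.
scores = {
    "HellaSwag": {"Falcon 2 11B": 77.2, "Qwen2.5 72B Instruct": 79.7},
    "MMLU": {"Falcon 2 11B": 44.5, "Qwen2.5 72B Instruct": 80.4},
    "WinoGrande": {"Falcon 2 11B": 56.6, "Qwen2.5 72B Instruct": 64.6},
}


def summarize(scores):
    """Return (wins per model, {benchmark: (leader, margin)}).

    Assumes exactly two models per benchmark and that the margin is the
    plain score difference, rounded to one decimal place.
    """
    wins = {}
    leads = {}
    for bench, by_model in scores.items():
        ranked = sorted(by_model.items(), key=lambda kv: kv[1], reverse=True)
        (leader, top), (_, runner_up) = ranked[0], ranked[1]
        leads[bench] = (leader, round(top - runner_up, 1))
        wins[leader] = wins.get(leader, 0) + 1
    return wins, leads


wins, leads = summarize(scores)
for bench, (leader, margin) in leads.items():
    print(f"{bench}: {leader} leads by +{margin}")
```

Under those assumptions the margins match the figures shown in the head-to-head cards (+2.5, +35.9, +8.0).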
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| Falcon 2 11B | — | — | — | — |
| Qwen2.5 72B Instruct | $0.36 | $0.40 | 33K tokens (~16 books) | $3.70 |
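The page does not say how the 10M monthly tokens are divided between input and output. As a rough sketch, assuming a 75% input / 25% output split (an assumption chosen here because it reproduces the $3.70 figure, not something the page states), the projection works out as below. The function name and the `input_share` parameter are hypothetical.

```python
def projected_monthly_cost(input_price, output_price,
                           total_tokens_m=10.0, input_share=0.75):
    """Estimate monthly cost in dollars.

    Prices are dollars per 1M tokens; total_tokens_m is monthly volume in
    millions of tokens. input_share is an ASSUMED fraction of tokens billed
    as input; the source page does not document its split.
    """
    input_tokens_m = total_tokens_m * input_share
    output_tokens_m = total_tokens_m - input_tokens_m
    return input_tokens_m * input_price + output_tokens_m * output_price


# Qwen2.5 72B Instruct prices from the pricing table.
cost = projected_monthly_cost(0.36, 0.40)
print(f"Projected: ${cost:.2f}/mo")

# The $0.38/M figure in the "Best value" card equals the simple mean
# of the input and output prices.
blended = (0.36 + 0.40) / 2
print(f"Blended: ${blended:.2f}/M")
```

A different workload mix changes the result: an even 50/50 split, for example, would give $3.80 at these prices.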