Compare · ModelsLive · 2 picked · head to head

Falcon 2 11B vs Qwen2.5 72B Instruct

Side by side · benchmarks, pricing, and signals you can act on.

Winner summary

Qwen2.5 72B Instruct wins 3 of 3 shared benchmarks. Leads in knowledge.

Category leads
knowledge·Qwen2.5 72B Instruct
Hype vs Reality
Falcon 2 11B
#52 by perf·no signal
QUIET
Qwen2.5 72B Instruct
#82 by perf·no signal
QUIET
Best value
Falcon 2 11B
no price
Qwen2.5 72B Instruct
140.0 pts/$
$0.38/M
Vendor risk
TII logo
TII
private · undisclosed
Unknown
Alibaba Qwen logo
Alibaba (Qwen)
$293.0B·Tier 1
Low risk
Head to head
Falcon 2 11BQwen2.5 72B Instruct
HellaSwag
Qwen2.5 72B Instruct leads by +2.5
HellaSwag · tests commonsense reasoning by asking models to predict the most plausible continuation of everyday scenarios.
Falcon 2 11B
77.2
Qwen2.5 72B Instruct
79.7
MMLU
Qwen2.5 72B Instruct leads by +35.9
Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge.
Falcon 2 11B
44.5
Qwen2.5 72B Instruct
80.4
Winogrande
Qwen2.5 72B Instruct leads by +8.0
WinoGrande · large-scale commonsense reasoning benchmark where models must resolve ambiguous pronouns in carefully constructed sentence pairs.
Falcon 2 11B
56.6
Qwen2.5 72B Instruct
64.6
Full benchmark table
BenchmarkFalcon 2 11BQwen2.5 72B Instruct
HellaSwag
HellaSwag · tests commonsense reasoning by asking models to predict the most plausible continuation of everyday scenarios.
77.279.7
MMLU
Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge.
44.580.4
Winogrande
WinoGrande · large-scale commonsense reasoning benchmark where models must resolve ambiguous pronouns in carefully constructed sentence pairs.
56.664.6
Pricing · per 1M tokens · projected $/mo at 10M tokens
ModelInputOutputContextProjected $/mo
TII logoFalcon 2 11B
Alibaba Qwen logoQwen2.5 72B Instruct$0.36$0.4033K tokens (~16 books)$3.70