Compare · ModelsLive · 2 picked · head to head
Gemma 2 9B vs Qwen-14B
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Gemma 2 9B wins on 3/3 benchmarks
Gemma 2 9B wins 3 of 3 shared benchmarks. Leads in math · knowledge.
Category leads
math·Gemma 2 9Bknowledge·Gemma 2 9B
Hype vs Reality
Attention vs performance
Gemma 2 9B
#165 by perf·no signal
Qwen-14B
#37 by perf·no signal
Vendor risk
Who is behind the model
Google DeepMind
$4.00T·Tier 1
Alibaba (Qwen)
$293.0B·Tier 1
Head to head
3 benchmarks · 2 models
Gemma 2 9BQwen-14B
GSM8K
Gemma 2 9B leads by +23.6
Grade School Math 8K · 8,500 linguistically diverse grade-school math word problems that require multi-step reasoning to solve.
Gemma 2 9B
84.9
Qwen-14B
61.3
MMLU
Gemma 2 9B leads by +7.7
Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge.
Gemma 2 9B
62.8
Qwen-14B
55.1
PIQA
Gemma 2 9B leads by +7.6
PIQA (Physical Interaction QA) · tests intuitive physical reasoning by asking models to select the correct approach for everyday physical tasks.
Gemma 2 9B
67.4
Qwen-14B
59.8
Full benchmark table
| Benchmark | Gemma 2 9B | Qwen-14B |
|---|---|---|
GSM8K Grade School Math 8K · 8,500 linguistically diverse grade-school math word problems that require multi-step reasoning to solve. | 84.9 | 61.3 |
MMLU Massive Multitask Language Understanding · 57 subjects spanning STEM, humanities, social sciences, and more. The standard benchmark for broad knowledge. | 62.8 | 55.1 |
PIQA PIQA (Physical Interaction QA) · tests intuitive physical reasoning by asking models to select the correct approach for everyday physical tasks. | 67.4 | 59.8 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.03 | $0.09 | 8K tokens (~4 books) | $0.45 | |
| — | — | — | — |