Compare · ModelsLive · 2 picked · head to head
TinyLlama 1.1B Chat V1.0 vs Gpt Neo 125m
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
TinyLlama 1.1B Chat V1.0 wins on 4/6 benchmarks
TinyLlama 1.1B Chat V1.0 wins 4 of 6 shared benchmarks. Leads in general · math · reasoning.
Category leads
general·TinyLlama 1.1B Chat V1.0knowledge·Gpt Neo 125mlanguage·Gpt Neo 125mmath·TinyLlama 1.1B Chat V1.0reasoning·TinyLlama 1.1B Chat V1.0
Hype vs Reality
Attention vs performance
TinyLlama 1.1B Chat V1.0
#241 by perf·no signal
Gpt Neo 125m
#238 by perf·no signal
Vendor risk
Who is behind the model
T
TinyLlama
private · undisclosed
eleutherai
private · undisclosed
Head to head
6 benchmarks · 2 models
TinyLlama 1.1B Chat V1.0Gpt Neo 125m
BBH (HuggingFace)
TinyLlama 1.1B Chat V1.0 leads by +0.6
TinyLlama 1.1B Chat V1.0
4.0
Gpt Neo 125m
3.4
GPQA
Gpt Neo 125m leads by +0.5
TinyLlama 1.1B Chat V1.0
0.0
Gpt Neo 125m
0.5
IFEval
Gpt Neo 125m leads by +13.1
TinyLlama 1.1B Chat V1.0
6.0
Gpt Neo 125m
19.1
MATH Level 5
TinyLlama 1.1B Chat V1.0 leads by +0.9
TinyLlama 1.1B Chat V1.0
1.5
Gpt Neo 125m
0.6
MMLU-PRO
TinyLlama 1.1B Chat V1.0 leads by +0.8
TinyLlama 1.1B Chat V1.0
1.1
Gpt Neo 125m
0.3
MUSR
TinyLlama 1.1B Chat V1.0 leads by +1.7
TinyLlama 1.1B Chat V1.0
4.3
Gpt Neo 125m
2.6
Full benchmark table
| Benchmark | TinyLlama 1.1B Chat V1.0 | Gpt Neo 125m |
|---|---|---|
BBH (HuggingFace) | 4.0 | 3.4 |
GPQA | 0.0 | 0.5 |
IFEval | 6.0 | 19.1 |
MATH Level 5 | 1.5 | 0.6 |
MMLU-PRO | 1.1 | 0.3 |
MUSR | 4.3 | 2.6 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
T TinyLlama 1.1B Chat V1.0 | — | — | — | — |
| — | — | — | — |