Compare · ModelsLive · 2 picked · head to head
Gpt Neo 125m vs TinyLlama 1.1B Chat V1.0
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
TinyLlama 1.1B Chat V1.0 wins on 4/6 benchmarks
TinyLlama 1.1B Chat V1.0 wins 4 of 6 shared benchmarks. Leads in general · math · reasoning.
Category leads
general·TinyLlama 1.1B Chat V1.0knowledge·Gpt Neo 125mlanguage·Gpt Neo 125mmath·TinyLlama 1.1B Chat V1.0reasoning·TinyLlama 1.1B Chat V1.0
Hype vs Reality
Attention vs performance
Gpt Neo 125m
#238 by perf·no signal
TinyLlama 1.1B Chat V1.0
#241 by perf·no signal
Vendor risk
Who is behind the model
eleutherai
private · undisclosed
T
TinyLlama
private · undisclosed
Head to head
6 benchmarks · 2 models
Gpt Neo 125mTinyLlama 1.1B Chat V1.0
BBH (HuggingFace)
TinyLlama 1.1B Chat V1.0 leads by +0.6
Gpt Neo 125m
3.4
TinyLlama 1.1B Chat V1.0
4.0
GPQA
Gpt Neo 125m leads by +0.5
Gpt Neo 125m
0.5
TinyLlama 1.1B Chat V1.0
0.0
IFEval
Gpt Neo 125m leads by +13.1
Gpt Neo 125m
19.1
TinyLlama 1.1B Chat V1.0
6.0
MATH Level 5
TinyLlama 1.1B Chat V1.0 leads by +0.9
Gpt Neo 125m
0.6
TinyLlama 1.1B Chat V1.0
1.5
MMLU-PRO
TinyLlama 1.1B Chat V1.0 leads by +0.8
Gpt Neo 125m
0.3
TinyLlama 1.1B Chat V1.0
1.1
MUSR
TinyLlama 1.1B Chat V1.0 leads by +1.7
Gpt Neo 125m
2.6
TinyLlama 1.1B Chat V1.0
4.3
Full benchmark table
| Benchmark | Gpt Neo 125m | TinyLlama 1.1B Chat V1.0 |
|---|---|---|
BBH (HuggingFace) | 3.4 | 4.0 |
GPQA | 0.5 | 0.0 |
IFEval | 19.1 | 6.0 |
MATH Level 5 | 0.6 | 1.5 |
MMLU-PRO | 0.3 | 1.1 |
MUSR | 2.6 | 4.3 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| — | — | — | — | |
T TinyLlama 1.1B Chat V1.0 | — | — | — | — |