Compare · ModelsLive · 2 picked · head to head
GPT-4o-mini (2024-07-18) vs Gemini 2.5 Flash Lite
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Gemini 2.5 Flash Lite wins on 3/5 benchmarks
Gemini 2.5 Flash Lite wins 3 of 5 shared benchmarks. Leads in language · math · reasoning.
Category leads
knowledge·GPT-4o-mini (2024-07-18)language·Gemini 2.5 Flash Litemath·Gemini 2.5 Flash Litereasoning·Gemini 2.5 Flash Lite
Hype vs Reality
Attention vs performance
GPT-4o-mini (2024-07-18)
#125 by perf·no signal
Gemini 2.5 Flash Lite
#44 by perf·no signal
Best value
Gemini 2.5 Flash Lite
2.1x better value than GPT-4o-mini (2024-07-18)
GPT-4o-mini (2024-07-18)
115.2 pts/$
$0.38/M
Gemini 2.5 Flash Lite
236.4 pts/$
$0.25/M
Vendor risk
Who is behind the model
OpenAI
$840.0B·Tier 1
Google DeepMind
$4.00T·Tier 1
Head to head
5 benchmarks · 2 models
GPT-4o-mini (2024-07-18)Gemini 2.5 Flash Lite
HELM · GPQA
GPT-4o-mini (2024-07-18) leads by +5.9
GPT-4o-mini (2024-07-18)
36.8
Gemini 2.5 Flash Lite
30.9
HELM · IFEval
Gemini 2.5 Flash Lite leads by +2.8
GPT-4o-mini (2024-07-18)
78.2
Gemini 2.5 Flash Lite
81.0
HELM · MMLU-Pro
GPT-4o-mini (2024-07-18) leads by +6.6
GPT-4o-mini (2024-07-18)
60.3
Gemini 2.5 Flash Lite
53.7
HELM · Omni-MATH
Gemini 2.5 Flash Lite leads by +20.0
GPT-4o-mini (2024-07-18)
28.0
Gemini 2.5 Flash Lite
48.0
HELM · WildBench
Gemini 2.5 Flash Lite leads by +2.7
GPT-4o-mini (2024-07-18)
79.1
Gemini 2.5 Flash Lite
81.8
Full benchmark table
| Benchmark | GPT-4o-mini (2024-07-18) | Gemini 2.5 Flash Lite |
|---|---|---|
HELM · GPQA | 36.8 | 30.9 |
HELM · IFEval | 78.2 | 81.0 |
HELM · MMLU-Pro | 60.3 | 53.7 |
HELM · Omni-MATH | 28.0 | 48.0 |
HELM · WildBench | 79.1 | 81.8 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.15 | $0.60 | 128K tokens (~64 books) | $2.62 | |
| $0.10 | $0.40 | 1.0M tokens (~524 books) | $1.75 |