Arbitrage · Llama 4 Maverick · 4 providers · $0.180 → $0.400 · 55% spread
Cheapest Provider for Llama 4 Maverick
Meta's multimodal Llama 4 variant · vision + text · long context.
Cheapest input
Fireworks AI
$0.180/M
Recently cut 18%
Fastest
Groq
420 tok/s
1M context · speed leader
Savings calculator
Save 55%
vs DeepInfra at $0.400/M input. For 100M input tokens/mo, that is $22/mo saved by routing to Fireworks AI at $0.180/M.
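The arithmetic behind the calculator can be sketched as follows. This is a minimal illustration, not the page's actual calculator: the function name `monthly_savings` and its parameters are hypothetical, and it considers input tokens only, as the figures above do.

```python
def monthly_savings(cheap_per_m: float, pricey_per_m: float,
                    tokens_millions: float) -> tuple[float, float]:
    """Return (dollars saved per month, fractional spread) on input price."""
    # Dollars saved: per-million price gap times monthly volume in millions.
    saved = (pricey_per_m - cheap_per_m) * tokens_millions
    # Spread: price gap as a fraction of the more expensive price.
    spread = (pricey_per_m - cheap_per_m) / pricey_per_m
    return saved, spread


# Fireworks AI ($0.180/M) vs DeepInfra ($0.400/M) at 100M input tokens/mo.
saved, spread = monthly_savings(0.180, 0.400, 100)
print(f"${saved:.2f}/mo saved, {spread:.0%} spread")  # $22.00/mo saved, 55% spread
```

The savings scale linearly with volume, so 1B input tokens/mo at the same prices would give ten times the dollar figure with the same 55% spread.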
Sorted by input price
All 4 providers
| Provider | In $/M | Out $/M | Notes |
|---|---|---|---|
| Fireworks AI | $0.180 | $0.720 | Recently cut 18% |
| Groq | $0.200 | $0.600 | 1M context · speed leader |
| Together AI | $0.270 | $0.850 | Standard |
| DeepInfra | $0.400 | $1.20 | Lower context cap |
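The table ranks providers by input price only, but a workload that also generates output tokens pays both rates. The sketch below shows one way to compare total monthly cost. The helper names `monthly_cost` and `cheapest` and the example workloads are hypothetical, and the names attached to the two middle price rows are an assumption based on the order of the provider notes.

```python
# (input $/M, output $/M) per provider, copied from the table above.
# Assumption: the two middle price rows belong to Groq and Together AI,
# matched by the order in which the provider notes are listed.
PRICES = {
    "Fireworks AI": (0.180, 0.720),
    "Groq": (0.200, 0.600),
    "Together AI": (0.270, 0.850),
    "DeepInfra": (0.400, 1.20),
}


def monthly_cost(provider: str, input_m: float, output_m: float) -> float:
    """Total dollars per month for input_m and output_m million tokens."""
    in_price, out_price = PRICES[provider]
    return input_m * in_price + output_m * out_price


def cheapest(input_m: float, output_m: float) -> str:
    """Provider with the lowest total monthly cost for this token mix."""
    return min(PRICES, key=lambda p: monthly_cost(p, input_m, output_m))


print(cheapest(100, 0))   # input-only workload: Fireworks AI
print(cheapest(100, 50))  # hypothetical output-heavy workload: Groq
```

Under these assumptions, the lowest input price does not always give the lowest bill: with 100M input and 50M output tokens, the $0.600/M output rate outweighs the $0.020/M input premium, so it may be worth checking the expected input/output ratio before routing.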
Frequently Asked Questions
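The cheapest provider for Llama 4 Maverick is Fireworks AI, at $0.180/M input and $0.720/M output. On input price, that is 55% cheaper than DeepInfra ($0.400/M). Fireworks AI recently cut its price by 18%.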