Arbitrage · gpt-oss-120b · 5 providers · $0.150 to $0.400 · 63% spread

Cheapest Provider for gpt-oss-120b

OpenAI's larger open release · reasoning-capable · still cheap on optimized stacks.

120B · Apache 2.0
Cheapest input: Groq (Speed leader)
Fastest: Cerebras (Wafer-scale chip)
Savings calculator
Compared with DeepInfra at $0.400/M input: at 100M input tokens/mo, routing to Groq at $0.150/M input saves $25/mo.
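The arithmetic behind that figure can be sketched in a few lines. This is a minimal illustration, not the page's actual calculator: the function name `monthly_savings` and its parameters are hypothetical, and it counts input tokens only, since the savings figure above is quoted against input price.

```python
def monthly_savings(cheap_per_m: float, baseline_per_m: float, tokens_millions: float) -> float:
    """Dollars saved per month by paying cheap_per_m instead of baseline_per_m.

    Prices are in $ per million input tokens; tokens_millions is the
    monthly input volume in millions of tokens.
    """
    return (baseline_per_m - cheap_per_m) * tokens_millions


# Input prices taken from the listing ($ per million tokens).
GROQ_INPUT = 0.150
DEEPINFRA_INPUT = 0.400

saved = monthly_savings(GROQ_INPUT, DEEPINFRA_INPUT, tokens_millions=100)
print(f"${saved:.2f}/mo saved")
```

Output tokens are priced separately ($0.600/M vs $1.20/M for these two providers), so a workload with heavy output would save more than this input-only estimate suggests.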
Sorted by input price

Provider         In $/M    Out $/M
Groq (Winner)    $0.150    $0.600
Fireworks AI     $0.180    $0.720
Cerebras         $0.200    $0.900
Together AI      $0.220    $0.600
DeepInfra        $0.400    $1.20

Notes: Groq: Speed leader; Fireworks AI: Standard serverless; Cerebras: Wafer-scale chip; Together AI: Flat pricing; DeepInfra: Higher markup

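Groq is the cheapest provider at $0.150/M input and $0.600/M output. Its input price is 63% below DeepInfra's $0.400/M, and its output price is half of DeepInfra's $1.20/M. Groq is also tagged as the speed leader.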
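The winner and the spread can be reproduced from the listed prices. The sketch below assumes the spread is defined as (highest input price minus lowest) divided by the highest, which matches the 63% figure after rounding; the page does not state its formula, and the helper names are hypothetical.

```python
# Input/output prices in $ per million tokens, copied from the listing.
PRICES = {
    "Groq": (0.150, 0.600),
    "Fireworks AI": (0.180, 0.720),
    "Cerebras": (0.200, 0.900),
    "Together AI": (0.220, 0.600),
    "DeepInfra": (0.400, 1.20),
}


def cheapest_by_input(prices: dict) -> str:
    """Return the provider name with the lowest input price."""
    return min(prices, key=lambda name: prices[name][0])


def input_spread(prices: dict) -> float:
    """Assumed definition: (max input - min input) / max input, as a fraction."""
    inputs = [pair[0] for pair in prices.values()]
    return (max(inputs) - min(inputs)) / max(inputs)


winner = cheapest_by_input(PRICES)
spread = input_spread(PRICES)
print(f"{winner}: {spread:.1%} below the most expensive input price")
```

The unrounded spread is 62.5%, which the page presumably rounds up to 63%.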