Arbitrage · Qwen3.5 397B-A17B · 5 providers · $0.280 → $0.600 · 53% spread
Cheapest Provider for Qwen3.5 397B-A17B
Alibaba's flagship MoE model · 397B total / 17B active · multilingual leader.
Cheapest input
Alibaba Cloud
$0.280/M
Direct from Alibaba · 1M context
Fastest
Fireworks AI
140 tok/s
Serverless endpoint
Savings calculator
Save 53%
vs DeepInfra at $0.600/M input. For 100M input tokens/mo, routing to Alibaba Cloud saves $32/mo.
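The savings figure can be reproduced with a few lines of arithmetic. This is a minimal sketch that counts input tokens only, using the two input prices quoted on this page; the function name `monthly_savings` is hypothetical and not part of any provider's tooling.

```python
def monthly_savings(tokens_millions, cheap_price, expensive_price):
    """Return (dollars saved per month, fractional saving) on input tokens only.

    Prices are in dollars per million tokens.
    """
    price_gap = expensive_price - cheap_price
    saved = tokens_millions * price_gap
    fraction = price_gap / expensive_price
    return saved, fraction


# 100M input tokens/mo, Alibaba Cloud ($0.280/M) vs DeepInfra ($0.600/M)
saved, fraction = monthly_savings(100, 0.280, 0.600)
print(f"${saved:.2f}/mo saved ({fraction:.0%})")
```

Output tokens are not included here, so the real monthly difference depends on how much text the model generates.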
Sorted by input price
All 5 providers
| Provider | In $/M | Out $/M | Notes |
|---|---|---|---|
| Alibaba Cloud | $0.280 | $0.850 | Direct from Alibaba · 1M context |
| Hyperbolic | $0.350 | $0.350 | Upstart · aggressive pricing |
| Together AI | $0.450 | $0.450 | MoE-optimized · flat output |
| Fireworks AI | $0.500 | $1.50 | Serverless endpoint |
| DeepInfra | $0.600 | $1.80 | Lower context cap |
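Because the table is sorted by input price only, the ranking can change once output tokens are counted. The sketch below ranks providers by total monthly cost for a given input/output mix, using the prices listed above. The function name `monthly_cost` is hypothetical, and matching the $0.450 and $0.500 rows to Together AI and Fireworks AI is an assumption based on the order of the provider notes.

```python
# (input $/M, output $/M) taken from the table above.
PRICES = {
    "Alibaba Cloud": (0.280, 0.850),
    "Hyperbolic": (0.350, 0.350),
    "Together AI": (0.450, 0.450),   # assumed: row name inferred from notes order
    "Fireworks AI": (0.500, 1.50),   # assumed: row name inferred from notes order
    "DeepInfra": (0.600, 1.80),
}


def monthly_cost(input_millions, output_millions, prices=PRICES):
    """Return (provider, dollars per month) pairs, cheapest first."""
    costs = {
        name: input_millions * p_in + output_millions * p_out
        for name, (p_in, p_out) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Compare an input-heavy workload with one that generates more output.
for out_m in (10, 20):
    name, cost = monthly_cost(100, out_m)[0]
    print(f"100M in / {out_m}M out: cheapest is {name} at ${cost:.2f}/mo")
```

With these prices, Alibaba Cloud stays cheapest while output volume is below roughly 14% of input volume; above that, Hyperbolic's lower output price makes it the cheaper option.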
Frequently Asked Questions
The cheapest provider for Qwen3.5 397B-A17B is Alibaba Cloud, at $0.280/M input and $0.850/M output. Its input price is 53% lower than DeepInfra's. The model is served directly by Alibaba with a 1M context window.