Beta
Arbitrage · Qwen3.5 397B-A17B5 providers · $0.280$0.600 · 53% spread

Cheapest Provider for Qwen3.5 397B-A17B

Alibaba's flagship MoE model · 397B total / 17B active · multilingual leader.

397B (17B active)Apache 2.0Model detail page →
Cheapest input
Alibaba Cloud logo
Alibaba Cloud
Direct from Alibaba · 1M context
Fastest
Fireworks AI logo
Fireworks AI
Serverless endpoint
Savings calculator
vs DeepInfra at $0.600/M input. For 100M tokens/mo, that is $32/mo saved by routing to Alibaba Cloud.
Sorted by input price
ProviderIn $/MOut $/M
Alibaba Cloud logoAlibaba CloudWinner$0.280$0.850
H
Hyperbolic
$0.350$0.350
Together AI logoTogether AI$0.450$0.450
Fireworks AI logoFireworks AI$0.500$1.50
D
DeepInfra
$0.600$1.80

Notes: Alibaba Cloud · Direct from Alibaba · 1M context ; Hyperbolic · Upstart · aggressive pricing ; Together AI · MoE-optimized · flat output ; Fireworks AI · Serverless endpoint ; DeepInfra · Lower context cap

Alibaba Cloud at $0.280/M input and $0.850/M output. That is 53% cheaper than DeepInfra. Direct from Alibaba · 1M context.