Qwen3.5 397B A17B vs Qwen3.6 Plus
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Qwen3.6 Plus wins all 3 shared benchmarks and leads in speed.
Category leads
Speed · Qwen3.6 Plus
Hype vs Reality
Attention vs performance
Qwen3.5 397B A17B
#5 by perf · no signal
Qwen3.6 Plus
#14 by perf · no signal
Best value
Qwen3.6 Plus
1.1x better value than Qwen3.5 397B A17B
Qwen3.5 397B A17B
57.4 pts/$
$1.36/M
Qwen3.6 Plus
62.3 pts/$
$1.14/M
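The "1.1x better value" figure can be checked against the two pts/$ values shown above. A minimal sketch; the function name is illustrative and not something the page defines.

```python
def value_ratio(better_pts_per_dollar: float, worse_pts_per_dollar: float) -> float:
    """Return how many times more points per dollar the better model delivers."""
    return better_pts_per_dollar / worse_pts_per_dollar

# pts/$ values as displayed in the "Best value" panel.
ratio = value_ratio(62.3, 57.4)

# Rounded to one decimal place, this matches the "1.1x" shown on the page.
print(f"{ratio:.1f}x better value")
```

The unrounded ratio is a little under 1.09, so the advantage is closer to 9% than to a full 10%.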
Vendor risk
Who is behind the model
Alibaba (Qwen)
$293.0B · Tier 1
Alibaba (Qwen)
$293.0B · Tier 1
Head to head
3 benchmarks · 2 models
Qwen3.5 397B A17B · Qwen3.6 Plus
Artificial Analysis · Agentic Index
Qwen3.6 Plus leads by +5.8
Artificial Analysis Agentic Index: a composite score measuring how well a model performs in agentic workflows, covering multi-step tool use, planning, error recovery, and autonomous task completion. It aggregates results from multiple agentic benchmarks, including SWE-bench, tool-use tests, and planning evaluations, into a single-number answer to "how good is this model as an agent?"
Qwen3.5 397B A17B
55.8
Qwen3.6 Plus
61.7
Artificial Analysis · Coding Index
Qwen3.6 Plus leads by +1.6
Artificial Analysis Coding Index: a composite score that aggregates performance across multiple coding benchmarks into a single index. It tracks code generation quality, debugging ability, multi-language competence, and real-world software engineering tasks. Artificial Analysis uses it to rank model coding capability in a normalized, comparable format, which makes it useful for developers choosing between models for coding-heavy workloads.
Qwen3.5 397B A17B
41.3
Qwen3.6 Plus
42.9
Artificial Analysis · Quality Index
Qwen3.6 Plus leads by +4.9
Qwen3.5 397B A17B
45.0
Qwen3.6 Plus
50.0
Full benchmark table
| Benchmark | Qwen3.5 397B A17B | Qwen3.6 Plus |
|---|---|---|
| Artificial Analysis · Agentic Index | 55.8 | 61.7 |
| Artificial Analysis · Coding Index | 41.3 | 42.9 |
| Artificial Analysis · Quality Index | 45.0 | 50.0 |
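The per-benchmark leads and the 3-of-3 win count can be recomputed from the table scores. The sketch below uses only the displayed (rounded) scores; the variable names are illustrative. Differences computed this way can sit 0.1 away from the leads shown on the page (+5.8 and +4.9), presumably because the page subtracts unrounded scores. That explanation is an assumption, not something the page states.

```python
# (Qwen3.5 397B A17B, Qwen3.6 Plus) scores as displayed in the table.
scores = {
    "Agentic Index": (55.8, 61.7),
    "Coding Index": (41.3, 42.9),
    "Quality Index": (45.0, 50.0),
}

# Lead of Qwen3.6 Plus over Qwen3.5 397B A17B, rounded to one decimal place.
leads = {name: round(plus - base, 1) for name, (base, plus) in scores.items()}

# Number of benchmarks where Qwen3.6 Plus scores strictly higher.
wins = sum(1 for base, plus in scores.values() if plus > base)

for name, lead in leads.items():
    print(f"{name}: Qwen3.6 Plus leads by +{lead}")
print(f"Qwen3.6 Plus wins {wins} of {len(scores)} benchmarks")
```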
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| Qwen3.5 397B A17B | $0.39 | $2.34 | 262K tokens (~131 books) | $8.78 |
| Qwen3.6 Plus | $0.33 | $1.95 | 1.0M tokens (~500 books) | $7.31 |
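The page does not say how the projected monthly figure or the blended $/M price is derived. The sketch below rests on two assumptions that happen to fit the displayed numbers: a 3:1 input-to-output token split over 10M tokens for the projection, and a simple mean of input and output price for the blended $/M. Function and variable names are hypothetical.

```python
def projected_monthly_cost(input_price: float, output_price: float,
                           total_tokens_m: float = 10.0,
                           input_share: float = 0.75) -> float:
    """Cost in dollars for total_tokens_m million tokens at per-1M prices.

    input_share=0.75 encodes the assumed 3:1 input:output split.
    """
    input_tokens_m = total_tokens_m * input_share
    output_tokens_m = total_tokens_m - input_tokens_m
    return input_tokens_m * input_price + output_tokens_m * output_price

def mean_price(input_price: float, output_price: float) -> float:
    """Assumed blended price: plain average of input and output price."""
    return (input_price + output_price) / 2

# Rows in the order they appear in the pricing table.
first_row = projected_monthly_cost(0.39, 2.34)   # about 8.775, listed as $8.78
second_row = projected_monthly_cost(0.33, 1.95)  # about 7.35, listed as $7.31

first_mean = mean_price(0.39, 2.34)    # close to the $1.36/M shown under "Best value"
second_mean = mean_price(0.33, 1.95)   # close to the $1.14/M shown under "Best value"
```

Under these assumptions the first row matches the listed projection, while the second comes out a few cents higher than $7.31. One possible cause is that the displayed $0.33 input price is itself rounded. Real workloads with a different input-to-output mix will shift both projections.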