Compare · ModelsLive · 2 picked · head to head

GLM 5.1 vs Qwen3.6 Plus

Side by side · benchmarks, pricing, and signals you can act on.

CiteAdd another

Winner summary

GLM 5.1 wins on 6/11 benchmarks

GLM 5.1 wins 6 of 11 shared benchmarks. Leads in speed · coding · language.

Category leads

speed·GLM 5.1coding·GLM 5.1reasoning·Qwen3.6 Pluslanguage·GLM 5.1math·GLM 5.1knowledge·Qwen3.6 Plus

Hype vs Reality

Attention vs performance

GLM 5.1

#16 by perf·no signal

QUIET

Qwen3.6 Plus

#14 by perf·no signal

QUIET

See full mindshare →

Best value

Qwen3.6 Plus

2.0x better value than GLM 5.1

GLM 5.1

30.9 pts/$

$2.27/M

Qwen3.6 Plus

62.3 pts/$

$1.14/M

Explore pricing →

Vendor risk

Who is behind the model

z-ai

private · undisclosed

Unknown

Alibaba (Qwen)

$293.0B·Tier 1

Low risk

See the AI economy →

Head to head

11 benchmarks · 2 models

GLM 5.1Qwen3.6 Plus

Artificial Analysis · Agentic Index

GLM 5.1 leads by +5.4

Artificial Analysis Agentic Index · a composite score measuring how well a model performs in agentic workflows · multi-step tool use, planning, error recovery, and autonomous task completion. Aggregates results from multiple agentic benchmarks including SWE-bench, tool-use tests, and planning evaluations. The canonical single-number metric for "how good is this model as an agent?"

GLM 5.1

67.0

Qwen3.6 Plus

61.7

Artificial Analysis · Coding Index

GLM 5.1 leads by +0.5

Artificial Analysis Coding Index · a composite score that aggregates performance across multiple coding benchmarks into a single index. Tracks code generation quality, debugging ability, multi-language competence, and real-world software engineering tasks. Used by Artificial Analysis to rank model coding capability in a normalized, comparable format. Useful for developers choosing between models for coding-heavy workloads.

GLM 5.1

43.4

Qwen3.6 Plus

42.9

Artificial Analysis · Quality Index

GLM 5.1 leads by +1.4

GLM 5.1

51.4

Qwen3.6 Plus

50.0

LiveBench · Agentic Coding

GLM 5.1

55.0

Qwen3.6 Plus

55.0

LiveBench · Coding

Qwen3.6 Plus leads by +2.8

GLM 5.1

75.4

Qwen3.6 Plus

78.2

LiveBench · Data Analysis

Qwen3.6 Plus leads by +6.7

GLM 5.1

63.2

Qwen3.6 Plus

69.9

LiveBench · If

GLM 5.1 leads by +10.1

GLM 5.1

68.5

Qwen3.6 Plus

58.3

LiveBench · Language

Qwen3.6 Plus leads by +3.2

GLM 5.1

71.8

Qwen3.6 Plus

75.0

LiveBench · Mathematics

GLM 5.1 leads by +1.2

GLM 5.1

84.9

Qwen3.6 Plus

83.7

LiveBench · Overall

Qwen3.6 Plus leads by +0.7

GLM 5.1

70.2

Qwen3.6 Plus

70.8

LiveBench · Reasoning

Qwen3.6 Plus leads by +3.3

GLM 5.1

72.5

Qwen3.6 Plus

75.8

Full benchmark table

Benchmark	GLM 5.1	Qwen3.6 Plus
Artificial Analysis · Agentic Index Artificial Analysis Agentic Index · a composite score measuring how well a model performs in agentic workflows · multi-step tool use, planning, error recovery, and autonomous task completion. Aggregates results from multiple agentic benchmarks including SWE-bench, tool-use tests, and planning evaluations. The canonical single-number metric for "how good is this model as an agent?"	67.0	61.7
Artificial Analysis · Coding Index Artificial Analysis Coding Index · a composite score that aggregates performance across multiple coding benchmarks into a single index. Tracks code generation quality, debugging ability, multi-language competence, and real-world software engineering tasks. Used by Artificial Analysis to rank model coding capability in a normalized, comparable format. Useful for developers choosing between models for coding-heavy workloads.	43.4	42.9
Artificial Analysis · Quality Index	51.4	50.0
LiveBench · Agentic Coding	55.0	55.0
LiveBench · Coding	75.4	78.2
LiveBench · Data Analysis	63.2	69.9
LiveBench · If	68.5	58.3
LiveBench · Language	71.8	75.0
LiveBench · Mathematics	84.9	83.7
LiveBench · Overall	70.2	70.8
LiveBench · Reasoning	72.5	75.8

Pricing · per 1M tokens · projected $/mo at 10M tokens

Model	Input	Output	Context	Projected $/mo
GLM 5.1	$1.05	$3.50	203K tokens (~101 books)	$16.63
Qwen3.6 Plus	$0.33	$1.95	1.0M tokens (~500 books)	$7.31

People also compared

GPT-5.5 Pro vs Qwen3.6 Plus GLM 5.1 vs GPT-5.5 Pro GPT-5.5 vs Qwen3.6 Plus GLM 5.1 vs GPT-5.5 GPT-5 Chat vs Qwen3.6 Plus Claude Mythos Preview vs Qwen3.6 Plus GLM 5.1 vs GPT-5 Chat Claude Mythos Preview vs GLM 5.1