LIVETracking 971 AI models from 268 providers.

Models971·Providers268·Benchmarks128·Companies71·Agents165·TopQwen3 VL 235B A22B Instruct · 1415.8%·Updatedjust now·Data Points2,902·MCP Servers4,923

Home/Models/GLM 5.1

GLM 5.1

by z-ai · Released Apr 2026

Open Source

87.0

avg score

Rank #12

Top 5% of all models

Context

203K tokens (~101 books)

Input $/1M

$0.95

Output $/1M

$3.15

Type

text

License

Open Source

Benchmarks

12 tested

Data updated today

About

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Tested on 12 benchmarks with 70.2% average. Top scores: Chatbot Arena Elo — Overall (1467.4%), LiveBench — Mathematics (84.9%), LiveBench — Coding (75.4%).

Looking for similar performance at lower cost?
Qwen3.6 Plus scores 88.7 (102% as good) at $0.33/1M input · 66% cheaper

Capabilities

coding

65.2

#27 globally

reasoning

67.9

#24 globally

math

84.9

#13 globally

knowledge

70.2

#17 globally

language

70.1

#68 globally

speed

89.9

#6 globally

Benchmark Scores

Tested on 12 benchmarks · Ranked across 7 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

codingCompare coding →

LiveBench — Coding

Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

75.4—

LiveBench — Agentic Coding

LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.

55.0—

reasoningCompare reasoning →

LiveBench — Reasoning

Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

72.5—

LiveBench — Data Analysis

Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

63.2—

mathCompare math →

LiveBench — Mathematics

Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.

84.9—

Quick compare:

vs Qwen2.5 72B Instruct Abliterated

vs DeepSeek R1 Distill Qwen 14B

vs GPT-5.2-Codex

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Qwen2.5 72B Instruct Abliterated

DeepSeek R1 Distill Qwen 14B

Links

Info

Research

Technical Report

Documentation

API Docs Playground

Community

Source Code

GitHub Hugging Face

BenchGecko API

glm-5-1

Specifications

Typetext
Context203K tokens (~101 books)
ReleasedApr 2026
LicenseOpen Source
StatusActive
Cost / Message~$0.005

Available On

z-ai$0.95

Categories

coding reasoning math knowledge language speed

Learn More

context-window transformer open-weights tokens

Share & Export

Related Models

Qwen2.5 72B Instruct Abliterated

DeepSeek R1 Distill Qwen 14B

Frequently Asked Questions

GLM 5.1 is an open-source text AI model by z-ai, released in April 2026. It has an average benchmark score of 87.0. Context window: 203K tokens.

Related Models

Qwen2.5 72B Instruct Abliterated · HuiHui AI DeepSeek R1 Distill Qwen 14B · DeepSeek GPT-5.2-Codex · OpenAI Qwen3.6 Plus · Alibaba Qwen GPT-5 Chat · OpenAI

Benchmarks

Chatbot Arena Elo — Overall LiveBench — Mathematics LiveBench — Coding LiveBench — Reasoning LiveBench — Language

Related Pages

z-ai · Provider All Models Compare Models Context Window · Glossary