GLM-5 is Z.ai’s flagship open-source foundation model, engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading proprietary models.
Tested on 28 benchmarks with a 57.6% average score. Top scores: Chatbot Arena Elo — Overall (1455.6), Chatbot Arena Elo — Coding (1441.0), OpenCompass — AIME 2025 (95.8%).
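Arena Elo values are relative ratings rather than percentages, so a number like 1455.6 is only meaningful through its gap to other models. A minimal sketch of the standard Elo expected-score formula; the 1355.6 comparison rating below is illustrative, not a real leaderboard entry:

```python
# Elo ratings encode predicted head-to-head win rates, not absolute quality.
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A is preferred over model B in a pairwise vote."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))

# e.g. a 1455.6-rated model vs. one rated 100 points lower:
print(f"{elo_expected_score(1455.6, 1355.6):.2f}")  # ~0.64 -> preferred ~64% of the time
```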
For comparison, Gemma 4 31B scores 68.2 (98% as good) at $0.13 per 1M input tokens, 78% cheaper.
OpenCompass LiveCodeBench v6. Fresh competitive-programming problems that evaluate code generation without memorization.
Regularly refreshed coding problems; new problems are added monthly to avoid data contamination and memorization.
Real-world software engineering tasks from GitHub issues. Models must diagnose bugs and write patches that pass test suites. Human-verified subset of SWE-bench.
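As an illustration of how such a harness works, here is a minimal sketch of scoring one SWE-bench-style task in Python. The function and parameter names, the use of `git apply` and `pytest`, and the simplified pass criterion are assumptions; the real benchmark pins per-repository environments and also re-runs previously passing tests to catch regressions.

```python
import subprocess

def evaluate_patch(repo_dir: str, patch_file: str, fail_to_pass: list[str]) -> bool:
    """Apply a model-generated patch and check that the target tests now pass."""
    # Apply the model's patch to a clean checkout at the issue's base commit.
    applied = subprocess.run(
        ["git", "apply", patch_file], cwd=repo_dir, capture_output=True
    )
    if applied.returncode != 0:
        return False  # patch does not apply cleanly -> task failed

    # Run the tests that the issue's gold patch is known to fix (FAIL_TO_PASS).
    result = subprocess.run(
        ["python", "-m", "pytest", *fail_to_pass], cwd=repo_dir, capture_output=True
    )
    return result.returncode == 0  # all target tests pass -> task resolved
```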
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Abstraction and Reasoning Corpus (ARC). Tests fluid intelligence through novel visual pattern-recognition puzzles, designed as a core measure of general intelligence.
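To make the task format concrete, the sketch below encodes a toy ARC-style task as train/test grid pairs and accepts a candidate rule only if it reproduces every training output. The grids and the mirror rule are illustrative, not taken from the actual corpus.

```python
# An ARC task gives a few input->output grid pairs; the solver must infer the
# transformation from the "train" pairs and apply it to the "test" input.
Grid = list[list[int]]  # cell values are colors 0-9

task = {
    "train": [
        {"input": [[0, 1], [1, 0]], "output": [[1, 0], [0, 1]]},
        {"input": [[2, 0], [0, 2]], "output": [[0, 2], [2, 0]]},
    ],
    "test": [{"input": [[3, 3], [0, 3]]}],
}

def mirror(grid: Grid) -> Grid:
    # Candidate rule: flip each row left-to-right.
    return [row[::-1] for row in grid]

# A rule is accepted only if it reproduces every training output exactly.
if all(mirror(p["input"]) == p["output"] for p in task["train"]):
    print(mirror(task["test"][0]["input"]))  # [[3, 3], [3, 0]]
```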
OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
Mock AIME (American Invitational Mathematics Examination) problems from OTIS. Tests performance on competition mathematics.
- Type: text
- Context: 203K tokens (~101 books)
- Released: Feb 2026
- License: Open Source
- Status: Active
- Cost / Message: ~$0.003 (see the sketch after this list)
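The per-message figure presumably comes from per-token pricing multiplied by a typical message size. A rough sketch of that arithmetic; the per-1M-token prices and token counts below are assumptions chosen to land near the quoted ~$0.003, not published GLM-5 pricing.

```python
# Rough cost-per-message arithmetic with illustrative numbers.
PRICE_PER_1M_INPUT = 0.60   # USD per 1M input tokens (assumed)
PRICE_PER_1M_OUTPUT = 2.20  # USD per 1M output tokens (assumed)

def cost_per_message(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request given token counts and per-1M-token prices."""
    return (
        input_tokens * PRICE_PER_1M_INPUT / 1_000_000
        + output_tokens * PRICE_PER_1M_OUTPUT / 1_000_000
    )

# An average chat turn of ~1,500 input + ~950 output tokens lands near $0.003.
print(f"${cost_per_message(1_500, 950):.4f}")  # $0.0030
```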