GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
Tested on 9 benchmarks, with an average score of 40.1%. Top scores: Chatbot Arena Elo — Overall (1377.9, an Elo rating rather than a percentage), LiveBench — Coding (64.2%), LiveBench — Mathematics (62.5%).
Llama 3.3 70B Instruct (free) scores 29.6 (105% as good) at $0.00 per 1M input tokens, making it 100% cheaper.
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
- Type: multimodal
- Context: 131K tokens (~66 books)
- Released: Dec 2025
- License: Open Source
- Status: Active
- Cost / Message: ~$0.002
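
The description quotes a 128K-token limit while the spec list says 131K. These are most likely the same limit, counted in binary and decimal thousands respectively. The sketch below shows that arithmetic. It assumes "128K" means 128 × 1024 tokens, which the page does not state explicitly, and the helper name `context_tokens` is hypothetical.

```python
# Assumption: "128K" is a binary-K figure (128 * 1024 tokens),
# which a spec list would round to "131K" in decimal thousands.
BINARY_K = 1024


def context_tokens(k_units: int) -> int:
    """Convert a context size quoted in binary-K units to a token count."""
    return k_units * BINARY_K


tokens = context_tokens(128)
decimal_thousands = round(tokens / 1000)

print(tokens)             # 131072
print(decimal_thousands)  # 131
```

Under that assumption, both figures describe one context window of 131,072 tokens.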