GLM 4.6

Compared with GLM-4.5, this generation brings several key improvements:

Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages.
Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability.
More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks.
Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Share

Model details

Context window202,752 tokens
Max completion size77 tokens
Prompt cost / 1K tokens$0.00000035
Completion cost / 1K tokens$0.0000015
Accepts
Produces

Benchmark performance

Overall

75
score
10th
placement

Cost

96
score
5th
placement

Logic

73
score
10th
placement

Speed

71
score
25th
placement

Scoring

34
score
14th
placement

Tool Use

51
score
4th
placement

Hallucination

94
score
3rd
placement

Classification

39
score
2nd
placement

Structured Output

67
score
5th
placement

Pricing

Usage pricing
Prompt
$0.00000035
Completion
$0.0000015
Request
FREE
Image
FREE
Web Search
FREE
Internal Reasoning
FREE
Input Cache Read
FREE
Input Cache Write
FREE

Best Overall scoring LLMs

xAI

Grok 4 Fast

88
score
1st
placement
Qwen

Qwen3 VL 235B A22B Instruct

86
score
2nd
placement
xAI

Grok 4.1 Fast

84
score
3rd
placement
OpenAI

GPT-5.1 Chat

82
score
4th
placement
OpenAI

GPT-5.1-Codex

82
score
4th
placement
Anthropic

Claude Haiku 4.5

80
score
5th
placement