Overview
glm-5.1 is the GLM flagship on Kyma. It is strongest when the task looks like engineering work instead of normal chat: longer coding runs, repo-scale planning, multi-step execution, and sustained agent behavior over time.
Specs
| Field | Value |
|---|---|
| Model ID | glm-5.1 |
| Best for | Long-running coding agents, repo-scale engineering, multi-step execution |
| Context window | 203K |
| Max output tokens | 65K |
| Input modalities | Text |
| Output modalities | Text |
| Tool calling | Yes |
| Structured outputs | Yes |
| Prompt caching | Yes |
| Speed | Medium |
| Cost band | Premium |
| Release stage | Stable |
Use this when
- You are running a coding agent for extended work, not just one-shot prompts.
- You need repo-scale planning, migration work, debugging, or multi-file implementation.
- You want a strong text-only engineering model that can sustain longer task chains.
Pick something else when
- You want the safest general-purpose default: use
qwen-3.6-plus. - You need multimodal agent behavior: use
kimi-k2.5. - You need the cheapest long-context option: use
glm-4.7-flashorgemini-2.5-flash. - You only need fast coding loops rather than long-horizon execution: use
qwen-3-32b.
Example
Related aliases
| Alias | Resolves to |
|---|---|
glm-flagship | glm-5.1 |