GLM 4.5 Air - Kyma API

Overview

glm-4.5-air is the low-cost GLM model for agentic work on Kyma. It suits bulk automation, repeated agent runs, and other cost-sensitive workloads that still need tool use and structured outputs.

Specs

Field	Value
Model ID	`glm-4.5-air`
Best for	Bulk agents, cheap automation, long-context tasks
Context window	131K tokens
Max output tokens	8K
Input modalities	Text
Output modalities	Text
Tool calling	Yes
Structured outputs	Yes
Prompt caching	Yes
Speed	Fast
Cost band	Cheap
Release stage	Stable
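
The context window and output cap in the table suggest splitting large jobs into batches instead of sending everything in one request. The following is a minimal sketch of that idea, not part of the Kyma API: the 4-characters-per-token estimate, the `prompt_overhead` reserve, the helper names, and the assumption that the context window covers both prompt and reply are all illustrative.

```python
from typing import Iterable, Iterator, List

CONTEXT_WINDOW = 131_000     # specs table lists 131K; rounded down to stay conservative
MAX_OUTPUT_TOKENS = 8_000    # specs table lists 8K
CHARS_PER_TOKEN = 4          # assumption: rough heuristic, not a real tokenizer


def estimate_tokens(text: str) -> int:
    """Very rough token estimate; swap in a real tokenizer if one is available."""
    return max(1, len(text) // CHARS_PER_TOKEN)


def batch_by_budget(items: Iterable[str], prompt_overhead: int = 1_000) -> Iterator[List[str]]:
    """Yield lists of items whose estimated size fits in one request.

    The input budget leaves room for the reply and for the instructions
    (assumes the context window covers both prompt and reply).
    """
    budget = CONTEXT_WINDOW - MAX_OUTPUT_TOKENS - prompt_overhead
    batch: List[str] = []
    used = 0
    for item in items:
        cost = estimate_tokens(item)
        # Start a new batch when adding this item would exceed the budget.
        if batch and used + cost > budget:
            yield batch
            batch, used = [], 0
        batch.append(item)
        used += cost
    if batch:
        yield batch
```

An item that is larger than the budget on its own still gets yielded as a single-item batch, so callers may want to truncate or reject such items first.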

Use this when

You need an inexpensive model for repeated agent runs.
You care about cost more than flagship-level quality.
You want tool use and structured outputs in a cheaper lane.
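
As a rough illustration of the tool-use case, the sketch below passes one tool definition in the OpenAI-style function-calling format and executes whatever call the model requests. It assumes Kyma accepts the standard `tools` parameter through the OpenAI-compatible endpoint; the `lookup_ticket` tool, its schema, and the helper names are hypothetical.

```python
import json

# Hypothetical tool definition in the OpenAI-style function-calling format.
LOOKUP_TICKET_TOOL = {
    "type": "function",
    "function": {
        "name": "lookup_ticket",
        "description": "Fetch a support ticket by ID.",
        "parameters": {
            "type": "object",
            "properties": {"ticket_id": {"type": "string"}},
            "required": ["ticket_id"],
        },
    },
}


def lookup_ticket(ticket_id: str) -> dict:
    """Stand-in for a real ticket store (hypothetical)."""
    return {"ticket_id": ticket_id, "status": "open"}


def build_request(user_prompt: str) -> dict:
    """Keyword arguments for client.chat.completions.create."""
    return {
        "model": "glm-4.5-air",
        "messages": [{"role": "user", "content": user_prompt}],
        "tools": [LOOKUP_TICKET_TOOL],
    }


def run_tool_call(name: str, arguments_json: str) -> str:
    """Execute one tool call requested by the model; return the result as JSON text."""
    if name != "lookup_ticket":
        raise ValueError(f"unknown tool: {name}")
    args = json.loads(arguments_json)  # the model sends arguments as a JSON string
    return json.dumps(lookup_ticket(**args))


def main() -> None:
    # Imported here so the helpers above work without the SDK installed.
    from openai import OpenAI

    client = OpenAI(base_url="https://kymaapi.com/v1", api_key="ky-...")
    response = client.chat.completions.create(
        **build_request("What is the status of ticket T-42?")
    )
    # tool_calls is None when the model answers directly instead of calling a tool.
    for call in response.choices[0].message.tool_calls or []:
        print(run_tool_call(call.function.name, call.function.arguments))
```

Calling `main()` with a real key sends the request. In a full agent loop, each tool result would be appended to the conversation as a `tool` message and the model called again.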

Pick something else when

You want flagship-level agent quality: use glm-5.1 or kimi-k2.5.
You want the strongest reasoning path: use deepseek-r1.
You need multimodal input: use gemma-4-31b or kimi-k2.5.
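
The guidance above can be captured in a small lookup for code that picks a model per task. This is only a sketch: the model IDs come from the list above, while the need labels and the `candidate_models` helper are illustrative names.

```python
from typing import Dict, List, Optional

DEFAULT_MODEL = "glm-4.5-air"

# Model IDs as named in the guidance above; the need labels are illustrative.
ALTERNATIVES: Dict[str, List[str]] = {
    "flagship_agent": ["glm-5.1", "kimi-k2.5"],
    "strongest_reasoning": ["deepseek-r1"],
    "multimodal_input": ["gemma-4-31b", "kimi-k2.5"],
}


def candidate_models(need: Optional[str] = None) -> List[str]:
    """Return suggested model IDs for a need, or the cheap default if none applies."""
    return ALTERNATIVES.get(need, [DEFAULT_MODEL])
```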

Example

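from openai import OpenAI

# Replace "ky-..." with your Kyma API key.
client = OpenAI(base_url="https://kymaapi.com/v1", api_key="ky-...")

response = client.chat.completions.create(
    model="glm-4.5-air",
    # Include the ticket text in the prompt; JSON mode generally expects the prompt to ask for JSON.
    messages=[{"role": "user", "content": "Generate a structured triage summary for these 200 support tickets. Respond in JSON."}],
    # Request a single JSON object as the reply.
    response_format={"type": "json_object"},
)

# The JSON text is in the first choice's message content.
print(response.choices[0].message.content)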
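
With `response_format={"type": "json_object"}` the reply still arrives as a string, so it needs parsing before use. The helper below is a minimal sketch of defensive parsing; `parse_json_reply` is a hypothetical name, and the truncation note assumes Kyma reports `finish_reason` the way the OpenAI API does.

```python
import json
from typing import Any, Optional


def parse_json_reply(content: Optional[str]) -> dict:
    """Parse a JSON-mode reply, raising ValueError with context on failure."""
    if not content:
        raise ValueError("empty reply")
    try:
        data: Any = json.loads(content)
    except json.JSONDecodeError as exc:
        # A reply cut off at the output limit often ends mid-object.
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")
    return data
```

A typical call would be `summary = parse_json_reply(response.choices[0].message.content)`. If parsing fails, checking whether `response.choices[0].finish_reason` is `"length"` can help tell a truncated reply from a malformed one.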
