Documentation Index

Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt

Use this file to discover all available pages before exploring further.

Kyma exposes active models through one OpenAI-compatible endpoint. The hard part is not integration but choosing the right model for the job. If you just want the short version:
  • Use qwen-3.6-plus for the best default
  • Use kimi-k2.6 for tool-heavy agents
  • Use deepseek-r1 for hard reasoning
  • Use gemini-2.5-flash for 1M context
See "Which model should I use?" for the full decision page.

Quick decision guide

| If you need… | First pick | Alias | Why |
| --- | --- | --- | --- |
| Safest default | qwen-3.6-plus | best | Best overall quality across common tasks |
| Tool-heavy agents | kimi-k2.6 | agent | Best first pick for tools, long sessions, screenshots |
| Deep reasoning | deepseek-r1 | reasoning | Best for logic, math, and difficult analysis |
| Fast coding loops | qwen-3-32b | fast | Lower latency for code/debug loops |
| Code-specialized output | qwen-3-coder | code | Code-focused model with longer context |
| Long documents | gemini-2.5-flash | long-context | 1M context window |
| Vision | gemma-4-31b | vision | Cheapest strong multimodal option |
| Balanced open model | llama-3.3-70b | balanced | Good compromise between quality and cost |
| Cheap automation | glm-4.5-air | | Cost-sensitive agentic workloads |
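For reference, the alias column above can be mirrored as a small lookup. This is only an illustrative sketch: the API appears to accept aliases directly as the model value, so a local copy like this is useful mainly for logging or validation. The names `ALIASES` and `resolve_model` are hypothetical, and the mapping is copied from the table, so it can drift from the live catalog.

```python
# Alias -> model ID, copied from the quick decision table.
# Hypothetical local mirror; the live catalog is the source of truth.
ALIASES = {
    "best": "qwen-3.6-plus",
    "agent": "kimi-k2.6",
    "reasoning": "deepseek-r1",
    "fast": "qwen-3-32b",
    "code": "qwen-3-coder",
    "long-context": "gemini-2.5-flash",
    "vision": "gemma-4-31b",
    "balanced": "llama-3.3-70b",
}


def resolve_model(name: str) -> str:
    """Return the model ID for an alias, or the name unchanged if it is not an alias."""
    return ALIASES.get(name, name)
```

Names that are not aliases, such as glm-4.5-air, pass through unchanged.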

Use cases that matter

1. I just need one model

Start with qwen-3.6-plus. That is the right answer most of the time if you are building:
  • a chatbot
  • a coding assistant
  • an internal copilot
  • a general-purpose product feature

2. I am building an agent

Start with kimi-k2.6. If your agent:
  • calls tools
  • works across multiple steps
  • reads screenshots or other visual context
  • needs long sessions
then kimi-k2.6 is the best first pick. If you want a text-only engineering alternative, try glm-5.1.

3. I care about cost

Start with deepseek-v3 for strong value. If the workload is more repetitive and automation-heavy than quality-sensitive, consider:
  • glm-4.5-air
  • gpt-oss-120b

4. I need deep reasoning

Use deepseek-r1. This is the right pick for:
  • hard analysis
  • logic-heavy tasks
  • math
  • planning where quality matters more than speed

5. I need long context

Use gemini-2.5-flash. If you want cheaper long-context throughput and do not need multimodal input, look at glm-4.7-flash.
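The five use cases above can be condensed into a small selection helper. This is a minimal sketch, not part of Kyma: the function name `pick_model`, its flags, and the priority order when several flags are set (reasoning, then tools, then long context, then cost) are all assumptions made for illustration.

```python
def pick_model(
    *,
    uses_tools: bool = False,
    hard_reasoning: bool = False,
    long_context: bool = False,
    cost_sensitive: bool = False,
) -> str:
    """Map coarse workload traits to a first-pick model ID from this guide.

    The order of the checks is an assumption; adjust it to your own priorities.
    """
    if hard_reasoning:
        return "deepseek-r1"       # use case 4: deep reasoning
    if uses_tools:
        return "kimi-k2.6"         # use case 2: agents
    if long_context:
        return "gemini-2.5-flash"  # use case 5: long context
    if cost_sensitive:
        return "deepseek-v3"       # use case 3: cost
    return "qwen-3.6-plus"         # use case 1: one default model
```

Calling `pick_model()` with no flags returns the default, qwen-3.6-plus.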

Multimodal

Image and video models bill per call (or per second of video) instead of per token, and run through separate async endpoints: see /v1/images/generations and /v1/videos/generations.

Image (per-image pricing)

| Model | Best for | Price |
| --- | --- | --- |
| flux-1.1-ultra | Cinematic photo, hero shots, editorial | $0.081 / image |
| flux-kontext-pro | Image-to-image edit, inpaint, refinement | $0.054 / image |
| ideogram-v3 | Typography, packaging, posters, logos | $0.108 / image |
| recraft-v3 | Vector illustration, brand assets | $0.054 / image |

Video (per-second pricing)

| Model | Best for | Duration | Price |
| --- | --- | --- | --- |
| kling-2.5-pro | Budget cinematic, brand b-roll | 5 or 10s | $0.0945 / sec |
| kling-3-pro | Premium cinematic, hero brand video | 3-15s | $0.1512 / sec |
| kling-3-pro-audio | Cinematic with native audio + dialogue | 3-15s | $0.2268 / sec |
| seedance-2-pro | Multi-shot action, social with audio | 4-15s | $0.4096 / sec |
| seedance-2-fast | Social shorts, rapid iteration | 4-15s | $0.3266 / sec |

All five video models accept an image_url to switch into image-to-video mode without changing the model ID.
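Because these models bill per image or per second, a cost estimate is simple multiplication. The sketch below uses the prices and duration ranges from the tables above. It assumes billing is strictly linear (no minimums or rounding) and that durations are whole seconds, neither of which this page confirms. The helper names are hypothetical.

```python
from decimal import Decimal

# Prices copied from the tables above (USD).
IMAGE_USD_PER_IMAGE = {
    "flux-1.1-ultra": Decimal("0.081"),
    "flux-kontext-pro": Decimal("0.054"),
    "ideogram-v3": Decimal("0.108"),
    "recraft-v3": Decimal("0.054"),
}

# model -> (USD per second, allowed durations in whole seconds; assumed integer)
VIDEO = {
    "kling-2.5-pro": (Decimal("0.0945"), {5, 10}),
    "kling-3-pro": (Decimal("0.1512"), set(range(3, 16))),
    "kling-3-pro-audio": (Decimal("0.2268"), set(range(3, 16))),
    "seedance-2-pro": (Decimal("0.4096"), set(range(4, 16))),
    "seedance-2-fast": (Decimal("0.3266"), set(range(4, 16))),
}


def image_cost(model: str, count: int = 1) -> Decimal:
    """Estimated USD cost of generating `count` images."""
    if count < 1:
        raise ValueError("count must be at least 1")
    return IMAGE_USD_PER_IMAGE[model] * count


def video_cost(model: str, seconds: int) -> Decimal:
    """Estimated USD cost of one clip; rejects durations outside the listed range."""
    price, allowed = VIDEO[model]
    if seconds not in allowed:
        raise ValueError(f"{model} does not list a {seconds}s duration")
    return price * seconds
```

For example, a 10-second kling-3-pro clip comes to 10 × $0.1512 = $1.512 under these assumptions.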

Canonical sources

For the current live catalog, use:

Switching models

You do not need to change your integration. Just change the model parameter:
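from openai import OpenAI

# Assumption: base URL inferred from the /v1/models examples on this page; the key is a placeholder.
client = OpenAI(base_url="https://kymaapi.com/v1", api_key="YOUR_KYMA_API_KEY")

response = client.chat.completions.create(
    model="kimi-k2.6",  # was: "qwen-3.6-plus"
    messages=[{"role": "user", "content": "Write a Python function"}]
)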
Or use an alias:
response = client.chat.completions.create(
    model="agent",  # resolves to kimi-k2.6
    messages=[{"role": "user", "content": "Plan the refactor and call tools as needed"}]
)
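Since only the model parameter changes, one option is to read it from configuration so that switching models needs no code change. The following is a sketch of that idea. The environment variable name `KYMA_MODEL` and the helper `configured_model` are hypothetical, not something Kyma defines.

```python
import os

DEFAULT_MODEL = "best"  # alias from the quick decision guide


def configured_model(env=None) -> str:
    """Return the model name from KYMA_MODEL (hypothetical variable), or the default."""
    env = os.environ if env is None else env
    value = env.get("KYMA_MODEL", "").strip()
    return value or DEFAULT_MODEL
```

The result can be passed straight through as `model=configured_model()` in the calls above.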

Discover models programmatically

curl https://kymaapi.com/v1/models
Examples:
# Agent-friendly models
curl "https://kymaapi.com/v1/models?recommended_for=agent&tools=true"

# Long-context models
curl "https://kymaapi.com/v1/models?min_context_window=128000"

# Vision-capable models
curl "https://kymaapi.com/v1/models?vision=true"
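The same queries can be assembled in Python. This sketch only constructs the URL and makes no request, because the response shape is not described on this page. The filter names come from the curl examples above. The helper name `models_url` is hypothetical, and it is an assumption that filters beyond the combinations shown can be mixed freely.

```python
from urllib.parse import urlencode

MODELS_URL = "https://kymaapi.com/v1/models"


def models_url(**filters) -> str:
    """Build a /v1/models URL; booleans become 'true'/'false', None values are skipped."""
    params = {}
    for key, value in filters.items():
        if value is None:
            continue
        if isinstance(value, bool):
            value = "true" if value else "false"
        params[key] = value
    query = urlencode(params)
    return f"{MODELS_URL}?{query}" if query else MODELS_URL


# Mirrors the "agent-friendly models" curl example.
agent_url = models_url(recommended_for="agent", tools=True)
```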