Start here

If you do not want to think too hard about model choice:
  • Start with qwen-3.6-plus for the best default across general work, coding, and reasoning.
  • Switch to kimi-k2.6 if your workflow looks like an agent with tools, long sessions, or screenshots.
  • Switch to deepseek-v4-pro if you need top reasoning with 1M context.
  • Switch to deepseek-v4-flash if you want V4-tier behavior at the lowest price.
  • Switch to gemini-2.5-flash if context length is the main problem.
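
The bullets above can be read as a small decision function. The following is a minimal sketch, not part of any Kyma SDK: the function name `pick_start_model` and its flags are hypothetical, and the precedence between flags is an assumption, since the guide does not rank the switches against each other.

```python
def pick_start_model(*, agentic=False, top_reasoning=False,
                     lowest_price=False, long_context=False):
    """Return a starting model id based on the 'Start here' bullets.

    The order of the checks is an assumption made for this sketch; the
    guide only says when to switch away from the default.
    """
    if agentic:            # tools, long sessions, or screenshots
        return "kimi-k2.6"
    if top_reasoning:      # top reasoning with 1M context
        return "deepseek-v4-pro"
    if lowest_price:       # V4-tier behavior at the lowest price
        return "deepseek-v4-flash"
    if long_context:       # context length is the main problem
        return "gemini-2.5-flash"
    return "qwen-3.6-plus"  # default across general work, coding, reasoning
```

Calling `pick_start_model()` with no flags returns the default, and setting a single flag returns the matching switch.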

Quick picks

| If you need… | First pick | Why | Second pick |
| --- | --- | --- | --- |
| One default model | qwen-3.6-plus | Best overall quality and safest default | deepseek-v4-flash |
| Best value | deepseek-v4-flash | V4-tier quality, 1M context, native reasoning, lowest V4 price | deepseek-v3 |
| Top reasoning | deepseek-v4-pro | 1.6T MoE flagship, 1M context, native reasoning | deepseek-r1 |
| Deep reasoning | deepseek-r1 | Best for logic, math, hard analysis | deepseek-v4-pro |
| Tool-heavy agents | kimi-k2.6 | Strong tool use, long context, multimodal | glm-5.1 |
| Long-running coding agents | glm-5.1 | Better for repo-scale engineering and sustained execution | minimax-m2.5 |
| Agentic coding | minimax-m2.5 | Strong engineering workflow fit for typical coding agents | glm-5.1 |
| Fast coding loops | qwen-3-32b | Lower latency while staying strong on code | qwen-3-coder |
| 1M context | gemini-2.5-flash | Cheapest long-context option | gemini-3-flash |
| Vision / screenshots | gemma-4-31b | Cheapest solid multimodal option | kimi-k2.6 |
| Cheap bulk automation | glm-4.5-air | Low-cost agentic path | glm-4.7-flash |
| Cheap long-context throughput | glm-4.7-flash | Fast, efficient, long context | gemini-2.5-flash |
| Default image generation | recraft-v4 | #1 HF Arena, design-quality, $0.054 | flux-2-pro |
| Photoreal / hero shots | flux-2-pro | BFL 32B, multi-reference, gen+edit | recraft-v4-pro |
| Multi-reference blend | flux-2-pro | Up to 10 source images via image_urls | |
| Image edit / inpaint | flux-kontext-pro | Image-to-image editor, requires image_url | flux-2-pro |
| Logos, typography, packaging | ideogram-v3 | Best legible-text image model | recraft-v4-vector |
| Native SVG / vector | recraft-v4-vector | True paths + layers, edit in Figma | recraft-v4-vector-pro |
| Print-ready design (4MP) | recraft-v4-pro | V4 quality at 4MP for print | recraft-v4-vector-pro |
| Hero-quality TTS (narration) | eleven-multilingual-v2 | 29 languages, expressive, brand-safe | eleven-turbo-v2-5 |
| Real-time voice agent | eleven-flash-v2-5 | ~75 ms time-to-first-byte, 32 languages | eleven-turbo-v2-5 |
| Music generation | elevenlabs-music | Prompt-driven, lyrics support, 1 s to 5 min | |
| Sound effects | elevenlabs-sfx | Whoosh, explosion, ambient; 0.5 s to 22 s | |

These are decision shortcuts, not absolute rankings. If your workload changes, your best model changes too.
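
The image and audio rows above state a few hard limits (up to 10 source images for flux-2-pro, a required image_url for flux-kontext-pro, and duration ranges for music and sound effects). A client could check these before sending a request. This is a sketch under stated assumptions: only `image_url` and `image_urls` come from this page, while the `duration_seconds` field, the function name, and treating the duration bounds as inclusive are all hypothetical.

```python
# Limits copied from the quick-picks table. Field names other than
# image_url / image_urls are hypothetical, not a documented request schema.
MAX_SOURCE_IMAGES = 10                    # flux-2-pro, via image_urls
DURATION_LIMITS_S = {
    "elevenlabs-music": (1.0, 300.0),     # 1 s to 5 min
    "elevenlabs-sfx": (0.5, 22.0),        # 0.5 s to 22 s
}


def check_media_request(model, params):
    """Return a list of problems found in a request dict (empty if none)."""
    problems = []
    if model == "flux-2-pro":
        urls = params.get("image_urls", [])
        if len(urls) > MAX_SOURCE_IMAGES:
            problems.append(
                f"flux-2-pro accepts at most {MAX_SOURCE_IMAGES} source images"
            )
    if model == "flux-kontext-pro" and not params.get("image_url"):
        problems.append("flux-kontext-pro requires image_url")
    if model in DURATION_LIMITS_S and "duration_seconds" in params:
        low, high = DURATION_LIMITS_S[model]
        # Inclusive bounds are an assumption; the table only gives a range.
        if not low <= params["duration_seconds"] <= high:
            problems.append(
                f"{model} duration must be between {low} and {high} seconds"
            )
    return problems
```

An empty list means the request passes these local checks; it says nothing about what the API itself will accept.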

Choose by constraint

Pick qwen-3.6-plus. Use it when you want one model that is strong at general work, coding, reasoning, and multilingual tasks without forcing a lot of tradeoff thinking.
Pick deepseek-v4-flash. It is the best first stop when you want strong quality at lower cost: 1M context, native reasoning, MIT license. If you need an even cheaper lane for routine workloads, look at gpt-oss-120b or glm-4.5-air. If you have a stable production workload already on deepseek-v3, it stays available.
Pick deepseek-v4-pro. It is the V4 flagship: 1.6T MoE, 1M context, native reasoning. Best fit for complex coding, multi-step analysis, and research-grade work where quality wins over latency.
Pick deepseek-r1. Use it for difficult analysis, logic, math, and multi-step planning where you want a slower, deeper chain-of-thought trace. If it feels too slow, fall back to deepseek-v4-pro or qwen-3.6-plus.
Start with kimi-k2.6. It is the best first pick when your workload uses tools, screenshots, or long multi-step sessions. If you want a text-only engineering alternative for repo-scale work, try glm-5.1.
Start with glm-5.1. It is the better fit when the work is repo-scale, multi-file, and long-horizon. If your agent is more typical day-to-day coding than sustained engineering execution, fall back to minimax-m2.5.
Start with minimax-m2.5. It fits engineering workflows well for normal coding-agent usage. If you want a newer productivity-oriented variant, try minimax-m2.7. If you want a stronger long-horizon engineering model, move up to glm-5.1.
Pick qwen-3-32b. It is the best fit for tight edit-run-debug loops. If you want a more code-specialized model with more context, try qwen-3-coder.
Pick gemini-2.5-flash for the safest long-context default. If you want a newer preview path, use gemini-3-flash. If you want cheaper long-context throughput without needing multimodal input, look at glm-4.7-flash.
Start with gemma-4-31b. It is the cheapest strong multimodal option. If you also need stronger agent behavior, upgrade to kimi-k2.6.
Start with glm-4.5-air. It is the cheaper agentic lane for repeated automation tasks. If the workload is more about long context and throughput than agent behavior, try glm-4.7-flash.
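
Two of the notes above name explicit fallbacks (deepseek-r1 to deepseek-v4-pro or qwen-3.6-plus, and glm-5.1 to minimax-m2.5). One way to encode that is a small retry wrapper. This is an illustrative sketch only: `complete_with_fallback` and the `call` argument are hypothetical, `call` stands in for whatever client function you already use, and treating a raised exception (for example a timeout) as the trigger for falling back is an assumption.

```python
# Fallback orders taken from the "fall back to" wording in the notes above.
FALLBACKS = {
    "deepseek-r1": ["deepseek-v4-pro", "qwen-3.6-plus"],
    "glm-5.1": ["minimax-m2.5"],
}


def complete_with_fallback(model, prompt, call):
    """Try `model`, then its listed fallbacks, until one call succeeds.

    `call(model_id, prompt)` is supplied by the caller (hypothetical here).
    Returns a (model_id_used, result) tuple.
    """
    last_error = None
    for candidate in [model, *FALLBACKS.get(model, [])]:
        try:
            return candidate, call(candidate, prompt)
        except Exception as exc:  # a real client should catch narrower errors
            last_error = exc
    raise RuntimeError(f"all candidates failed for {model}") from last_error
```

Returning the model id alongside the result keeps it visible which model actually answered, which matters when comparing cost or quality later.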

Tradeoffs that matter

| Model | Main strength | Main tradeoff |
| --- | --- | --- |
| qwen-3.6-plus | Best overall default | Not the cheapest or fastest |
| deepseek-v4-pro | Top reasoning, 1M context, native reasoning | Premium price, preview stage |
| deepseek-v4-flash | Best value V4, 1M context, native reasoning | Preview stage |
| deepseek-v3 | Stable previous-gen flagship | Smaller context, no native reasoning |
| deepseek-r1 | Best chain-of-thought reasoning | Slower |
| kimi-k2.6 | Best tool-heavy agent behavior | Premium cost |
| minimax-m2.5 | Strong engineering workflows | Less general-purpose than Qwen flagship |
| qwen-3-32b | Fast coding | Shorter context |
| gemini-2.5-flash | 1M context at good price | Not the strongest default for coding agents |
| gemma-4-31b | Cheap vision | Weaker than flagship text models on hard reasoning |
| glm-5.1 | Long-running coding agents and repo-scale engineering | Premium lane, text-only, less battle-tested in Kyma than Qwen/Kimi |
| glm-4.5-air | Cheap agentic bulk tasks | Lower ceiling than flagship models |
| glm-4.7-flash | Cheap long-context throughput | Preview-stage model |

Use by task

| Task | First pick | Fallback |
| --- | --- | --- |
| General chat / assistant | qwen-3.6-plus | deepseek-v4-flash |
| Coding assistant | qwen-3.6-plus | qwen-3-32b |
| Autonomous coding agent | kimi-k2.6 | minimax-m2.5 |
| Repo-scale engineering work | glm-5.1 | deepseek-v4-pro |
| Math / reasoning / hard analysis | deepseek-v4-pro | deepseek-r1 |
| Long document summarization | gemini-2.5-flash | glm-4.7-flash |
| Data extraction / structured output | qwen-3-32b | glm-4.5-air |
| Screenshot / image understanding | gemma-4-31b | kimi-k2.6 |
| Cheap automation | glm-4.5-air | gpt-oss-120b |
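
The task table can be kept as data and checked against whatever models are currently offered. A minimal sketch, assuming you already have the set of available model ids from the live catalog; the task keys, the `model_for_task` name, and the `available` argument are hypothetical.

```python
# (first pick, fallback) pairs copied from the "Use by task" table.
# The short task keys are made up for this sketch.
TASK_PICKS = {
    "general chat": ("qwen-3.6-plus", "deepseek-v4-flash"),
    "coding assistant": ("qwen-3.6-plus", "qwen-3-32b"),
    "autonomous coding agent": ("kimi-k2.6", "minimax-m2.5"),
    "repo-scale engineering": ("glm-5.1", "deepseek-v4-pro"),
    "math / reasoning": ("deepseek-v4-pro", "deepseek-r1"),
    "long document summarization": ("gemini-2.5-flash", "glm-4.7-flash"),
    "data extraction": ("qwen-3-32b", "glm-4.5-air"),
    "screenshot understanding": ("gemma-4-31b", "kimi-k2.6"),
    "cheap automation": ("glm-4.5-air", "gpt-oss-120b"),
}


def model_for_task(task, available):
    """Return the first pick if it is available, else the fallback, else None."""
    for model in TASK_PICKS[task]:
        if model in available:
            return model
    return None
```

Returning `None` rather than guessing a third option leaves the decision with the caller when neither listed model is offered.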

Still not sure?

  • Use the alias best for qwen-3.6-plus
  • Use the alias agent for kimi-k2.6
  • Use the alias reasoning for deepseek-r1
  • Use the alias long-context for gemini-2.5-flash
See model aliases and all models for the canonical live catalog.
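
For logging or cost tracking it can help to know which concrete model an alias currently stands for. The mapping below mirrors the four aliases listed above as a local lookup. It is a sketch only: how the API resolves aliases is not described here, the mapping may change with the live catalog, and `resolve_model` is a hypothetical helper.

```python
# Snapshot of the aliases listed on this page; the live catalog is canonical.
ALIASES = {
    "best": "qwen-3.6-plus",
    "agent": "kimi-k2.6",
    "reasoning": "deepseek-r1",
    "long-context": "gemini-2.5-flash",
}


def resolve_model(name):
    """Map an alias to its model id; pass any other name through unchanged."""
    return ALIASES.get(name, name)
```

Passing a concrete model id such as `glm-5.1` returns it unchanged, so the helper can wrap every model name without special cases.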