Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt

Use this file to discover all available pages before exploring further.

Setup

1

Open Cursor Settings

Cmd+, (Mac) or Ctrl+, (Windows) → search “OpenAI”
2

Configure API

Set the following:
  • OpenAI API Key: ky-your-api-key
  • API Base URL: https://kymaapi.com/v1
3

Select model

Add custom model: llama-3.3-70b or qwen-3-32b
4

Start coding

Use Cmd+K to chat with AI using Kyma’s models.
ModelAliasBest For
qwen-3.6-plusbestRecommended — highest quality overall
qwen-3-codercodeBest alias for code-focused tasks
kimi-k2.6agentBest tool calling, agentic workflows
gemini-2.5-flashlong-context1M context for large codebases
deepseek-v4-pro1M context, top reasoning. Complex refactors and long-codebase tasks.
deepseek-v4-flash1M context, value tier. Everyday completions where speed matters.
Use model aliases like best or code — they auto-update when better models become available.
curl "https://kymaapi.com/v1/models/recommend?agent=cursor"

Audio Models (optional)

Cursor is a coding IDE first, but if you bolt audio tooling onto your workflow (voice notes, meeting recap macros, audio QA fixtures), Kyma’s audio aliases work the same as any chat model:
  • transcribewhisper-v3-turbo — 228× realtime speech-to-text
  • audio-understandgemini-3-flash-audio — scene/tone/music recognition
Use them via POST /v1/audio/transcriptions and POST /v1/audio/understand. The full audio surface (TTS, music, SFX, voice clone/design) is documented in the Audio API reference and Model Aliases.

Prompt Caching

Prompt caching is automatic. Your system prompt and tool definitions are cached for 5 minutes, reducing costs by up to 90% on subsequent requests. No code changes needed. See Prompt Caching for details.