Overview

Kyma API gives you instant access to the best open source LLMs through a single endpoint. It is compatible with both the OpenAI and Anthropic SDKs. Multi-provider redundancy keeps your requests going through even when individual providers are down.

51 models, one endpoint

Text: Qwen 3.6, DeepSeek V4, Gemma 4, GPT-OSS, Kimi K2.6, Gemini, Llama, MiniMax, GLM.
Image generation: GPT Image 2, FLUX, Ideogram, Recraft, MiniMax Image.
Video: Kling, Seedance, Hailuo.
Voice and music: ElevenLabs, MiniMax.

Auto-Failover

Multi-provider redundancy. If one provider fails, your request is automatically retried on another.

OpenAI Compatible

Drop-in replacement. Works with any OpenAI SDK, LangChain, Cursor, and more.

Why Kyma?

              Kyma                   Other gateways   Direct API
Price         $0.50 free credit      $0.001+/token    Free (limited)
Models        Curated active models  200+ (paid)      1 per provider
Redundancy    Auto-failover          Single provider  None
Setup time    30 seconds             5 minutes        Per-API signup
OpenAI SDK

How it works

Your App → Kyma API → Best available infrastructure
                    ↳ Auto-failover if primary is down
                    ↳ Same model on backup, or equivalent quality
Kyma automatically routes your request to the fastest available infrastructure. If a provider is down, your request is silently retried on a backup that serves the same model or one of equivalent quality, so a provider outage does not surface as an error in your application. You do not need to manage multiple API keys or monitor provider status.
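To make the failover idea concrete, here is a minimal conceptual sketch in Python. It is not Kyma's actual implementation, which runs server-side and is not described on this page. The names `call_with_failover`, `AllProvidersFailed`, `primary`, and `backup` are hypothetical and exist only for illustration.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the list has failed."""


def call_with_failover(
    providers: Sequence[Callable[[str], str]],
    prompt: str,
) -> str:
    """Try each provider in order and return the first successful reply."""
    errors: list[Exception] = []
    for provider in providers:
        try:
            return provider(prompt)
        except Exception as exc:  # a real gateway would catch narrower errors
            errors.append(exc)
    # Every provider raised: report the failure, chaining the last error.
    raise AllProvidersFailed(f"{len(errors)} provider(s) failed") from (
        errors[-1] if errors else None
    )


# Hypothetical stand-ins for upstream providers.
def primary(prompt: str) -> str:
    raise ConnectionError("primary is down")


def backup(prompt: str) -> str:
    return f"backup reply to: {prompt}"


print(call_with_failover([primary, backup], "Hello!"))
# prints: backup reply to: Hello!
```

The caller only sees the reply from the first provider that succeeds. An error reaches the caller only if every provider in the list fails.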

Quick example

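from openai import OpenAI

# Point the standard OpenAI client at the Kyma endpoint.
client = OpenAI(
    base_url="https://kymaapi.com/v1",
    api_key="YOUR_KEY",  # replace with your Kyma API key
)

# Send a chat completion request, as you would with the OpenAI API.
response = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)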
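If you prefer not to install an SDK, the same request can presumably be sent as plain HTTP, since the endpoint is described as OpenAI compatible. The sketch below only builds the request with the Python standard library. The `/chat/completions` path and the `Bearer` authorization header follow the OpenAI convention and are assumptions here, not details confirmed on this page. `build_chat_request` is a hypothetical helper name.

```python
import json
import urllib.request

BASE_URL = "https://kymaapi.com/v1"  # base URL taken from the quick example


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",  # assumed path, per OpenAI convention
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_chat_request("YOUR_KEY", "llama-3.3-70b", "Hello!")
print(req.get_method(), req.full_url)

# Sending requires a valid key and network access:
# with urllib.request.urlopen(req) as resp:
#     reply = json.load(resp)
#     print(reply["choices"][0]["message"]["content"])
```

Building the request separately from sending it lets you inspect the URL, headers, and JSON body before spending any credit.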

Ready to start?

Get your API key and make your first request in 30 seconds →