Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt

Use this file to discover all available pages before exploring further.

Active model

Model IDParametersContextSpeedBest For
llama-3.3-70b70B128KFastGeneral, code, reasoning

Recommendation

Start with llama-3.3-70b — it’s the most popular open source model. Great at coding, reasoning, writing, and general tasks. Use gemini-2.5-flash or gemini-3-flash when you need very long context on Kyma.
from openai import OpenAI

client = OpenAI(base_url="https://kymaapi.com/v1", api_key="ky-...")

response = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "Hello!"}]
)

Model aliases

AliasResolves to
balancedllama-3.3-70b
model="balanced"  # → llama-3.3-70b