Skip to main content
All models are free, verified working, and accessible through the same endpoint. Use GET /v1/models to get the latest list programmatically.
curl https://kymaapi.com/v1/models
These models offer the best balance of quality, speed, and capabilities.

Llama 3.3 70B

Best all-rounder. General tasks, coding, reasoning. 128K context. Ultra-fast.
model="llama-3.3-70b"

Qwen 3 32B

Best for coding. Code generation, math, multilingual. 32K context.
model="qwen-3-32b"

Gemma 4 31B

Best multimodal. Vision capable, Google’s newest. 128K context.
model="gemma-4-31b"

Qwen 3 235B

Highest quality. 235B params, complex reasoning. Ultra-fast.
model="qwen-3-235b-cerebras"

All Models

Model IDNameContextSpeedBest For
llama-3.3-70bLlama 3.3 70B128K⚡ FastGeneral, code, reasoning
llama-4-scoutLlama 4 Scout 17B512K⚡ FastLong documents, analysis
llama-3.1-8bLlama 3.1 8B8K⚡⚡ FastestQuick tasks, classification
qwen-3-32bQwen 3 32B32K⚡ FastCode, math, multilingual
kimi-k2Kimi K2128K⚡ FastAgentic coding, tool use
gpt-oss-120bGPT-OSS 120B128KMediumGeneral intelligence, writing
gpt-oss-20bGPT-OSS 20B128K⚡ FastFast general tasks
gemma-4-31bGemma 4 31B128KMediumMultimodal, vision
gemma-4-26b-moeGemma 4 26B MoE128K⚡ FastEfficient inference
gemini-3-flashGemini 3 Flash1MMediumUltra-long context
gemini-2.5-flashGemini 2.5 Flash1MMediumLong context
gemma-3-27bGemma 3 27B128KMediumGeneral
llama-3.1-8b-cerebrasLlama 3.1 8B8K⚡⚡ FastestUltra-fast inference
qwen-3-235b-cerebrasQwen 3 235B32K⚡ FastComplex reasoning
Models are updated regularly. New models are added as they become available from providers.