Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt

Use this file to discover all available pages before exploring further.

Available models

Model IDParametersContextSpeedBest For
deepseek-v4-pro1.6T (49B active, MoE)1MMediumTop reasoning, complex coding
deepseek-v4-flash284B (13B active, MoE)1MFastGeneral + coding, best value V4
deepseek-r1671B (MoE)64KSlowReasoning, math, analysis
deepseek-v3671B (MoE)160KMediumPrevious-gen flagship, stable

V4 vs V3 — When to upgrade?

DeepSeek V4 ProDeepSeek V4 FlashDeepSeek V3
TierFlagshipValuePrevious-gen
Context1M1M160K
Max output65K65K8K
SpeedMediumFastMedium
ReasoningNativeNativeNo
LicenseMITMITMIT
ReleasePreviewPreviewStable
Rule of thumb: Default to deepseek-v4-flash for new work — it gives you 1M context, native reasoning, and the same family behavior at a fraction of the cost. Upgrade to deepseek-v4-pro when quality matters more than price. Stay on deepseek-v3 if you have a stable production workload that’s already tuned to it.

V4 vs R1 — When to use which?

V4 (both Pro and Flash) has reasoning built in, so it’s broadly capable on the same tasks R1 was made for. R1 still has its place for pure step-by-step chain-of-thought work where you want the deeper, slower reasoning trace. Most users will find V4 covers both general and reasoning needs.

Recommendation

Default to deepseek-v4-flash — the new V4 baseline. 1M context, MIT license, native reasoning, lowest V4-tier price. Step up to deepseek-v4-pro for top-tier reasoning, complex coding, or research-grade work. Use deepseek-r1 for pure reasoning tasks where you want a slower, deeper chain-of-thought trace. It maps to the reasoning model alias.
from openai import OpenAI

client = OpenAI(base_url="https://kymaapi.com/v1", api_key="ky-...")

# General + coding — use V4 Flash
response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Refactor this module and explain the tradeoffs."}]
)

# Top reasoning — use V4 Pro
response = client.chat.completions.create(
    model="deepseek-v4-pro",
    messages=[{"role": "user", "content": "Walk through the proof, then propose a tighter bound."}]
)

Model aliases

AliasResolves to
reasoningdeepseek-r1
# Using alias
model="reasoning"  # → deepseek-r1