Generate a synthesized voice profile from a natural-language description. No reference recording needed. Returns a voice_id usable in /v1/audio/speech.
Synchronous endpoint. Describe a voice in plain English, get back a voice_id you can immediately use in /v1/audio/speech on any MiniMax voice model.
Use this when you don’t have voice talent, you’re prototyping a fictional character, or you want a brand-safe persona voice from scratch.
application/json body.
text.
minimax-voice-design is supported.
/v1/audio/speech afterward to render).
male or female.
child, young, middle-aged, or elderly.
200 OK JSON. Same shape as /v1/audio/voice-clone.
voice_id is reusable in unlimited TTS calls.
Voice design costs ~2× voice clone because synthesizing timbre from text is strictly more compute-intensive than reproducing a captured voice.
Using a voice_id with another account is rejected with 403 voice_not_owned.
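The request constraints on this page (a required description of at most 1000 characters, and minimax-voice-design as the supported model) can be checked client-side before spending a call. The following is a minimal sketch, not an official client. It assumes the JSON body uses the field names `model` and `description` (inferred from the error descriptions on this page), that the API accepts bearer-token auth, and that `voice_id` is a top-level key of the response. The endpoint URL is not stated on this page, so the caller passes it in. `build_voice_design_payload` and `design_voice` are hypothetical helper names.

```python
import json
import urllib.request

SUPPORTED_MODEL = "minimax-voice-design"  # the only model this page lists as supported
MAX_DESCRIPTION_CHARS = 1000              # longer input returns 400 description_too_long


def build_voice_design_payload(description: str, model: str = SUPPORTED_MODEL) -> dict:
    """Validate inputs locally and return the JSON body as a dict.

    The keys "model" and "description" are assumptions inferred from the
    error table, not confirmed field names.
    """
    if not description or not description.strip():
        raise ValueError("description is required (server: 400 invalid_request)")
    if len(description) > MAX_DESCRIPTION_CHARS:
        raise ValueError("description too long (server: 400 description_too_long)")
    if model != SUPPORTED_MODEL:
        raise ValueError("not a design model (server: 400 not_a_voice_design_model)")
    return {"model": model, "description": description}


def design_voice(endpoint_url: str, api_key: str, description: str) -> str:
    """POST the payload and return the voice_id from the 200 OK JSON response."""
    payload = build_voice_design_payload(description)
    request = urllib.request.Request(
        endpoint_url,  # supplied by the caller; the path is not given on this page
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumption: bearer-token auth
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["voice_id"]  # assumption: voice_id is a top-level key
```

Validating locally mirrors three of the 400 responses, so a malformed request fails before any network round trip. The server remains the source of truth for limits.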
| Status | error.code | When |
|---|---|---|
| 400 | not_a_voice_design_model | model is not a design SKU |
| 400 | description_too_long | description > 1000 chars |
| 400 | invalid_request | missing description |
| 402 | insufficient_credits | balance below $4.20 |
| 500 | ownership_write_failed | design succeeded but ownership row insert failed |
| 502 | provider_error | upstream MiniMax failure |
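The table above can be turned into a small lookup for client-side error reporting. This is a sketch under one assumption: that error responses are JSON of the form `{"error": {"code": "..."}}`, which is inferred from the `error.code` column header and not confirmed by this page. `describe_error` is a hypothetical helper name.

```python
import json

# error.code -> (HTTP status, condition), copied from the table above.
KNOWN_ERRORS = {
    "not_a_voice_design_model": (400, "model is not a design SKU"),
    "description_too_long": (400, "description > 1000 chars"),
    "invalid_request": (400, "missing description"),
    "insufficient_credits": (402, "balance below $4.20"),
    "ownership_write_failed": (500, "design succeeded but ownership row insert failed"),
    "provider_error": (502, "upstream MiniMax failure"),
}


def describe_error(status: int, body_text: str) -> str:
    """Turn a non-200 response into a one-line, human-readable message."""
    try:
        # Assumed body shape: {"error": {"code": "<error.code>"}}
        code = json.loads(body_text)["error"]["code"]
    except (ValueError, KeyError, TypeError):
        return f"HTTP {status}: unrecognized error body"
    if not isinstance(code, str) or code not in KNOWN_ERRORS:
        return f"HTTP {status}: unknown error code {code!r}"
    _documented_status, when = KNOWN_ERRORS[code]
    return f"HTTP {status} {code}: {when}"
```

For example, passing status 402 with the body `{"error": {"code": "insufficient_credits"}}` yields a message naming the balance condition, while a non-JSON body falls back to a generic message instead of raising.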
POST /v1/audio/voice-clone: clone from reference audio instead
POST /v1/audio/speech: use the designed voice