Overview
minimax-voice-design is a text-to-voice synthesis service. Describe a voice in plain English (“warm female narrator with British accent, mid-30s”) and MiniMax synthesizes a complete voice profile. The returned voice_id works the same way voice clone’s output does — usable across all MiniMax HD/Turbo TTS calls.
It fits branding work when no voice talent is available, fictional characters, persona voices for chatbots and agents, and any case where a written brief is all you have.
Specs
| Field | Value |
|---|---|
| Model ID | minimax-voice-design |
| Creator | MiniMax |
| Best for | Persona voices from text, fictional characters, branding without voice talent |
| Max description | 1000 characters |
| Optional hints | gender, age_group, preview_text |
| Output | voice_id (kyma-namespaced) |
| Pricing mode | Per call (flat) |
Pricing
| Item | Cost |
|---|---|
| Per designed voice | $4.20 flat (one-time) |
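Because pricing is flat per call, the cost of iterating on a description scales linearly with the number of design calls. The sketch below works through that arithmetic. It assumes every design call is billed at the flat rate, including iterations you later discard; the page does not confirm how discarded voices are billed, and `design_cost` is a hypothetical helper, not part of any SDK.

```python
from decimal import Decimal

# Flat price per designed voice, taken from the Pricing table above.
DESIGN_PRICE_USD = Decimal("4.20")


def design_cost(calls: int) -> Decimal:
    """Estimated spend for `calls` voice-design requests.

    Assumption: each call is billed at the flat rate, whether or not
    the resulting voice is kept.
    """
    if calls < 0:
        raise ValueError("calls must be non-negative")
    return DESIGN_PRICE_USD * calls


if __name__ == "__main__":
    # Five iterations on a description: 5 x 4.20
    print(design_cost(5))  # prints 21.00
```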
Use this when
- You don’t have voice talent and want a polished voice anyway.
- You’re prototyping characters and want to iterate on voice descriptions cheaply.
- You need a chatbot or agent with a distinctive but synthesized persona.
Pick something else when
- You have reference audio → use minimax-voice-clone ($2.10/voice).
- Stock voices are sufficient → use GET /v1/audio/voices directly.
Example
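A minimal sketch of a design request, using only the Python standard library. The endpoint path, model ID, optional hints, and the 1000-character limit come from this page. Everything else is an assumption made for illustration: the base URL (`https://api.example.com` is a placeholder), Bearer-token authentication, the JSON field names `model` and `description`, and a `voice_id` key in the response body. Check the endpoint reference for the actual request and response shapes.

```python
import json
import urllib.request

# Placeholder host; replace with the real API base URL (assumption).
BASE_URL = "https://api.example.com"
# Limit taken from the Specs table.
MAX_DESCRIPTION_CHARS = 1000


def build_voice_design_request(description, api_key, gender=None,
                               age_group=None, preview_text=None):
    """Build (but do not send) a POST /v1/audio/voice-design request.

    The JSON field names are assumed; only the hint names
    (gender, age_group, preview_text) appear on this page.
    """
    if not description.strip():
        raise ValueError("description must not be empty")
    if len(description) > MAX_DESCRIPTION_CHARS:
        raise ValueError(
            f"description exceeds {MAX_DESCRIPTION_CHARS} characters"
        )

    payload = {"model": "minimax-voice-design", "description": description}
    # Include optional hints only when the caller supplies them.
    hints = {
        "gender": gender,
        "age_group": age_group,
        "preview_text": preview_text,
    }
    payload.update({k: v for k, v in hints.items() if v is not None})

    return urllib.request.Request(
        BASE_URL + "/v1/audio/voice-design",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def design_voice(request):
    """Send the request and return the voice_id (assumed response key)."""
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["voice_id"]


if __name__ == "__main__":
    req = build_voice_design_request(
        "warm female narrator with British accent, mid-30s",
        api_key="YOUR_API_KEY",
        gender="female",
    )
    # Inspect the request locally; call design_voice(req) to send it.
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Building the request separately from sending it lets you validate the description length locally before spending a flat-rate call.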
See also
- POST /v1/audio/voice-design: endpoint reference
- POST /v1/audio/voice-clone: clone from reference audio instead