Documentation Index
Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
minimax-speech-hd is MiniMax’s HD-tier text-to-speech voice. Multilingual, expressive, brand-safe. Right for production narration, audiobooks, multilingual content, and budget-tier brand voice work.
Position it as the production tier between minimax-speech-turbo (bulk / real-time) and eleven-multilingual-v2 (hero brand voice).
Specs
| Field | Value |
|---|---|
| Model ID | minimax-speech-hd |
| Creator | MiniMax |
| Best for | Production narration, multilingual content |
| Max input | 5000 characters per request |
| Pricing mode | Per character |
Pricing
| Cost | |
|---|---|
| Per 1K chars | $0.140 |
| Typical sentence (~150 chars) | ~$0.021 |
Use this when
- You need production-quality voice without ElevenLabs flagship pricing.
- You’re shipping multilingual content (29+ languages supported).
- You want a single SKU good enough for narration, not just demos.
Pick something else when
- You need ultra-low-latency for real-time agents →
minimax-speech-turbo($0.090/1K char). - You’re cutting hero brand spots and the voice IS the product →
eleven-multilingual-v2($0.405/1K char).
Example
See also
POST /v1/audio/speech— endpoint referencePOST /v1/audio/voice-clone— clone a custom voice for use here