MiniMax Speech Turbo

Overview

minimax-speech-turbo is MiniMax’s low-latency text-to-speech tier. It is multilingual, has a fast time-to-first-byte, and is the cheapest voice on Kyma. It suits bulk TTS, real-time voice agents, conversational AI, and high-throughput pipelines.

Specs

Field	Value
Model ID	`minimax-speech-turbo`
Creator	MiniMax
Best for	Real-time agents, conversational AI, bulk narration
Max input	5000 characters per request
Pricing mode	Per character
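
Because each request is capped at 5000 characters, it can help to check input length on the client before calling the API. The following is a minimal sketch, not part of the Kyma API: `check_length` and `MAX_CHARS` are hypothetical names, and it assumes the limit is counted the same way the shell counts characters (`${#var}`), which may differ from how the service counts multi-byte text.

```shell
# Hypothetical client-side guard for the 5000-character request limit.
MAX_CHARS=5000

check_length() {
  # ${#1} is the length of the first argument as the shell counts it
  # (characters in bash under a UTF-8 locale; some shells count bytes).
  if [ "${#1}" -gt "$MAX_CHARS" ]; then
    echo "input is ${#1} chars; limit is $MAX_CHARS" >&2
    return 1
  fi
}

text="Hi, how can I help you today?"
check_length "$text" && echo "ok to send"
```

Longer text would need to be split into several requests, ideally at sentence boundaries so the audio does not break mid-phrase.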

Pricing

Quantity	Cost
Per 1K chars	$0.090
Typical sentence (~150 chars)	~$0.014
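
The per-character arithmetic behind these figures can be sketched in a few lines of shell. `estimate_cost` is a hypothetical helper, not a Kyma tool; it assumes every character in the input is billed at the listed rate of $0.090 per 1K characters, with no minimum charge or rounding applied by the service.

```shell
# Hypothetical helper: estimated USD cost for a character count
# at the listed rate of $0.090 per 1,000 characters.
estimate_cost() {
  awk -v chars="$1" -v rate="0.090" \
    'BEGIN { printf "%.4f\n", chars / 1000 * rate }'
}

estimate_cost 150    # typical sentence: prints 0.0135
estimate_cost 5000   # maximum-length request: prints 0.4500
```

At this rate, a maximum-length request of 5000 characters comes to about $0.45.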

Use this when

You’re building a real-time voice agent and time-to-first-byte (TTFB) matters.
You’re processing high volume (chatbot replies, IVR menus, bulk narration).
You need a budget-friendly default that still sounds production-acceptable.

Pick something else when

You need richer expressivity for storytelling → minimax-speech-hd ($0.140 per 1K chars).
You’re cutting hero brand spots → eleven-multilingual-v2.

Example

curl -X POST https://kymaapi.com/v1/audio/speech \
  -H "Authorization: Bearer $KYMA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax-speech-turbo",
    "input": "Hi, how can I help you today?",
    "voice_id": "female-shaonv"
  }' \
  --output reply.mp3
