Speech 02 Turbo
MiniMax fast speech synthesis with natural prosody and multi-language support.
Speech 02 Turbo is a cloud text-to-speech model from MiniMax. It converts written text into natural, spoken audio. It offers a choice of 39 voices and supports multiple languages. Voices can be tuned for speed and emotion, and output is available as MP3, WAV, and FLAC. It runs through Replicate using your own API key, from about $0.0004 per second of output.
- Pricing
- $0.0004 per second
- Type
- Text-to-speech
- Voices
- 39 to choose from
- Languages
- Multilingual
- Voice controls
- Speed, Emotion
- Output formats
- MP3, WAV, FLAC
MiniMax
MiniMax is a Chinese AI lab whose speech and music models are widely used for multilingual voiceover and generative music.
www.minimax.io ↗Examples
Sample outputs generated with Speech 02 Turbo will appear here.