MiniMax · Audio

Speech 02 Turbo

MiniMax fast speech synthesis with natural prosody and multi-language support.

Speech 02 Turbo is a cloud text-to-speech model from MiniMax. It converts written text into natural, spoken audio. It offers a choice of 39 voices and supports multiple languages. Voices can be tuned for speed and emotion, and output is available as MP3, WAV, and FLAC. It runs through Replicate using your own API key, from about $0.0004 per second of output.

Modality

Audio

Available on

Replicate

Model ID

minimax/speech-02-turbo

Specs

Pricing: $0.0004 per second
Type: Text-to-speech
Voices: 39 to choose from
Languages: Multilingual
Voice controls: Speed, Emotion
Output formats: MP3, WAV, FLAC

View provider documentation ↗

About the creator

MiniMax

MiniMax is a Chinese AI lab whose speech and music models are widely used for multilingual voiceover and generative music.

www.minimax.io ↗

Samples

Examples

Sample outputs generated with Speech 02 Turbo will appear here.

Sample coming soon

Speech 02 Turbo

MiniMax

Examples

One-time payment. Yours forever.