Text-to-Speech
Generate natural speech from text using the MiMo-V2-TTS model.
MiMo-V2-TTS provides high-quality text-to-speech synthesis, converting text into natural-sounding audio.
Text-to-Speech is currently free for a limited time. Pricing will be announced before the free period ends.
API Endpoint
POST https://api.mimo-v2.com/v1/audio/speechExample
from openai import OpenAI
client = OpenAI(
api_key="your_mimo_api_key",
base_url="https://api.mimo-v2.com/v1"
)
response = client.audio.speech.create(
model="mimo-v2-tts",
voice="alloy",
input="Hello! Welcome to Mimo API Provider. We are glad to have you here."
)
response.stream_to_file("output.mp3")Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | The TTS model to use. Set to mimo-v2-tts. |
input | string | Yes | The text to synthesize into speech. |
voice | string | Yes | The voice to use for synthesis. |
response_format | string | No | Output audio format. Default: mp3. |
speed | number | No | Speech speed multiplier (0.25 to 4.0). Default: 1.0. |
Supported Output Formats
| Format | Description |
|---|---|
mp3 | MPEG audio (default) |
opus | Opus codec, low latency |
aac | AAC audio |
flac | Lossless audio |
wav | Uncompressed WAV |
pcm | Raw PCM audio |
MiMo API Docs