MiMo API Docs

Text-to-Speech

MiMo-V2-TTS is a text-to-speech model that converts text into natural-sounding audio.

Text-to-Speech is currently free for a limited time. Pricing will be announced before the free period ends.

API Endpoint

POST https://api.mimo-v2.com/v1/audio/speech
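
For readers who are not using an SDK, the request can be sketched with Python's standard library alone. This is a minimal sketch, not a definitive client: the Bearer-token `Authorization` header is an assumption based on the endpoint being OpenAI-compatible, and the helper names `build_speech_request` and `synthesize` are hypothetical.

```python
import json
import urllib.request

API_URL = "https://api.mimo-v2.com/v1/audio/speech"


def build_speech_request(api_key: str, text: str, voice: str = "alloy") -> urllib.request.Request:
    """Build (but do not send) a POST request for the speech endpoint."""
    body = json.dumps(
        {"model": "mimo-v2-tts", "input": text, "voice": voice}
    ).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            # Assumption: Bearer-token auth, as used by OpenAI-compatible APIs.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def synthesize(api_key: str, text: str, out_path: str = "output.mp3") -> None:
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(api_key, text)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as f:
        f.write(response.read())
```

Keeping request construction separate from sending makes it possible to inspect the URL, headers, and JSON body without making a network call.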

Example

from openai import OpenAI

# Point the OpenAI SDK at the MiMo endpoint.
client = OpenAI(
    api_key="your_mimo_api_key",
    base_url="https://api.mimo-v2.com/v1",
)

# Stream the synthesized audio to disk as it arrives.
with client.audio.speech.with_streaming_response.create(
    model="mimo-v2-tts",
    voice="alloy",
    input="Hello! Welcome to MiMo API Provider. We are glad to have you here.",
) as response:
    response.stream_to_file("output.mp3")

Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| model | string | Yes | The TTS model to use. Set to mimo-v2-tts. |
| input | string | Yes | The text to synthesize into speech. |
| voice | string | Yes | The voice to use for synthesis. |
| response_format | string | No | Output audio format. Default: mp3. |
| speed | number | No | Speech speed multiplier (0.25 to 4.0). Default: 1.0. |
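
The parameter rules above can be checked on the client side before a request is sent. The following is a minimal sketch. The helper name `build_speech_params` is hypothetical, and rejecting empty strings for the required fields is an assumption about what "required" means here, not documented API behavior.

```python
from typing import Any, Dict

# Documented range for the speed multiplier.
SPEED_MIN, SPEED_MAX = 0.25, 4.0


def build_speech_params(
    text: str,
    voice: str,
    response_format: str = "mp3",
    speed: float = 1.0,
) -> Dict[str, Any]:
    """Return request parameters, applying the documented defaults and range."""
    # Assumption: an empty string does not satisfy a required field.
    if not text:
        raise ValueError("input is required")
    if not voice:
        raise ValueError("voice is required")
    if not SPEED_MIN <= speed <= SPEED_MAX:
        raise ValueError(f"speed must be between {SPEED_MIN} and {SPEED_MAX}")
    return {
        "model": "mimo-v2-tts",
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }
```

With the OpenAI SDK, the resulting dictionary can be unpacked directly, for example `client.audio.speech.create(**build_speech_params("Hello!", "alloy", speed=1.25))`.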

Supported Output Formats

| Format | Description |
| --- | --- |
| mp3 | MPEG audio (default) |
| opus | Opus codec, low latency |
| aac | AAC audio |
| flac | Lossless audio |
| wav | Uncompressed WAV |
| pcm | Raw PCM audio |
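
When the output format varies, it helps to validate the value and derive the output file name from it. The sketch below assumes the format name doubles as the file extension, which is a common convention rather than something the API specifies. `output_path` is a hypothetical helper.

```python
from pathlib import Path

# Formats listed in the table above.
SUPPORTED_FORMATS = ("mp3", "opus", "aac", "flac", "wav", "pcm")


def output_path(stem: str, response_format: str = "mp3") -> Path:
    """Return a file path whose extension matches the requested format."""
    fmt = response_format.lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(
            f"unsupported format {response_format!r}; "
            f"expected one of {', '.join(SUPPORTED_FORMATS)}"
        )
    # Assumption: the format name is used as the file extension.
    return Path(f"{stem}.{fmt}")
```

For example, `output_path("greeting", "flac")` yields `greeting.flac`, which can be passed to the same call that sets `response_format="flac"`.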
