MiMo API Docs

Pricing & Rate Limits

Overview

The MiMo API provider does not impose a fixed concurrency limit. Under high load, however, requests may fail with HTTP 429 (Too Many Requests). We recommend implementing retry logic with exponential backoff in your application.
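
The retry recommendation above can be sketched as follows. This is a minimal illustration, not an official client: `RateLimited` is a hypothetical exception that your own HTTP wrapper would raise on a 429 response, and the delay values are assumptions chosen for the example.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical error raised by your own client wrapper on HTTP 429."""


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep, rng=random.random):
    """Call send() until it succeeds or max_attempts is reached.

    The wait before retry n (0-based) is drawn uniformly from
    [0, min(max_delay, base_delay * 2**n)], i.e. exponential backoff
    with full jitter.
    """
    for attempt in range(max_attempts):
        try:
            return send()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the 429 to the caller
            cap = min(max_delay, base_delay * 2 ** attempt)
            sleep(cap * rng())
```

The jitter spreads retries from many clients over time so they do not all hit the service again at the same moment. The `sleep` and `rng` parameters exist only so the function can be exercised without real waiting.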

Rate Limit Concepts

  • RPM (Requests Per Minute): The maximum number of API requests allowed per minute.
  • TPM (Tokens Per Minute): The maximum number of tokens (input + output) that can be processed per minute.
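
To make the two limits concrete, here is a small client-side tracker that refuses a request when it would exceed either RPM or TPM within the last 60 seconds. It is a sketch under assumptions: the class name `MinuteBudget` is hypothetical, the sliding 60-second window is one possible interpretation (how the service itself measures a minute is not documented here), and the caller is expected to supply an estimate of input plus output tokens.

```python
import time
from collections import deque


class MinuteBudget:
    """Client-side sliding-window tracker for RPM and TPM (illustrative only)."""

    def __init__(self, rpm, tpm, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock
        self.events = deque()  # (timestamp, tokens) for requests in the last 60 s

    def _expire(self, now):
        # Drop requests that are 60 seconds old or older.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens):
        """Record a request using `tokens` tokens if both limits allow it."""
        now = self.clock()
        self._expire(now)
        used_tokens = sum(t for _, t in self.events)
        if len(self.events) + 1 > self.rpm or used_tokens + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        return True
```

For example, `MinuteBudget(rpm=100, tpm=10_000_000)` mirrors the limits listed for the models on this page. A tracker like this only reduces the chance of a 429; it does not replace retry handling.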

Model Details

MiMo-V2.5-Pro

Property           Details
Model ID           mimo-v2.5-pro
Category           Text Generation - General Large Language Model
Context Length     1M
Max Output Length  128K
Capabilities       Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)      Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit         RPM: 100, TPM: 10M
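
As a worked example of the per-token pricing, the following sketch estimates the cost of a single mimo-v2.5-pro request from the prices listed above. The function name and the split between uncached and cache-hit input tokens are assumptions for illustration; how the service reports cached tokens is not described on this page, and cache-write charges are ignored.

```python
from decimal import Decimal

# USD per 1M tokens for mimo-v2.5-pro, copied from the pricing row above.
PRICE_PER_MILLION = {
    "input": Decimal("0.435"),
    "input_cache_hit": Decimal("0.0036"),
    "output": Decimal("0.87"),
}


def estimate_cost_usd(input_tokens, cached_input_tokens, output_tokens,
                      prices=PRICE_PER_MILLION):
    """Estimate the cost of one request in USD.

    Assumes input_tokens counts only uncached prompt tokens and
    cached_input_tokens counts the tokens billed at the cache-hit rate.
    """
    million = Decimal(1_000_000)
    return (
        input_tokens * prices["input"]
        + cached_input_tokens * prices["input_cache_hit"]
        + output_tokens * prices["output"]
    ) / million
```

Under these assumptions, a request with 200,000 uncached input tokens, 800,000 cache-hit tokens, and 10,000 output tokens costs about $0.09858. Passing a different `prices` mapping adapts the same arithmetic to another model.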

MiMo-V2.5

Property           Details
Model ID           mimo-v2.5
Category           Text Generation - Multimodal Understanding Model
Context Length     1M
Max Output Length  128K
Capabilities       Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)      Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit         RPM: 100, TPM: 10M

MiMo-V2-Pro

Property           Details
Model ID           mimo-v2-pro
Category           Text Generation - General Large Language Model
Context Length     1M
Max Output Length  128K
Capabilities       Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)      Automatically routed to MiMo-V2.5-Pro pricing after June 1, 2026: Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit         RPM: 100, TPM: 10M

MiMo-V2-Omni

Property           Details
Model ID           mimo-v2-omni
Category           Text Generation - Multimodal Understanding Model
Context Length     256K
Max Output Length  128K
Capabilities       Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)      Automatically routed to MiMo-V2.5 pricing after June 1, 2026: Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit         RPM: 100, TPM: 10M
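
Both MiMo-V2 text models are billed at the rates of their V2.5 counterparts after June 1, 2026. A client-side cost tracker could encode that routing as a simple lookup, sketched below. The names `PRICING_ROUTES` and `pricing_model` are hypothetical, and reading "after June 1, 2026" as "strictly later than that day" is an assumption, since the page does not say whether the date itself is included.

```python
from datetime import date

# Legacy model ID -> model ID whose prices apply after the cutover
# (taken from the MiMo-V2-Pro and MiMo-V2-Omni pricing rows above).
PRICING_ROUTES = {
    "mimo-v2-pro": "mimo-v2.5-pro",
    "mimo-v2-omni": "mimo-v2.5",
}
CUTOVER = date(2026, 6, 1)


def pricing_model(model_id, on):
    """Return the model ID whose price list applies to model_id on date `on`.

    Assumption: the routing takes effect on dates strictly after CUTOVER.
    """
    if on > CUTOVER and model_id in PRICING_ROUTES:
        return PRICING_ROUTES[model_id]
    return model_id
```

Pricing for the legacy models on or before the cutover date is not listed on this page, so the function simply returns the original model ID in that case and leaves the lookup to the caller.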

MiMo-V2-TTS

Property           Details
Model ID           mimo-v2-tts
Category           Speech Synthesis Model
Context Length     8K
Max Output Length  8K
Capabilities       Text-to-speech synthesis
Pricing            Free (limited time)
Rate Limit         RPM: 100, TPM: 10M

Web Search Plugin

Service     Price            Description
Web Search  $5 / 1000 calls  Includes web search and page parsing for search-related content
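
The web search price works out to $0.005 per call. The sketch below turns a call count into an estimated charge; the function name is hypothetical, and it assumes billing is pro rata per call rather than in blocks of 1000, which this page does not specify.

```python
from decimal import Decimal

# $5 per 1000 web search calls, from the table above.
WEB_SEARCH_PRICE_PER_1000 = Decimal("5")


def web_search_cost_usd(calls):
    """Estimate the web search charge in USD, assuming per-call proration."""
    return WEB_SEARCH_PRICE_PER_1000 * calls / Decimal(1000)
```

For instance, 250 calls would come to $1.25 under this assumption. This charge is separate from the token costs of the model that issues the search.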

Note: Cache write is currently free for a limited time.
