Pricing & Rate Limits

Overview

Mimo API Provider does not impose a fixed concurrency limit. However, under high load conditions, you may encounter 429 (Too Many Requests) errors. We recommend implementing retry logic with exponential backoff in your applications.

Rate Limit Concepts

RPM (Requests Per Minute): The maximum number of API requests allowed per minute.
TPM (Tokens Per Minute): The maximum number of tokens (input + output) that can be processed per minute.

Model Details

MiMo-V2.5-Pro

Property	Details
Model ID	`mimo-v2.5-pro`
Category	Text Generation - General Large Language Model
Context Length	1M
Max Output Length	128K
Capabilities	Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2.5

Property	Details
Model ID	`mimo-v2.5`
Category	Text Generation - Multimodal Understanding Model
Context Length	1M
Max Output Length	128K
Capabilities	Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-Pro

Property	Details
Model ID	`mimo-v2-pro`
Category	Text Generation - General Large Language Model
Context Length	1M
Max Output Length	128K
Capabilities	Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Automatically routed to MiMo-V2.5-Pro pricing after June 1, 2026: Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-Omni

Property	Details
Model ID	`mimo-v2-omni`
Category	Text Generation - Multimodal Understanding Model
Context Length	256K
Max Output Length	128K
Capabilities	Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Automatically routed to MiMo-V2.5 pricing after June 1, 2026: Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-TTS

Property	Details
Model ID	`mimo-v2-tts`
Category	Speech Synthesis Model
Context Length	8K
Max Output Length	8K
Capabilities	Text-to-speech synthesis
Pricing	Free (limited time)
Rate Limit	RPM: 100, TPM: 10M

Web Search Plugin

Service	Price	Description
Web Search	$5 / 1000 calls	Includes web search and page parsing for search-related content

Note: Cache write is currently free for a limited time.

Pricing & Rate Limits

Overview

Rate Limit Concepts

RPM (Requests Per Minute): The maximum number of API requests allowed per minute.
TPM (Tokens Per Minute): The maximum number of tokens (input + output) that can be processed per minute.

Model Details

MiMo-V2.5-Pro

Property	Details
Model ID	`mimo-v2.5-pro`
Category	Text Generation - General Large Language Model
Context Length	1M
Max Output Length	128K
Capabilities	Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2.5

Property	Details
Model ID	`mimo-v2.5`
Category	Text Generation - Multimodal Understanding Model
Context Length	1M
Max Output Length	128K
Capabilities	Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-Pro

Property	Details
Model ID	`mimo-v2-pro`
Category	Text Generation - General Large Language Model
Context Length	1M
Max Output Length	128K
Capabilities	Text generation, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Automatically routed to MiMo-V2.5-Pro pricing after June 1, 2026: Input $0.435 / 1M tokens, Input (cache hit) $0.0036 / 1M tokens, Output $0.87 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-Omni

Property	Details
Model ID	`mimo-v2-omni`
Category	Text Generation - Multimodal Understanding Model
Context Length	256K
Max Output Length	128K
Capabilities	Multimodal understanding, deep thinking, streaming, function calling, structured output, web search
Pricing (USD)	Automatically routed to MiMo-V2.5 pricing after June 1, 2026: Input $0.14 / 1M tokens, Input (cache hit) $0.0028 / 1M tokens, Output $0.28 / 1M tokens
Rate Limit	RPM: 100, TPM: 10M

MiMo-V2-TTS

Property	Details
Model ID	`mimo-v2-tts`
Category	Speech Synthesis Model
Context Length	8K
Max Output Length	8K
Capabilities	Text-to-speech synthesis
Pricing	Free (limited time)
Rate Limit	RPM: 100, TPM: 10M

Web Search Plugin

Service	Price	Description
Web Search	$5 / 1000 calls	Includes web search and page parsing for search-related content

Note: Cache write is currently free for a limited time.

Overview

Rate Limit Concepts

Model Details

MiMo-V2.5-Pro

MiMo-V2.5

MiMo-V2-Pro

MiMo-V2-Omni

MiMo-V2-TTS

Web Search Plugin

Table of Contents

Pricing & Rate Limits

Overview

Rate Limit Concepts

Model Details

MiMo-V2.5-Pro

MiMo-V2.5

MiMo-V2-Pro

MiMo-V2-Omni

MiMo-V2-TTS

Web Search Plugin

Table of Contents