模型目錄

為不同工作負載選擇合適的 MiMo 模型

我們先把模型落地頁架構搭起來，後續你可以持續往每個模型頁補上場景、示例、對比與 FAQ，而不需要再重做路由。

已準備好的模型頁骨架

下面每個卡片都對應獨立、可持續擴充的 SEO 路由，之後直接補內容即可。

MiMo-V2.5-Pro

可用

Flagship reasoning for coding and agent workflows

The primary model for complex agent execution, coding, long-context reasoning, and tool-heavy workflows.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Text generationDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2.5

可用

Full-modal understanding with 1M context

Built for applications that need to understand text, images, video, and audio in one model.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Full-modal understandingDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2.5-TTS

可用

Expressive text-to-speech with built-in voices

Generates natural speech from assistant messages, with style control through instructions and audio tags.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechAudio outputSingingStyle control

查看模型頁查看文件

MiMo-V2.5-TTS-VoiceClone

可用

Voice cloning from audio samples

Replicates a target voice from an audio sample and uses it for speech synthesis.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechVoice cloningAudio output

查看模型頁查看文件

MiMo-V2.5-TTS-VoiceDesign

可用

Custom voice design from text descriptions

Creates a voice from a text description, then synthesizes speech in that custom voice.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechVoice designAudio output

查看模型頁查看文件

MiMo-V2-Pro

可用

Reasoning and production-grade text generation

Built for agent workflows, structured output, and long-context reasoning tasks.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Text generationDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2-Omni

可用

Multimodal understanding for image, audio, and richer inputs

Designed for teams building assistants and applications that need multimodal perception.

上下文視窗: 256K
輸出視窗: 128K

能力標籤

Multimodal understandingDeep reasoningStreamingFunction callingWeb search

查看模型頁查看文件

MiMo-V2-TTS

可用

Text-to-speech output for voice experiences

A focused speech model for teams adding natural voice output to products and workflows.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechAudio outputVoice experiences

查看模型頁查看文件

模型目錄

為不同工作負載選擇合適的 MiMo 模型

我們先把模型落地頁架構搭起來，後續你可以持續往每個模型頁補上場景、示例、對比與 FAQ，而不需要再重做路由。

已準備好的模型頁骨架

下面每個卡片都對應獨立、可持續擴充的 SEO 路由，之後直接補內容即可。

MiMo-V2.5-Pro

可用

Flagship reasoning for coding and agent workflows

The primary model for complex agent execution, coding, long-context reasoning, and tool-heavy workflows.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Text generationDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2.5

可用

Full-modal understanding with 1M context

Built for applications that need to understand text, images, video, and audio in one model.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Full-modal understandingDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2.5-TTS

可用

Expressive text-to-speech with built-in voices

Generates natural speech from assistant messages, with style control through instructions and audio tags.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechAudio outputSingingStyle control

查看模型頁查看文件

MiMo-V2.5-TTS-VoiceClone

可用

Voice cloning from audio samples

Replicates a target voice from an audio sample and uses it for speech synthesis.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechVoice cloningAudio output

查看模型頁查看文件

MiMo-V2.5-TTS-VoiceDesign

可用

Custom voice design from text descriptions

Creates a voice from a text description, then synthesizes speech in that custom voice.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechVoice designAudio output

查看模型頁查看文件

MiMo-V2-Pro

可用

Reasoning and production-grade text generation

Built for agent workflows, structured output, and long-context reasoning tasks.

上下文視窗: 1M
輸出視窗: 128K

能力標籤

Text generationDeep reasoningStreamingFunction callingStructured outputWeb search

查看模型頁查看文件

MiMo-V2-Omni

可用

Multimodal understanding for image, audio, and richer inputs

Designed for teams building assistants and applications that need multimodal perception.

上下文視窗: 256K
輸出視窗: 128K

能力標籤

Multimodal understandingDeep reasoningStreamingFunction callingWeb search

查看模型頁查看文件

MiMo-V2-TTS

可用

Text-to-speech output for voice experiences

A focused speech model for teams adding natural voice output to products and workflows.

上下文視窗: 8K
輸出視窗: 8K

能力標籤

Text-to-speechAudio outputVoice experiences

查看模型頁查看文件