可用
MiMo-V2-Omni
Multimodal understanding for image, audio, and richer inputs
Designed for teams building assistants and applications that need multimodal perception.
落地页结构已搭好
当前页面先保持轻量,下一步可以继续叠加模型简介、适用场景、价格说明、API 示例、常见问题和竞品对比等模块,而不用改路由结构。
能力标签
Multimodal understandingDeep reasoningStreamingFunction callingWeb search
推荐场景
Vision featuresAssistant UXMultimodal apps