Xiaomi has released MiMo-V2.5-Pro-UltraSpeed: The generation speed has been increased by 10 times! It can process over 1,000 tokens per second.
igeekphone.com Jun 9, 2026

Xiaomi has released MiMo-V2.5-Pro-UltraSpeed: The generation speed has been increased by 10 times! It can process over 1,000 tokens per second.

AI-summarised brief · reviewed before publication

Xiaomi has launched MiMo-V2.5-Pro-UltraSpeed, a breakthrough in AI text generation speed, achieving 1000 tokens per second on a single standard 8-card GPU node. This is a 10-fold increase in speed, made possible by collaborations with TileRT and innovations in model and system design. The service is available on a limited-time subscription basis, with priority given to enterprises and professional developers. A dedicated webpage allows ordinary users to experience the conversation function, with a daily queue limit and session duration. The technology advancements include FP4 quantization, DFlash block parallel speculative decoding, and a restructured GPU execution architecture.

💡 Why It Matters

  • · Faster AI text generation enables parallel reasoning of models and autonomous error correction, significantly improving logical reasoning quality.
  • · This boosts productivity in code generation and programming agents.