Xiaomi LLM-Core Team
YOU?
Author Swipe
View article: MiMo-VL Technical Report
MiMo-VL Technical Report Open
We open-source MiMo-VL-7B-SFT and MiMo-VL-7B-RL, two powerful vision-language models delivering state-of-the-art performance in both general visual understanding and multimodal reasoning. MiMo-VL-7B-RL outperforms Qwen2.5-VL-7B on 35 out o…