小米发布 MiMo 开源推理模型,7B 版本在数学和代码任务上匹敌 o1-mini
英文摘要
Xiaomi unveiled the MiMo series, an Apache 2.0 licensed open-source LLM family designed for reasoning. The models include pre-trained and RL-tuned variants, with the 7B version matching o1-mini performance on math and code benchmarks. Base, SFT, and RL model checkpoints have been publicly released.
中文摘要
小米发布了 MiMo 系列,这是采用 Apache 2.0 许可的开源大语言模型家族,专为推理任务而生。模型包含预训练和强化学习调优版本,其中 7B 模型在数学和代码基准测试上的表现与 o1-mini 相当。基座模型、SFT 和 RL 模型权重均已公开发布。
关键要点
Fully open-source under Apache 2.0 license
采用 Apache 2.0 完全开源许可
7B model matches o1-mini on math and code reasoning benchmarks
7B 模型在数学和代码推理基准上达到 o1-mini 水平
Pre-trained, SFT, and RL-tuned model variants are all released
发布了基座模型、监督微调(SFT)和强化学习(RL)调优的全套模型
Model series is specifically designed and tuned for reasoning tasks
模型系列专门针对推理任务进行设计和优化