[AINews] Microsoft Build: MAI-Thinking-1 and MAI Family models
English summary
Microsoft at Build 2026 announced seven new MAI models, including the flagship MAI-Thinking-1 reasoning model with 35B active parameters, 256K context, and strong benchmark scores like 97% on AIME 2025. The company released a highly transparent 109-page technical report that impressed researchers, emphasizing clean data lineage and no use of synthetic data or distillation. Build also focused on local AI with Windows as an agent runtime, the RTX Spark Dev Box, and Project Solara/Scout agent hardware. The GitHub Copilot app was unveiled as a desktop home for agent-native development, and Web IQ was introduced as a new grounding API for agents. Overall, the event positioned Microsoft as both a first-party frontier model developer and a multi-tier AI platform company.
Chinese summary
微软在Build 2026上发布了七款新的MAI模型,包括旗舰推理模型MAI-Thinking-1,具有35B活跃参数、256K上下文窗口以及AIME 2025 97%等强大基准测试成绩。公司发布了一份长达109页的高度透明技术报告,强调数据来源清晰、未使用合成数据或蒸馏技术,获得了研究界好评。Build还聚焦本地AI,将Windows打造为代理运行时,推出RTX Spark Dev Box和Project Solara/Scout代理硬件。GitHub Copilot应用作为代理原生开发的桌面中心亮相,Web IQ作为代理新型接地API推出。整体上,活动将微软定位为既是第一方前沿模型开发者,又是多层次AI平台公司。
Key points
Microsoft announced seven new MAI models at Build 2026, including MAI-Thinking-1, MAI-Code-1-Flash, MAI-Image-2.5, MAI-Transcribe-1.5, and MAI-Voice-2.
微软在Build 2026上宣布了七款新的MAI模型,包括MAI-Thinking-1、MAI-Code-1-Flash、MAI-Image-2.5、MAI-Transcribe-1.5和MAI-Voice-2。
MAI-Thinking-1 is a 35B active parameter MoE with 256K context, scoring 97% on AIME 2025 and 53% on SWE-Bench Pro, preferred over Sonnet 4.6 by blind human raters.
MAI-Thinking-1是一个35B活跃参数的MoE模型,具有256K上下文,在AIME 2025上获得97%,在SWE-Bench Pro上获得53%,盲测中比Sonnet 4.6更受青睐。
Microsoft emphasized clean data lineage, no synthetic data, and no distillation throughout the training pipeline, releasing a 109-page transparent technical report.
微软强调在整个训练流程中数据来源清晰、无合成数据、无蒸馏,并发布了一份109页的透明技术报告。
Build highlighted local AI with Windows as an agent runtime, the RTX Spark Dev Box for local model execution, and concept hardware Project Solara/Scout.
Build强调了本地AI,将Windows打造为代理运行时,推出RTX Spark Dev Box用于本地模型运行,以及概念硬件Project Solara/Scout。
The GitHub Copilot app was positioned as the desktop home for agent-native software development, with features like canvases for bidirectional agent interaction and cross-device continuity.
GitHub Copilot应用被定位为代理原生软件开发的桌面中心,具有画布等双向代理交互功能和跨设备连续性。