smart_toyAI/PROMPT ENGINEERING

Microsoft's new MAI models

sourceSimon Willison

calendar_todayJune 3, 2026

schedule2 min read

lightbulb

EXECUTIVE SUMMARY

Microsoft Unveils Innovative MAI Models for Enhanced AI Performance

Summary

Microsoft has announced two new text large language models (LLMs): MAI-Thinking-1 and MAI-Code-1-Flash, designed for reasoning and coding tasks, respectively. These models aim to deliver high performance at lower costs, with MAI-Thinking-1 boasting 1 trillion parameters and MAI-Code-1-Flash featuring 137 billion parameters.

Key Points

Models Released: MAI-Thinking-1 (1 trillion parameters, 35 billion active) and MAI-Code-1-Flash (137 billion parameters, 5 billion active).
Target Users: MAI-Code-1-Flash is specifically designed for GitHub Copilot and Visual Studio Code users.
Performance Claims: MAI-Thinking-1 is reportedly preferred over Sonnet 4.6 in blind evaluations.
Data Training: Both models were trained on clean, commercially licensed data without distillation from third-party models.
Training Corpus: Initial web crawl of 1.2 trillion pages reduced to 794 billion after filtering for quality and content.
Content Filtering: Utilizes proprietary AI-content detection and manual inspection to remove low-quality or AI-generated content.
Licensing Issues: The training data still faces challenges regarding licensing, similar to other major LLMs.

Analysis

The introduction of MAI-Thinking-1 and MAI-Code-1-Flash reflects Microsoft's commitment to advancing AI capabilities while addressing cost and performance issues. The focus on clean, licensed data for training sets these models apart in a crowded market.

Conclusion

IT professionals should explore the potential of these new models for enhancing AI applications in their organizations, particularly in coding and reasoning tasks. Staying informed about the licensing and data quality concerns will be crucial for responsible AI deployment.