News

Microsoft scales back in-house AI chip plans as Maya chip slips a full year behind Nvidia

Jul 2, 2025

Key Points

  • Microsoft delays its Maya AI accelerator to 2026, a full year behind schedule, undermining its core strategy to reduce reliance on Nvidia and cut GPU spending for Azure.
  • By the time Maya ships, Nvidia's Blackwell accelerators will have matured, making it harder for Microsoft to justify custom silicon investments or deliver meaningful cost savings.
  • TSMC fab capacity constraints are bottlenecking all hyperscalers trying to bring custom chips to market, forcing Microsoft to scale back ambitions alongside design and manufacturing challenges.

Summary

Microsoft is scaling back its custom AI chip ambitions as the Maya accelerator slips a full year behind schedule, now expected in 2026 instead of 2025. The delay undermines a core strategic goal of reducing reliance on Nvidia and cutting GPU spend for Azure AI and Copilot workloads.

The slip leaves Microsoft trailing competitors. Amazon is planning Tranium 3 for late 2025, and Google already has its seventh-generation TPU in production. By the time Maya ships, Nvidia's Blackwell accelerators will have matured, making it harder for Microsoft to justify the custom silicon investment or deliver meaningful cost savings.

Microsoft developed the Azure Maya AI accelerator for AI tasks and the Azure Cobalt CPU for cloud workloads. Design challenges and manufacturing delays forced a pullback. Hyperscalers across the industry—Google, Amazon, and Apple—are all betting on custom silicon, but execution matters. Being GPU poor right now is a vulnerability. Companies need chips or they fall behind on performance and economics.

The slip also reflects a broader constraint in semiconductor manufacturing. TSMC fab capacity is a bottleneck for all players trying to bring custom silicon to market on aggressive timelines.