
Share
Microsoft’s new WHAM model, dubbed "Muse," aims to revolutionize game development by generating visuals and controlling actions, offering developers a powerful AI tool to enhance creativity and efficiency.
Microsoft Research has just published a groundbreaking paper in Nature, introducing the World and Human Action Model (WHAM), named "Muse." This generative AI model is specifically designed to support game development by generating game visuals, controller actions, or both. The Muse model represents a significant step forward in how AI can assist human creatives in the gaming industry.
For game developers and designers, the introduction of Muse opens up new possibilities:

The development of Muse was driven by a desire to explore how generative AI can support human creativity in game development. The project began in December 2022, shortly after the release of ChatGPT. The team at Microsoft Research saw the potential of transformer-based models and decided to apply this technology to the gaming domain.
Muse's capabilities are best demonstrated through its generated gameplay sequences. When prompted with a short sequence of human gameplay (10 frames), Muse can predict and generate complex, consistent gameplay for several minutes. This includes accurate character movements, environment interactions, and player actions, showcasing the model's deep understanding of game dynamics.
The release of Muse marks a significant milestone in AI-assisted game development. By providing tools that can handle both visual and action generation, Microsoft is empowering developers to push the boundaries of what's possible in gaming. The open-source nature of the project ensures that the community can continue to innovate and improve upon this foundational work.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
25 February 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories