
Share
Magic’s new 100M token model, developed in collaboration with Google Cloud, revolutionizes code synthesis and software development by overcoming traditional AI context limitations for unprecedented efficiency and complexity handling.
Magic, a leading AI research company, has announced significant advancements in ultra-long context models with the capability to process up to 100 million tokens during inference. This breakthrough, made possible through a partnership with Google Cloud, marks a pivotal shift in how AI models can be utilized for software development and other complex tasks.
Traditionally, AI models have relied heavily on training data, with context windows during inference being relatively short. However, Magic's Long-Term Memory (LTM) models challenge this norm by enabling reasoning over vast amounts of contextual information during real-time operations. This capability is particularly transformative for code synthesis, where having access to extensive codebases, documentation, and libraries can significantly enhance the accuracy and utility of AI-generated code.
Current evaluation methods for long-context models have limitations. One popular method, the "Needle In A Haystack" eval, involves placing a random fact within a large context window and asking the model to retrieve it. However, this approach has several flaws:

To address these issues, Magic has introduced a new evaluation method called HashHop. This method eliminates semantic hints and requires models to store and retrieve random hash pairs:
jJWlupoT → KmsFrnRa
vRLWdcwV → sVLdzfJu
YOJVrdjK → WKPUyWON
OepweRIW → JeIrWpvs
JeqPlFgA → YirRppTA
Completion YOJVrdjK → WKPUyWON
This approach ensures that the model must store and retrieve information with maximum information content, making it a more rigorous test of ultra-long context capabilities.
While the commercial applications of these models are diverse, Magic is particularly focused on software development. The ability to synthesize code while having access to an entire codebase, documentation, and libraries can lead to:
Magic's 100M token context windows represent a significant leap forward in AI capabilities, especially for software development. By addressing the limitations of current evaluation methods with HashHop, Magic is setting new standards for ultra-long context models. This partnership with Google Cloud and the introduction of LTM models promise to revolutionize how we think about AI in the tech industry.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
30 August 2024
133 articles
Related Articles
Related Articles
More Stories