
Share
Kevin Scott argues that despite skepticism, scaling laws will push AI boundaries, ensuring continued breakthroughs in large language model capabilities and efficiency.
In a recent interview with Sequoia Capital’s Training Data podcast, Microsoft CTO Kevin Scott reiterated his firm belief that the so-called "scaling laws" for large language models (LLMs) will continue to drive significant advancements in artificial intelligence. This stance comes despite growing skepticism from some quarters of the AI community that progress has plateaued.
Scott's optimism is rooted in the concept of LLM scaling laws, which were first explored by OpenAI researchers in 2020. These laws suggest that the performance of language models improves predictably as they grow larger (more parameters), are trained on more data, and have access to more computational power (compute). Essentially, simply increasing these factors can lead to substantial improvements in AI capabilities without necessarily requiring fundamental algorithmic breakthroughs.
Scott emphasized that "despite what other people think, we’re not at diminishing marginal returns on scale-up." He acknowledged that the exponential nature of these improvements means significant gains are only observable every few years due to the time it takes to build and train on supercomputers.
Not everyone shares Scott's optimism. Some researchers have challenged the idea that scaling laws will persist indefinitely. Critics argue that recent models, such as Google’s Gemini 1.5 Pro, Anthropic’s Claude Opus, and even OpenAI’s GPT-4o, show diminishing returns in performance improvements. These observations are often based on informal assessments and some benchmark results.

Scott believes that the perception of a plateau is partly due to the infrequent sampling of these exponential gains. "The unfortunate thing is you only get to sample it every couple of years because it just takes a while to build supercomputers and then train models on top of them," he explained.
He also highlighted Microsoft's ongoing commitment to AI research, particularly through its $13 billion technology-sharing deal with OpenAI. This partnership has been instrumental in advancing the capabilities of LLMs, as seen in the development of GPT-4 and other cutting-edge models.
For practitioners, Scott’s stance on scaling laws suggests that investing in larger models and more powerful computational resources remains a viable strategy. However, it also underscores the importance of long-term planning and patience, given the time and resources required to see significant improvements.
Kevin Scott’s comments serve as a reminder that the AI landscape is far from stagnant. The ongoing exploration of scaling laws and the continuous investment in computational resources indicate that significant progress in LLMs is likely to continue. For those in the field, this means staying engaged with the latest research and being prepared for periodic but substantial advancements.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
16 July 2024
88 articles
Related Articles
Related Articles
More Stories