
Share
As AI models continue their meteoric rise, experts question whether the industry can sustain a quadrupling of computational resources annually until 2030, fueling debates about future innovation limits.
In recent years, the rapid advancement of AI models has been a testament to the power of computational scaling. Our research indicates that this growth in computational resources is responsible for a significant portion of AI performance improvements (Epoch AI). The consistent and predictable gains from scaling have driven AI labs to aggressively expand training compute at an astounding rate of about 4x per year.
To put this into perspective, this 4x annual growth in AI training compute outpaces some of the fastest technological expansions in recent history. For instance, it surpasses the peak growth rates of mobile phone adoption (2x/year from 1980-1987), solar energy capacity installation (1.5x/year from 2001-2010), and human genome sequencing (3.3x/year from 2008-2015).
Given this rapid pace, a critical question arises: Is it technically feasible for the current rate of AI training scaling-approximately 4x per year-to continue through 2030? We examine four key factors that could constrain this scaling:
Our analysis incorporates various public sources, including semiconductor foundries' planned expansions, electricity providers' capacity growth forecasts, and other relevant industry data. Here are some key findings:

Based on our analysis, it is likely that training runs of 2e29 FLOP will be feasible by the end of this decade. To put this in context, if pursued, we might see by the end of the decade advances in AI as drastic as the difference between the rudimentary text generation of GPT-2 in 2019 and the sophisticated problem-solving abilities of GPT-4 in 2023.
While the technical feasibility is promising, the economics of such a massive investment are another story. Training models at this scale will require hundreds of billions of dollars over the coming years. Whether AI developers will be willing to make these investments depends on their long-term strategic goals and financial capabilities.
The rapid scaling of AI training compute has been a driving force behind recent advancements in AI performance. While there are significant technical challenges, our analysis suggests that it is possible for this 4x per year growth to continue through 2030. The key will be addressing power availability, chip manufacturing capacity, data scarcity, and the latency wall. If these constraints can be managed, we may witness unprecedented advancements in AI capabilities by the end of the decade.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
3 September 2024
88 articles
Related Articles
Related Articles
More Stories