
Share
As advancements in computing power accelerate, the predicted timeline for achieving AGI has dramatically shortened, with a significant increase in the likelihood of reaching this milestone within the next decade.
In August 2020, I wrote a post about my predictions for the timeline to achieve Artificial General Intelligence (AGI). My definition of AGI is an AI system that matches or exceeds humans at almost all (95%+) economically valuable work. To clarify, this doesn’t necessarily mean AIs need to do 100% of the work of 95% of people; if they did 95% of the work for everyone, it would also count.
Back then, my forecast was:
Fast forward to 2024, and here’s how those numbers have shifted:
To understand why these timelines have compressed, let's dive into the role of compute and how it has influenced my outlook.
When I last seriously considered the path to AGI, I identified two broad hypotheses:

In my 2020 post, I grappled with the question of how much AI capabilities are driven by better hardware versus new machine learning (ML) algorithms. My simplified estimate at the time was that 50% of AGI progress would come from compute, and 50% from better algorithms. However, as more models were scaled up, I revised this to 65% compute and 35% algorithms.
The shift in my timelines is largely due to the growing evidence supporting Hypothesis 1. In 2020, I noted that many human-like learning behaviors could be emergent properties of larger models. Since then, this view has become more mainstream. Here’s a breakdown of what changed:
Given these developments, I now believe that AGI could be closer than previously anticipated. However, it’s important to note that while compute scaling has been a significant driver, it is not the only factor. New algorithms and architectural innovations will still play a crucial role in achieving AGI.
The path to AGI is complex and multifaceted, but the evidence suggests that we are making faster progress than many anticipated. The role of compute in driving this progress cannot be overstated. As we continue to push the boundaries of what’s possible with larger models, it’s exciting to consider what the next few years might bring.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
23 January 2024
88 articles
Related Articles
Related Articles
More Stories