
Share
As traditional data sources become less effective, AI is pivoting towards reinforcement learning, where machines learn from real-world interactions. This paradigm shift promises transformative opportunities for businesses that adapt quickly.
The rapid evolution of artificial intelligence (AI) has ushered in a new era where models are increasingly trained through interaction with environments rather than static data sets. This shift, driven by the diminishing returns of easily scrapable text data, is poised to transform AI infrastructure and create significant opportunities for businesses that can harness reinforcement learning (RL). In this article, we explore the technical underpinnings and economic implications of this transition.
In their seminal essay "Welcome to the Era of Experience," Rich Sutton and David Silver proposed a paradigm shift in AI training. Instead of relying on large datasets like Common Crawl, models will learn through interaction with environments, gaining experience in real-time. This approach is particularly relevant as we approach the exhaustion of easily scrapable text data, which has been a cornerstone for training large language models (LLMs).

The shift towards RL training has far-reaching implications for both product companies and infrastructure players. Product companies can build moats through custom models that are finely tuned to their specific needs, while infrastructure providers will play a crucial role in enabling this transition by developing tools and platforms that support efficient RL training.
However, there are still open challenges and fundamental limitations that must be addressed. The risk of RFT becoming another "flop" similar to the first wave of supervised fine-tuning (SFT) is real. To avoid this, continuous innovation and a deep understanding of the technical landscape will be essential.
The era of experience marks a significant turning point in AI infrastructure. By leveraging reinforcement learning, businesses can create more intelligent and adaptable models that are better suited to their specific environments. While there are risks and challenges, the potential rewards are substantial, making this an exciting time for both product companies and infrastructure providers.
Tags
Original Sources
About the author
Marcus began tracking AI's market implications in 2016, noticing AI-related patent filings accelerating ahead of earnings upgrades before most of the sell-side had caught on. A former fixed-income quantitative analyst, he spent two decades building models that priced risk across emerging markets before pivoting to cover the economic impact of AI full-time. His writing translates opaque technical developments into clear risk/reward terms — and he's rarely diplomatic about the gap between AI valuations and underlying fundamentals. He believes most market participants still underestimate AI's long-run deflationary effect on knowledge work.
More from The Analyst →This Week's Edition
27 November 2025
88 articles
Related Articles
Related Articles
More Stories