
Share
Researchers at Anthropic found that increasing a language model's reasoning time can paradoxically worsen its performance, challenging the assumption that more computational power always improves AI outcomes.
In a surprising twist for the AI industry, researchers at Anthropic have discovered that extending the reasoning time of large language models (LLMs) can actually degrade their performance. This phenomenon, dubbed "inverse scaling in test-time compute," challenges the common belief that more computational resources always lead to better outcomes. The findings, detailed in a new paper by Anthropic AI safety fellow Aryo Pradipta Gema and colleagues, could have significant implications for enterprises deploying AI systems that rely on extended reasoning.
The research team, including Anthropic's Ethan Perez, Yanda Chen, and Joe Benton, along with academic collaborators, conducted a series of experiments to explore the effects of extended reasoning on LLM performance. Here are some key points:

The discovery of inverse scaling in test-time compute has several implications for AI practitioners:
The findings from Anthropic's research challenge a fundamental assumption in the AI industry and highlight the importance of careful model design and resource allocation. As AI continues to evolve, understanding these nuances will be crucial for developing robust and reliable systems.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
23 July 2025
88 articles
Related Articles
Related Articles
More Stories