
Share
This survey uncovers how leading AI researchers are pushing boundaries in LLM reasoning through innovative approaches like inference scaling and specialized training methods, moving beyond basic conversational capabilities.
In the rapidly evolving landscape of large language models (LLMs), reasoning has become a critical capability that sets advanced AI systems apart from conventional chatbots. A recent survey by researchers from Salesforce AI Research, Nanyang Technological University, I2R, A*STAR, Singapore, National University of Singapore, and The Hong Kong University of Science and Technology (Guangzhou) delves into the latest advancements in LLM reasoning. This article summarizes their findings, focusing on two key dimensions: regimes and architectures.
The survey categorizes methods based on when reasoning is achieved-either at inference time or through dedicated training.
Inference-Time Reasoning: Techniques that enhance reasoning capabilities during the model's inference phase. This includes:
Dedicated Training Reasoning: Methods that involve training the model specifically for reasoning tasks.
The architecture dimension differentiates between standalone LLMs and agentic compound systems that incorporate external tools or multi-agent collaborations.
Standalone LLMs: Single models that perform reasoning tasks independently.
Agentic Systems: Compound systems that integrate external tools or multiple agents.

The survey covers a range of learning algorithms used to train reasoning models:
The survey also highlights the development of domain-specific reasoning systems, which are tailored to specific industries or tasks. These systems often require specialized data and training methods to achieve high accuracy.
Despite significant progress, several challenges remain:
This survey provides a comprehensive overview of the current state and future directions in LLM reasoning. It serves as a valuable resource for researchers and practitioners looking to advance the capabilities of AI systems in logical inference, problem-solving, and decision-making.
Tags
Original Sources
↗ https://arxiv.org/pdf/2504.09037
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
2 September 2025
88 articles
Related Articles
Related Articles
More Stories