
Share
AlphaEvolve leverages large language models to tackle complex combinatorial optimization problems, pushing the boundaries of theoretical computer science and potentially revolutionizing mathematical discovery.
September 30, 2025
By Ansh Nagda, Student Researcher, and Abhradeep Thakurta, Staff Research Scientist, Google DeepMind, and Prabhakar Raghavan, Chief Technologist, Google
Large language models (LLMs) have recently made significant strides in competitive mathematics and programming. However, their impact on mathematical discovery-proving new theorems or uncovering novel combinatorial structures-has been limited. This is because these fields require absolute correctness, which can be challenging for AI to achieve without human oversight.
In our recent paper, we introduce AlphaEvolve, an LLM-based coding agent designed to find and verify combinatorial structures that enhance the hardness of approximately solving certain optimization problems. This work marks a significant step forward in using AI as a research partner in theoretical computer science.
LLM Integration: AlphaEvolve leverages the capabilities of large language models (LLMs) to generate and refine combinatorial structures.
Reinforcement Learning (RL): We employ RL to optimize the generation process.

Enhanced Problem Solving: AlphaEvolve can generate combinatorial structures that improve the hardness of approximation algorithms. This is crucial for understanding the limits of computational efficiency in solving complex problems.
Collaborative Research: The model can work alongside human researchers, providing insights and solutions that might be overlooked by humans alone.
Generation Process:
Performance Benchmarks:
AlphaEvolve represents a significant advancement in using AI for theoretical computer science. By combining the power of LLMs with reinforcement learning, it opens new avenues for discovering and verifying combinatorial structures that enhance our understanding of computational complexity. This collaborative approach between AI and human researchers has the potential to drive rapid progress in this field.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
3 October 2025
88 articles
Related Articles
Related Articles
More Stories