
Share
AI is revolutionizing mathematical research by tackling complex problems and formalizing proofs, challenging mathematicians to redefine their roles and explore novel collaborative approaches with machines.
In recent years, artificial intelligence (AI) has begun to reshape the landscape of mathematical research. While mathematics has traditionally been a field driven by human creativity and intuition, AI tools are now taking on tasks that once required significant time and effort from mathematicians. This shift not only raises questions about the future role of humans in the field but also highlights new opportunities for collaboration and discovery.
AI has already demonstrated its prowess in mathematical problem-solving, most notably through its performance at the International Mathematical Olympiad (IMO). This summer, advanced AI chatbots such as Harmonic’s Aristotle, OpenAI’s undisclosed GPT model, and Google DeepMind’s Gemini achieved gold medal-level performances at the IMO. Winning an IMO medal is a significant achievement, indicating exceptional problem-solving skills. These results challenge the notion that the highest levels of mathematical problem-solving are exclusive to humans.
However, it's important to note that contest problems differ from research mathematics, where innovation and new ideas are crucial. While AI can excel in structured, well-defined tasks like those found in competitions, the creative and exploratory nature of research remains a domain where human mathematicians still hold an edge. This distinction invites reflection on how AI can complement rather than replace human mathematicians.
Beyond problem-solving, AI has also made significant contributions to mathematical formalization. Formalization involves translating mathematical definitions, statements, and proofs into a precise, machine-readable language. This process transforms the art of proof into a rigorously verifiable algorithm, ensuring that every step in a proof is logically sound.
A leading example of this technology is Lean, an open-source proof assistant developed by Leonardo de Moura at Microsoft. Supported by a global community, Lean features the extensive open-source mathlib library, which aims to build a comprehensive mathematical repository covering everything from basic algebra to advanced research. Theorems in mathlib are verified computationally from first principles, providing a level of rigor and reliability that is difficult to achieve with traditional methods.

The interaction between human mathematical reasoning and machine assistance is evolving. This process begins with informal mathematics, where ideas are explored using natural language and intuition. As these ideas mature, they can be formalized into precise statements and proofs, which are then verified using proof assistants like Lean. This transition from informal to formal mathematics allows for deeper exploration and verification of complex theories.
As AI continues to evolve, it is crucial to prepare the next generation of mathematicians to thrive in this new landscape. Training should focus on developing skills that complement AI's capabilities, such as deep conceptual understanding, creative problem-solving, and the ability to interpret and extend machine-generated results. Mathematicians will need to be adept at using formalization tools and proof assistants, as these technologies become increasingly integral to the research process.
The integration of AI into mathematical research is a double-edged sword. While it raises concerns about the potential replacement of human mathematicians, it also offers exciting opportunities for collaboration and discovery. By embracing AI tools and focusing on areas where human creativity excels, mathematicians can continue to push the boundaries of the field.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
8 October 2025
88 articles
Related Articles
Related Articles
More Stories