
Share
Sakana AI's LLM² technique uses large language models to uncover new preference optimization algorithms, introducing DiscoPOP to better align AI with human preferences in a groundbreaking self-referential approach.
At Sakana AI, we're exploring a fascinating self-referential approach called LLM² (‘LLM-squared’) to enhance the training of Large Language Models (LLMs). This method leverages LLMs themselves to discover new algorithms for preference optimization, a critical component in aligning LLMs with human preferences. Our recent report, "Discovering Preference Optimization Algorithms with and for Large Language Models," details this innovative process and introduces a state-of-the-art loss function called DiscoPOP.
Historically, the development of deep learning models has relied heavily on trial-and-error by researchers and theoretical insights. This is particularly true for preference optimization algorithms, which are essential for ensuring that LLMs align with human values. Meanwhile, LLMs have become increasingly sophisticated, capable of generating hypotheses and writing code. This raises an intriguing question: can we use AI to automate the process of AI research and discovery?
Earlier this year, Sakana AI started using evolutionary algorithms to improve the training of foundation models like LLMs. These algorithms are inspired by natural selection and are used to iteratively refine solutions through processes of mutation, crossover, and selection. In a recent paper, we demonstrated that LLMs can act as better evolutionary algorithms themselves.
Given these promising results, we embarked on a project to use LLMs to discover new algorithms for training LLMs. We call this process LLM² (‘LLM-squared’), drawing inspiration from meta-learning techniques. Here’s how it works:
One of the key outcomes of this process is the discovery of a new loss function called Discovered Preference Optimization (DiscoPOP). Here’s what makes DiscoPOP stand out:

To give you a deeper understanding, here are some technical details:
Pipeline Architecture:
Benchmarks:
We are committed to transparency and collaboration. Therefore, we open-source the following:
We are also proud to have collaborated with the University of Oxford and Cambridge University on this project.
The potential of our method is vast. By automating the discovery of new optimization algorithms, we can reduce the need for extensive computational resources and explore a wider search space of optimal loss functions. This not only enhances the capabilities of LLMs in various applications but also paves the way for more efficient and effective AI research.
Ultimately, we envision a future where LLM² becomes a standard tool in the AI researcher's toolkit, enabling faster and more innovative developments in the field.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
14 June 2024
88 articles
Related Articles
Related Articles
More Stories