
Share
Researchers propose a novel method called "generalized hill-climbing" that aims to create versatile AI models with fewer resources, offering hope for smaller labs in the competitive field of machine learning.
In the rapidly evolving landscape of AI, most labs are leveraging a combination of pre-training and reinforcement learning (RL) to develop more generalized models. These models aren't just proficient in one task but can excel in multiple domains and adapt to new challenges. However, this approach requires substantial computational resources, making it accessible primarily to large research institutions.
For those without access to extensive GPU farms and deep pockets, there's an alternative route to achieving generalized models: generalized hill-climbing. This concept has been simmering in my mind since 2023 or 2024, but it was a tweet by Andrej Karpathy that crystallized it for me:
Software 1.0 easily automates what you can specify. Software 2.0 easily automates what you can verify., Andrej Karpathy
This insight sparked a profound question: How can we make everything verifiable?
The concept of an "Ideal State" has been a recurring theme in my work, but the focus on verifiability brought it into sharp relief. The core idea is simple: to achieve any goal, you need a clear target. Whether you call them ladder rungs, footholds, or stepping stones, there must be something to aim for.
The challenge lies in clearly defining this Ideal State, especially across various task types. This is where the importance of precise articulation comes into play. In 2024, I wrote "AI is Mostly Prompting," emphasizing that nothing compares to a well-articulated intent. Similarly, my 2025 piece "Coding is Thinking" discussed how writing and coding are fundamentally about expressing ideas clearly.

Most recently, in "How to Talk to AI" (June 2025), I argued that if you can't articulate what you want, no amount of prompting or context will help. The Ideal State concept embodies this principle perfectly.
To implement generalized hill-climbing at runtime, the following steps are crucial:
Generalized hill-climbing at runtime offers a promising alternative to traditional approaches in AI model development. By focusing on clear articulation of the Ideal State and iterative refinement, we can create models that are both flexible and verifiable. This approach democratizes access to advanced AI capabilities, making it possible for smaller teams and organizations to achieve significant results.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
16 February 2026
88 articles
Related Articles
Related Articles
More Stories