Sapient Trains a Foundation Model for $1,500 Using Innovative HRM Architecture

Models & Research

The Engineer

15 Jun 2026 · 4 min read

Researchers at Sapient have developed a novel approach to training foundation models using a Hierarchical Recurrent Model (HRM), significantly reducing costs and data requirements.

When it comes to training large language models (LLMs), the cost and resource demands are often prohibitive. Most organizations opt for fine-tuning pre-existing models rather than building from scratch, due to the astronomical expenses involved. However, researchers at Sapient have introduced a groundbreaking solution with HRM-Text, a model that promises to change this paradigm.

HRM-Text leverages a Hierarchical Recurrent Model (HRM) architecture, which is highly sample-efficient and significantly reduces the computational burden compared to traditional Transformers. This innovation not only cuts costs but also aligns more closely with real-world enterprise needs, where targeted responses are crucial.

A New Approach to Training

The key to HRM-Text's efficiency lies in its unique architecture. Unlike standard Transformers that rely on brute-force autoregressive prediction of raw text, HRM decouples computation into two layers: a slow-evolving strategic layer and a fast-evolving execution layer. This design allows the model to focus on higher-level reasoning and task-specific instructions rather than memorizing vast amounts of data.

Strategic Layer: Handles high-level decision-making and context understanding.
Execution Layer: Manages detailed, step-by-step operations required to execute tasks.

By training exclusively on instruction-response pairs, HRM-Text can develop a deep understanding of human language and reasoning with far fewer tokens. This approach is particularly beneficial in enterprise settings where users expect precise, task-oriented responses.

The researchers were able to train a 1B-parameter HRM-Text model from scratch for around $1,500, using significantly fewer training tokens compared to conventional LLMs. Despite the reduced cost and data requirements, HRM-Text achieved competitive performance on key industry benchmarks, demonstrating its potential as a viable alternative to larger, more resource-intensive models.

In Practice

For real-world AI applications, this breakthrough means that foundational pretraining is no longer limited to well-resourced institutions. Organizations can now affordably train their own highly capable reasoning models from scratch and integrate them with external knowledge stores. This shift has several practical implications:

Cost Efficiency: Dramatically reduced training costs make it feasible for smaller organizations to develop custom AI solutions.
Customization: Models can be tailored to specific industry needs, improving relevance and performance.
Faster Iteration: The economics of iteration are improved, allowing for quicker experimentation and deployment cycles.

Guan Wang, CEO of Sapient Intelligence, emphasized the importance of this shift in a statement provided to VentureBeat. "Enterprises today face three compounding problems: training is expensive, infrastructure is heavy, and experimentation cycles are too slow," Wang said. "HRM-Text addresses these issues by providing a cost-effective, lightweight, and agile solution."

The current approach to LLM training involves scraping the internet for massive datasets and running next-token prediction trillions of times. This brute-force method not only incurs high costs but also forces models to memorize vast amounts of irrelevant data just to indirectly learn how to think. Standard decoder-only models, for instance, waste valuable compute on reconstructing prompts that are already known at inference time.

HRM-Text's focus on instruction-response pairs aligns more closely with the actual needs of enterprise users. Instead of memorizing the exact sequence of words from random internet content, the model develops a deep understanding of human language and reasoning through targeted training. This approach ensures that the model is better equipped to handle real-world tasks and provide accurate, context-aware responses.

Key Takeaways

Cost Reduction: HRM-Text can be trained for around $1,500, making it an affordable option for smaller organizations.
Efficiency: The model uses fewer training tokens and computational resources while achieving competitive performance on industry benchmarks.
Practical Benefits: Organizations can develop custom AI solutions that are tailored to specific needs, improving relevance and performance.

The development of HRM-Text by Sapient represents a significant step forward in making foundation models more accessible and practical for a wider range of applications. As the industry continues to explore innovative approaches to AI training, this breakthrough could pave the way for more efficient and cost-effective solutions in the future.