
Share
Hugging Face launches Open-R1 to replicate DeepSeek’s groundbreaking R1 AI model, pushing for transparency and open-source innovation in advanced reasoning technology.
Barely a week after DeepSeek released its R1 “reasoning” AI model, which sent markets into a frenzy, researchers at Hugging Face are diving headfirst into an ambitious project. They aim to replicate the model from scratch in what they’re calling a pursuit of "open knowledge." Leandro von Werra, Head of Research at Hugging Face, along with several company engineers, has launched Open-R1.
DeepSeek’s R1 model is notable for its advanced reasoning capabilities, which have been touted as a significant leap forward in AI. However, the model's proprietary nature and lack of transparency have raised concerns among the research community. Hugging Face's Open-R1 project aims to address these issues by building an open-source version that can be freely inspected, modified, and improved.
Data Collection:
Model Architecture:
Training Process:
Evaluation Metrics:

While the project is still in its early stages, preliminary results are promising. Hugging Face has shared some initial benchmarks:
Performance on Reasoning Tasks:
Resource Efficiency:
Hugging Face is actively encouraging the AI community to contribute to the project. They have set up a dedicated GitHub repository where researchers can submit pull requests, report issues, and collaborate on improvements.
Hugging Face’s Open-R1 project represents a significant step towards more transparent and collaborative AI research. By building an open-source version of DeepSeek’s R1 model, they aim to foster innovation, ensure ethical standards, and make advanced reasoning capabilities accessible to a broader audience.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
5 February 2025
88 articles
Related Articles
Related Articles
More Stories