
Share
In a groundbreaking experiment, OpenAI's team used Codex AI to build an extensive million-line codebase without manual coding, slashing development time and sparking new possibilities for agent-driven software creation.
By Ryan Lopopolo, Member of the Technical Staff
Over the past five months, our team at OpenAI has been running an ambitious experiment: building and shipping an internal beta of a software product without writing a single line of code manually. This means every bit of application logic, tests, CI configuration, documentation, observability, and internal tooling was generated by Codex. The result? We estimate we built this in about 1/10th the time it would have taken to write the code by hand.
Our primary goal was to increase engineering velocity by orders of magnitude. To achieve this, we had to rethink the role of engineers. Instead of writing code directly, our team focused on designing environments, specifying intent, and building feedback loops that allow Codex agents to do reliable work. This shift in focus allowed us to ship a product with a million lines of code in just weeks.
The first commit to an empty repository landed in late August 2025. The initial scaffold-repository structure, CI configuration, formatting rules, package manager setup, and application framework-was generated by Codex CLI using GPT-5. Even the AGENTS.md file that directs agents on how to work in the repository was written by Codex.
There was no pre-existing human-written code to anchor the system. From the beginning, the repository was shaped by the agent.
Five months later, the repository contains on the order of a million lines of code across application logic, infrastructure, tooling, documentation, and internal developer utilities. Over this period, roughly 1,500 pull requests (PRs) have been opened and merged with a small team of just three engineers driving Codex. This translates to an average throughput of 3.5 PRs per engineer per day. Surprisingly, the throughput has increased as the team has grown to now seven engineers.

Importantly, this wasn’t output for output’s sake: the product has been used by hundreds of users internally, including daily internal power users.
The lack of hands-on human coding introduced a different kind of engineering work, focused on systems and scaffolding. Here are some key takeaways:
This experiment has shown that an agent-first approach can significantly accelerate software development while maintaining or even improving code quality. By redefining the role of engineers to focus on systems and intent, we can leverage the power of AI to build complex products more efficiently.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
12 February 2026
88 articles
Related Articles
Related Articles
More Stories