
Share
As ChatGPT Images surged past 100 million users in a week, OpenAI's engineering team navigated unprecedented scalability challenges, revealing the behind-the-scenes strategies that kept the service running smoothly.
ChatGPT Images, the latest feature from OpenAI, has been a massive hit, attracting 100 million new users and generating 700 million images in its first week. This unprecedented growth posed significant scalability challenges for the engineering team. I sat down with Sulman Choudhry (Head of Engineering, ChatGPT) and Srinivas Narayanan (VP of Engineering, OpenAI) to understand how they managed this launch.
From day one, the load on ChatGPT Images was far higher than anticipated. The feature went viral in India, with up to 1 million new users signing up per hour at peak times. Despite these challenges, the team avoided major outages by implementing robust load testing and isolation strategies.
The technical architecture behind ChatGPT Images is sophisticated and involves several key components:
When the system started struggling under the rising load, the team had to make significant changes on-the-fly. They rewrote the image generation process from synchronous to asynchronous, ensuring users didn't notice any disruptions:

The massive load on ChatGPT Images overwhelmed other OpenAI systems, but major outages were avoided through extensive preparation:
The team faced several additional challenges, including:
A year ago, ChatGPT's primary bottleneck was GPU availability. However, with this bottleneck addressed, new constraints emerged:
The launch of ChatGPT Images was a significant achievement for OpenAI, demonstrating the team's ability to handle massive scalability challenges. By implementing robust load testing, isolating systems, and making on-the-fly changes, they successfully managed to serve 100 million new users without major outages.
Tags
Original Sources
↗ https://newsletter.pragmaticengineer.com/p/chatgpt-images?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
16 May 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories