
Share
Nvidia reveals its ambitious Vera Rubin architecture at OCP Global Summit, showcasing liquid-cooled servers designed to handle gigawatt-scale AI factories with unrivaled efficiency and scalability.
Nvidia made a significant splash at the 2025 OCP Global Summit in San Jose, unveiling its vision for "gigawatt AI factories" based on the new Vera Rubin architecture. These data centers are designed to support the next generation of massive AI models with unprecedented efficiency and scalability.
For practitioners, this new architecture means more efficient data centers that can handle the growing demands of AI models without overheating or breaking the bank on cooling costs. The modular design also offers flexibility in expanding infrastructure, which is essential for keeping up with rapid advancements in AI technology.

Nvidia has donated the Vera Rubin NVL144 architecture to the Open Compute Project (OCP) as an open standard. This move encourages widespread adoption and innovation, allowing any company to implement this design in their data centers. It also aligns with the growing trend of open-source hardware in the tech industry.
While specific benchmarks for the Vera Rubin NVL144 are not yet available, the combination of liquid cooling, modular design, and support for next-generation GPUs suggests significant performance improvements. The ability to connect 576 GPUs in a single rack will likely set new standards for AI inference and training workloads.
Nvidia's introduction of the Vera Rubin NVL144 architecture marks a significant step forward in the development of efficient, scalable AI data centers. By donating this design to the OCP and garnering support from major players like Meta and Oracle, Nvidia is positioning itself at the forefront of the AI hardware revolution.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
15 October 2025
133 articles
Related Articles
Related Articles
More Stories