
Cerebras Systems announced at NeurIPS 2024 that it has trained a 1 trillion parameter model on a single CS-3 system, a task that typically requires thousands of GPUs.
SUNNYVALE, CA AND VANCOUVER, December 10, 2024 – At NeurIPS 2024, Cerebras Systems, the leader in AI acceleration, announced a significant breakthrough: training a 1 trillion parameter model on a single CS-3 system. This achievement, in collaboration with Sandia National Laboratories, marks a major milestone in the development of large language models (LLMs), which typically require thousands of GPUs and extensive infrastructure.
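Traditionally, training a trillion-parameter model is an enormous undertaking that requires thousands of GPUs, significant infrastructure complexity, and a team of AI infrastructure experts.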
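A rough back-of-the-envelope sketch helps show why a model of this size does not fit on a handful of GPUs. The figures below are assumptions for illustration, not numbers from the announcement: mixed-precision training with the Adam optimizer (roughly 16 bytes of state per parameter) and a hypothetical accelerator with 80 GB of memory.

```python
# Back-of-the-envelope memory estimate for training a 1-trillion-parameter model.
# Assumptions (not from the article): mixed-precision Adam, 80 GB per GPU.

PARAMS = 1_000_000_000_000  # 1 trillion parameters, as stated in the article

# Bytes of training state per parameter under a common mixed-precision
# Adam accounting:
BYTES_FP16_WEIGHTS = 2
BYTES_FP16_GRADS = 2
BYTES_FP32_OPTIMIZER = 12  # fp32 master weights + momentum + variance, 4 each

GPU_MEMORY_BYTES = 80 * 10**9  # assumed 80 GB accelerator (hypothetical)


def training_state_bytes(params: int) -> int:
    """Bytes needed for weights, gradients, and optimizer state."""
    per_param = BYTES_FP16_WEIGHTS + BYTES_FP16_GRADS + BYTES_FP32_OPTIMIZER
    return params * per_param


def min_gpus_for_state(params: int, gpu_bytes: int = GPU_MEMORY_BYTES) -> int:
    """Smallest GPU count whose combined memory holds the training state."""
    total = training_state_bytes(params)
    return -(-total // gpu_bytes)  # ceiling division


if __name__ == "__main__":
    total = training_state_bytes(PARAMS)
    print(f"Model state: {total / 10**12:.0f} TB")                      # 16 TB
    print(f"GPUs needed just to hold it: {min_gpus_for_state(PARAMS)}")  # 200
```

Under these assumptions the model state alone comes to about 16 TB, or at least 200 such GPUs before counting activations or communication buffers. This is only a lower bound on memory; real training runs typically use far more devices to reach acceptable throughput, and the model then has to be partitioned across all of them.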
Cerebras' Wafer Scale Cluster technology changes this by enabling the same task on a single CS-3 system.

For practitioners, this means:
Siva Rajamanickam, a researcher at Sandia National Laboratories, commented on the achievement: "Traditionally, training a model of this scale would require thousands of GPUs, significant infrastructure complexity, and a team of AI infrastructure experts. With the Cerebras CS-3, the team was able to achieve this feat on a single system with no changes to model or infrastructure code."
The result, announced at NeurIPS 2024, demonstrates that Cerebras' Wafer Scale Cluster technology can train a trillion-parameter model on a single CS-3 system, without the multi-thousand-GPU infrastructure that such models typically require.
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.