
Share
Researchers at DeepMind unveil Gemma Scope, an open-source suite of sparse autoencoders designed to break down complex neural network representations into understandable features, democratizing access to these powerful tools.
In a significant step forward for unsupervised learning, researchers from DeepMind have introduced Gemma Scope, an open suite of sparse autoencoders (SAEs) trained on various layers and sub-layers of the Gemma 2 models. This work aims to democratize access to SAEs, which are powerful tools for decomposing neural network latent representations into interpretable features. Despite their potential, the high cost of training comprehensive suites of SAEs has limited their use outside of industry settings.
Gemma Scope is a collection of JumpReLU SAEs trained on multiple Gemma 2 models, including:
The key contributions of this work are:
This initiative can significantly reduce the barrier to entry for safety and interpretability research in neural networks. By providing pre-trained SAEs, researchers and practitioners can focus on higher-level tasks without the need to invest significant resources in training these models from scratch.
Sparse Autoencoders (SAEs):
Model Architectures:

Training Process:
By releasing these SAE weights, the Gemma Scope project aims to:
The interactive demo and tutorial further enhance accessibility, allowing practitioners to explore the capabilities of SAEs without deep expertise in the underlying algorithms.
Gemma Scope represents a significant advancement in the field of unsupervised learning, particularly for sparse autoencoders. By providing open access to pre-trained SAEs on various Gemma 2 models, this project can accelerate research and development in safety and interpretability, making it easier for the broader community to leverage these powerful tools.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
13 August 2024
88 articles
Related Articles
Related Articles
More Stories