
Share
Researchers at HKUST found that large language models with better text compression skills also perform smarter in general tasks, suggesting that compression efficiency could be a key metric for AI intelligence.
In a recent study titled "Compression Represents Intelligence Linearly," researchers from Hong Kong University of Science and Technology (HKUST) have provided empirical evidence that the ability of large language models (LLMs) to compress external text corpora correlates almost linearly with their intelligence. This work, published in COLM 2024, sheds light on why advanced compression capabilities are a strong indicator of a model's performance across various benchmarks.
The study challenges the common belief that intelligence is an abstract, hard-to-quantify concept by showing a clear, measurable relationship between compression efficiency and LLM performance. This has significant implications for practitioners:
The study involved 31 public LLMs from diverse organizations across 12 benchmarks. Here are the main findings:
To establish this relationship, the researchers:

The study's methodology is transparent and reproducible:
For developers and researchers working with LLMs, this study offers several practical insights:
The findings from "Compression Represents Intelligence Linearly" provide concrete evidence that the ability of LLMs to compress external text corpora is a strong indicator of their intelligence. This opens up new possibilities for unsupervised model evaluation and suggests that focusing on compression could be a key strategy for developing more advanced language models.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
17 April 2024
133 articles
Related Articles
Related Articles
More Stories