
Share
AI IQ scores language models using the human intelligence scale, igniting a debate between tech enthusiasts who see clarity and researchers who warn of potential misinterpretations.
For decades, the IQ test has been a contentious but widely recognized measure of human intelligence. Now, a new project called AI IQ is applying this familiar metric to artificial intelligence, assigning estimated intelligence quotients (IQs) to over 50 of the world's most powerful language models and plotting them on a standard bell curve.
The interactive visualizations at aiiq.org have garnered significant attention in the past week, with enterprise technologists praising their clarity and researchers criticizing the framework as misleading. This article dives into the technical details of AI IQ and explores why it's causing such a stir in the tech community.
AI IQ uses a combination of standardized tests and benchmarks to evaluate language models. The core methodology involves:
The project's creators argue that this approach provides a more intuitive understanding of AI capabilities. By placing AI models on the same scale as human intelligence, they aim to demystify and contextualize the performance of these systems.

However, critics point out several issues:
Despite the criticism, AI IQ has sparked important discussions about how we measure and understand AI capabilities. Here are a few key points to consider:
As the field of AI continues to evolve, projects like AI IQ will play a crucial role in shaping how we think about and measure machine intelligence. Whether you see it as a useful tool or a flawed metaphor, the conversation it has sparked is undoubtedly valuable.
Tags
Original Sources
AI IQ is here: a new site scores frontier AI models on the human IQ scale. The results are already dividing tech.
↗ https://venturebeat.com/technology/ai-iq-is-here-a-new-site-scores-frontier-ai-models-on-the-human-iq-scale-the-results-are-already-dividing-tech
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
14 May 2026
133 articles
Related Articles
Related Articles
More Stories