
Share
This article delves into why large language models fall short despite their sophistication, focusing on the missing human faculty of continual learning and its potential to unlock spontaneous insights.
Despite their impressive capabilities, large language models (LLMs) have yet to produce a genuine breakthrough. The puzzle is why. One possible reason is that these models lack some fundamental aspects of human thought. They are frozen, unable to learn from experience, and they have no “default mode” for background processing-a source of spontaneous human insight.
Human brains are continually learning and adapting. This ability, known as continual learning, allows us to integrate new information with existing knowledge without forgetting past lessons. In contrast, LLMs typically require retraining on the entire dataset to incorporate new data, which is both time-consuming and resource-intensive.
Beyond continual learning, humans also engage in continual thinking-a background process where the brain generates novel ideas and connections. This “default mode” network (DMN) is active even when we are not focused on a specific task. It's responsible for mind-wandering, daydreaming, and spontaneous insights.
To address these shortcomings, Gwern proposes the Day-Dreaming Loop (DDL)-a background process that continuously samples pairs of concepts from memory. Here’s a breakdown:

The DDL works as follows:
Implementing the DDL presents several challenges:
The strategic implications of the DDL are counterintuitive:
The DDL hypothesis suggests a future where AI systems are not just reactive but proactive in generating new ideas. By mimicking human cognitive processes, these models could unlock new levels of creativity and innovation. However, realizing this vision requires overcoming significant technical hurdles and rethinking how we design and deploy AI systems.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
15 July 2025
88 articles
Related Articles
Related Articles
More Stories