
Share
Stein unveils details on how Google's AI Mode uses sophisticated query fan-out techniques, expanding initial queries into multiple related searches to enhance user insights and interaction quality.
Google’s VP of Product for Search, Robby Stein, recently provided new insights into the mechanics of query fan-out in AI Mode during an interview. While the existence of this technique has been previously mentioned in Google’s blog posts, Stein's comments offer a deeper dive into how it operates and its practical implications.
When you enter a question in Google’s AI Mode, the system leverages a large language model (LLM) to interpret your query. This interpretation isn’t just a straightforward search; instead, the LLM "fans out" multiple related searches to Google’s infrastructure. These searches can include topics that you didn't explicitly mention.
For example, if you ask, “Things to do in Nashville with a group,” the system might generate additional queries like:
Stein explained:
"It may think of a bunch of questions like great restaurants, great bars, things to do if you have kids, and it’ll start Googling basically."
The system then combines the results from these multiple queries into a single, cohesive response that includes links. This functionality is active in AI Mode, Deep Search, and some AI Overview experiences.
AI-powered search features, including query fan-out, now serve approximately 1.5 billion users each month. These features support both text-based and multimodal input, making them versatile for a wide range of queries.
The underlying data sources are extensive:
Stein referred to Google Search as “the largest AI product in the world,” highlighting its massive scale and impact.

For more complex queries that require deeper reasoning, a feature called Deep Search may be triggered. This process can involve dozens or even hundreds of background queries and might take several minutes to complete.
Stein shared an example of using Deep Search to research home safes, which involved unfamiliar factors like fire resistance ratings and insurance implications:
"It spent, I don’t know, like a few minutes looking up information and it gave me this incredible response. Here are how the ratings would work and here are specific safes you can consider and here’s links and reviews to click on to dig deeper."
AI Mode also has access to internal Google tools, such as Google Finance and other structured data systems. For instance, if you’re comparing stocks, the system might use:
This integration allows for more comprehensive and accurate responses, leveraging Google’s extensive internal resources.
For developers and engineers working on search and AI systems, understanding query fan-out is crucial. Here are a few key takeaways:
By leveraging these techniques, Google is pushing the boundaries of what’s possible in search and AI, offering users more sophisticated and useful experiences.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
31 July 2025
88 articles
Related Articles
Related Articles
More Stories