
Share
Meet SWE-grep and SWE-grep-mini, lightning-fast coding assistants that offer the intelligence of top-tier models without the lag, ensuring your development flow stays uninterrupted.
Modern coding agents often face a tradeoff between speed and intelligence. While advanced models can handle complex tasks, they can take minutes to retrieve the necessary context, disrupting your workflow. To address this, we’ve introduced SWE-grep and SWE-grep-mini, two fast agentic models that match the retrieval capabilities of state-of-the-art coding models but operate an order of magnitude faster. These models are now available in Windsurf’s new Fast Context subagent.
In tools like Windsurf and Devin, we observed that agents often spend over 60% of their initial turn just retrieving context. This is a significant bottleneck. Historically, context retrieval has been approached in two main ways:
Embedding Search (RAG): Once the codebase is indexed, queries are fast. However:
Agentic Search: The model uses CLI tools to explore the codebase, similar to a human. While flexible, it has drawbacks:
To overcome these challenges, we trained SWE-grep and SWE-grep-mini, models that combine the flexibility of agentic search with the speed of embedding search. Here’s what makes them stand out:

Both models use reinforcement learning (RL) to improve their context retrieval capabilities. They learn from user interactions and feedback, continuously refining their search strategies.
We are rolling out the Fast Context subagent to Windsurf users progressively. Here’s how you can try it:
Cmd+Enter.You can try Fast Context in our playground at https://playground.cognition.ai/ to see how it reduces the time it takes to understand large codebases.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
17 October 2025
88 articles
Related Articles
Related Articles
More Stories