
Share
Tired of predictable AI? Meet Flint, a new language model that bucks the trend of groupthink and offers fresh, varied responses to open-ended queries.
Let’s start with a fun experiment. Open up your favorite chatbot-Claude, ChatGPT, or Gemini-and ask it to give you a random number between 1 and 10. Chances are, you’ll get 7 more often than not. If you keep asking for another number, you might see 3 or 4 pop up frequently. This isn’t magic; it’s a symptom of a deeper issue: groupthink in large language models (LLMs).
Most LLMs are incredibly powerful and useful for tasks like coding and research, but they fall short when it comes to creativity and diversity in responses. This predictability is a significant drawback, especially for applications that require originality or broad perspectives, such as brainstorming sessions or content creation.
Enter Springboards, an Australian startup with a novel approach. They’ve developed an LLM called Flint, which is designed to produce a wider range of responses to open-ended questions. The goal? To break the cycle of groupthink and provide users with more diverse and creative outputs.
Pip Bingemann, co-founder and CEO of Springboards, illustrates this point through a simple demonstration: the random number game. When he asked ChatGPT, Claude, and Flint for a random number between 1 and 10, the first two models predictably returned 7. But Flint? It gave a more varied response-3.7916.
Bingemann explains, “Most language models are fighting hallucinations, but we welcome them.” This approach is rooted in the belief that embracing a broader spectrum of potential outputs can lead to more innovative and useful results.
To understand how Flint achieves this diversity, let’s dive into some technical details:

Training Data: Flint is trained on a diverse set of data sources, including unconventional datasets that are less likely to be overrepresented in mainstream models. This helps reduce bias and increases the variety of responses.
Model Architecture: Flint uses a hybrid architecture that combines transformer-based models with reinforcement learning techniques. The reinforcement learning component is specifically tuned to reward diversity in outputs, ensuring that the model doesn’t fall into predictable patterns.
Output Generation: During inference, Flint employs a dynamic sampling strategy that considers multiple potential responses and selects the one that offers the most novelty while still being contextually appropriate. This approach helps avoid the common pitfall of LLMs converging on a single, highly probable response.
The impact of Flint’s diversity is evident in real-world applications. For example, when asked to name a type of car, mainstream models like ChatGPT and Claude often default to Toyota or Honda. Flint, however, came up with a Ford F-150-a response that is equally valid but less common.
Another test involved generating taglines for a New Balance running shoe campaign. Both Claude and ChatGPT produced the same generic line: “Run your way.” Flint, on the other hand, offered a more unique and specific tagline: “Built to last, run to win.”
As the field of AI continues to evolve, models like Flint highlight the importance of addressing groupthink and promoting diversity in machine learning. By doing so, we can unlock new possibilities and drive innovation in how we interact with and use AI.
Tags
Original Sources
LLMs are stuck in a groupthink groove. This startup is trying to get them out.
↗ https://www.technologyreview.com/2026/07/01/1140003/llms-are-stuck-in-a-groupthink-rut-this-startup-is-trying-to-get-them-out
Claude Science is Anthropic's newest flagship product
↗ https://www.technologyreview.com/2026/06/30/1139987/claude-science-is-anthropics-newest-flagship-product
Claude Science is Anthropic's newest flagship product
↗ https://www.technologyreview.com/2026/06/30/1139987/claude-science-is-anthropics-newest-flagship-product/amp
The Download: Anthropic launches Claude Science, and ...
↗ https://www.technologyreview.com/2026/07/01/1139996/the-download-anthropic-claude-science-california-carbon-manure
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
6 July 2026
68 articles
Related Articles
Related Articles
More Stories
© 2026 Cedar & Bloom. All rights reserved.