
Share
LLMs faced with everyday decisions reveal unexpected biases and limitations, challenging assumptions about their practical intelligence and ability to handle real-world scenarios.
In a recent post on Mastodon, user @knowmadd shared an intriguing experiment involving language models (LLMs) and their responses to a seemingly straightforward question. The query was simple: "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?" This scenario not only highlights the nuances in how LLMs interpret and respond to user inputs but also sheds light on the underlying constraints and quirks of these models.
The core technical change here isn't a new model or algorithm; it's an observation about the behavior of existing large language models. Specifically, @knowmadd noticed that different LLMs provided varying responses to the same prompt, which can be attributed to their training data and reward mechanisms.
For practitioners, this experiment underscores the importance of understanding the limitations and quirks of LLMs. Here are a few key takeaways:
@knowmadd tested this scenario with multiple LLMs, including Deepseek and Qwen. Here’s a breakdown of the outputs:

Training Data Diversity:
Reward Functions:
Contextual Understanding:
For developers and researchers working with LLMs, these findings suggest several best practices:
While LLMs have made significant strides in natural language processing, this experiment by @knowmadd reminds us that there is still much to learn about their behavior and limitations. By understanding these nuances, practitioners can better leverage these powerful tools while mitigating potential issues.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
17 February 2026
88 articles
Related Articles
Related Articles
More Stories