
Share
Discover how adapting your prompts to new language models can unlock their full potential and avoid common performance traps discussed with real-world examples.
When a new language model (LLM) is released, it's tempting to plug in your existing prompts and expect immediate improvements. However, this often leads to disappointing results. The key to maximizing the potential of newer models lies in rewriting your prompts. This article explores why prompt optimization is crucial and provides specific reasons backed by practical examples.
One of the most straightforward differences between models is their handling of different data formats, such as Markdown vs. XML.
OpenAI Models and Markdown: Older OpenAI models were particularly adept at processing Markdown. This makes sense given the prevalence of Markdown on the internet and its simplicity in terms of token usage.
Claude 3.5 and XML: When Anthropic released Claude 3.5, they introduced an XML-based system prompt. According to Zack Witten, an Anthropic employee, this decision was driven by the fact that Claude's training data included a lot of XML content:
While OpenAI hasn't explicitly stated why they favor Markdown, their system prompts and tutorials consistently use this format. This suggests that sticking to Markdown for OpenAI models is likely to yield the best results.

Position bias refers to how different parts of a prompt are weighted by the model. Some models give more importance to the beginning of the prompt, while others prioritize the end. Understanding and leveraging this can significantly improve performance.
Each LLM has its own set of biases that can either enhance or hinder performance. Working with these biases is crucial for optimizing prompts.
Understanding Model Biases: For instance, if a model is biased towards certain types of input data (e.g., XML vs. Markdown), aligning your prompt format with these biases can lead to better results.
Practical Example: GPT-5 in Cursor: When GPT-5 was initially released in Cursor, users were disappointed with its performance. However, OpenAI and Cursor later identified and addressed specific issues, as detailed in the OpenAI gpt-5 cookbook. This highlights the importance of understanding and adapting to model-specific biases.
When a new LLM is released, don't just assume that your existing prompts will work as well. Take the time to rewrite and optimize them based on the model's specific requirements and biases. By doing so, you can avoid performance pitfalls and fully leverage the capabilities of the latest models.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
15 September 2025
88 articles
Related Articles
Related Articles
More Stories