
Share
Researchers at Apple discovered that large language models naturally achieve semantic calibration, meaning they can assess the true meaning of their responses beyond just predicting the next word accurately.
Large Language Models (LLMs) have made significant strides in generating human-like text, but one of their persistent challenges is providing meaningful confidence estimates for their outputs. While base LLMs are known to exhibit next-token calibration, it's been unclear whether they can assess the actual meaning of their responses beyond the token level. A recent paper from Apple researchers Preetum Nakkiran, Arwen Bradley, Adam Goliński, Eugene Ndiaye, Michael Kirchhof, and Sinead Williamson sheds light on this issue.

This research provides a principled explanation for the emergence of semantic calibration in LLMs and highlights the importance of maintaining this property during model training and fine-tuning. As LLMs continue to be integrated into various real-world applications, understanding and preserving semantic calibration will be crucial for ensuring their reliability and trustworthiness.
Tags
Original Sources
↗ https://machinelearning.apple.com/research/trained-on-tokens?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
25 March 2026
133 articles
Related Articles
Related Articles
More Stories