
Share
Researchers unveil a surprising flaw in LLMs: they can detect and transmit invisible Unicode characters, enabling sneaky communication that humans can't see. This raises red flags for security and privacy.
In a striking revelation, researchers have discovered that large language models (LLMs) like Claude and Copilot can read and write invisible text, creating an ideal covert channel for malicious activities. This vulnerability arises from the Unicode standard's quirk, which allows certain characters to be recognized by AI but remain invisible to human users. The implications are significant for both security practitioners and developers, as it opens up new vectors for prompt injection and data exfiltration.
The core of this issue lies in how Unicode handles certain characters that are non-renderable or invisible when displayed on a screen. These characters can be embedded within normal text without affecting its appearance to human readers. However, LLMs can still process these characters, making them an ideal medium for steganographic (hidden) communication.
Joseph Thacker, an independent researcher and AI engineer at Appomni, highlighted the severity of this issue: “The fact that GPT 4.0 and Claude Opus were able to really understand those invisible tags was really mind-blowing to me and made the whole AI security space much more interesting.” The ability of these models to interpret non-renderable characters significantly expands the attack surface.
To demonstrate the practicality of this technique, Johann Rehberger, a researcher who coined the term "ASCII smuggling," created two proof-of-concept (POC) attacks targeting Microsoft 365 Copilot. These attacks illustrate how invisible characters can be used to extract sensitive information:

Sales Figures Extraction:
One-Time Passcode Extraction:
Microsoft introduced mitigations for these attacks several months after they were discovered. However, the broader implications of this vulnerability remain concerning:
The discovery of invisible Unicode characters as a covert communication channel underscores the evolving nature of AI security. As LLMs become more sophisticated, so do the methods used to exploit them. Security practitioners and developers must stay vigilant and proactive in addressing these emerging threats.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
18 October 2024
88 articles
Related Articles
Related Articles
More Stories