
Share
Google's Gboard now uses a server-side LLM to offer real-time sentence and paragraph corrections, enhancing typing accuracy with minimal user effort. This article explores its development journey.
The latest advancement in Large Language Models (LLMs) has led to a significant enhancement in user typing experiences, particularly with the introduction of Proofread-a new feature in Google’s Gboard keyboard. This feature leverages a server-side LLM to provide seamless sentence-level and paragraph-level corrections with just one tap. In this article, we delve into the technical details of how Proofread was developed, from data generation and model tuning to deployment on Pixel 8 devices.
To ensure the quality of the models used in Proofread, a meticulous synthetic data pipeline was created. This pipeline is tailored for online use cases, generating diverse and realistic text samples that reflect various typing errors and contextual corrections. The data includes:
The team designed multifaceted metrics to evaluate the performance of the models. These metrics include:
The tuning process involved a two-stage approach:

The tuned PaLM2-XS model achieved a good ratio of 85.56% on a human-labeled golden set. This high performance indicates that the model is effective in correcting a wide range of errors with minimal user intervention.
Proofread was launched on Pixel 8 devices, leveraging Google Cloud’s TPU v5 infrastructure for efficient and scalable deployment. The team implemented several optimizations to reduce serving latency:
These optimizations allowed Proofread to handle thousands of daily active users with minimal delay, providing a smooth and responsive user experience.
A demo of the Proofread feature is available on YouTube, showcasing its effectiveness in real-world scenarios.
Proofread represents a significant step forward in leveraging LLMs to enhance typing experiences. By combining advanced data generation, multifaceted metrics, and sophisticated tuning techniques, the feature delivers seamless and accurate corrections with just one tap. This innovation not only improves user productivity but also sets a new standard for AI-powered text correction tools.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
11 June 2024
88 articles
Related Articles
Related Articles
More Stories