
Share
Researchers at PyMC Labs and Colgate-Palmolive have harnessed large language models to create a new method called Semantic Similarity Rating, which accurately gauges purchase intent with high human-like consistency.
In a significant advancement for consumer research, researchers from PyMC Labs and Colgate-Palmolive have developed a method called Semantic Similarity Rating (SSR) to elicit realistic purchase intent ratings using large language models (LLMs). This approach addresses the limitations of traditional survey methods by leveraging LLMs to generate textual responses that are then mapped to Likert scale distributions. The results, detailed in their recent paper, show that SSR achieves 90% of human test–retest reliability while maintaining realistic response distributions (KS similarity > 0.85).
Consumer research is a costly but essential part of product development, with companies spending billions annually to gather insights from consumer panels. However, these surveys often suffer from biases such as satisficing (respondents providing quick and easy answers), acquiescence (tendency to agree with statements), and positivity bias (overly positive responses). SSR offers a promising alternative by using LLMs to simulate synthetic consumers that provide both quantitative ratings and qualitative feedback.
The researchers tested SSR on an extensive dataset of 57 personal care product surveys conducted by Colgate-Palmolive, involving 9,300 human responses. The results were impressive:

The development of SSR has several practical implications for consumer research:
The Semantic Similarity Rating (SSR) method represents a significant step forward in consumer research. By leveraging the capabilities of large language models, SSR offers a scalable, reliable, and realistic alternative to traditional survey methods. This framework not only preserves the metrics and interpretability of human surveys but also enhances them with rich qualitative feedback. As companies continue to invest in product development, SSR could become an invaluable tool for gathering accurate and actionable consumer insights.
Tags
Original Sources
↗ https://arxiv.org/pdf/2510.08338
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
15 October 2025
88 articles
Related Articles
Related Articles
More Stories