
Share
Researchers have developed a novel method to enhance the accuracy of large language models by reducing factual errors without relying on human-labeled data, achieving impressive gains with minimal oversight.
The widespread adoption of large pre-trained language models (LLMs) has brought about a new era in natural language processing. These models can generate fluent and creative text, often rivaling human output. However, one significant drawback is their tendency to produce factually inaccurate claims, known as "hallucinations." This issue can lead to the spread of misinformation and the perpetuation of misconceptions. Manual fact-checking, while effective, is time-consuming and costly.
In a recent paper titled "Fine-tuning Language Models for Factuality," researchers Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D. Manning, and Chelsea Finn propose a method to fine-tune LLMs to be more factual without the need for human labels. Their approach leverages two key innovations in NLP: methods for judging factuality and direct preference optimization (DPO).
The researchers used the following steps to improve the factuality of LLMs:
Automated Factuality Preference Rankings: They generated these rankings through two methods:
Fine-Tuning with DPO: The models were fine-tuned using the preference rankings as a guide, optimizing for factuality rather than just fluency or coherence.

The researchers tested their approach on Llama-2, a popular LLM, at the 7B parameter scale. They compared the performance of their fine-tuned model against two baselines: Llama-2-chat and reinforcement learning with human feedback (RLHF).
These improvements highlight the effectiveness of their approach in reducing hallucinations and enhancing factuality without the need for expensive human labels.
For practitioners working with LLMs, this research offers several practical benefits:
The work by Tian et al. demonstrates that fine-tuning LLMs for factuality without human labels is not only feasible but also highly effective. By leveraging automated methods and direct preference optimization, they achieved significant reductions in factual error rates. This approach has the potential to revolutionize how we use and trust large language models in various applications.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
16 November 2023
133 articles
Related Articles
Related Articles
More Stories