
Share
Snorkel AI unveils Snorkel-Mistral-PairRM-DPO, an advanced language model that integrates response reranking and Direct Preference Optimization for improved alignment in conversational AI, backed by impressive benchmark results.
Snorkel AI has released a new aligned version of the Mistral language model, dubbed Snorkel-Mistral-PairRM-DPO. This model is designed to improve alignment and performance in chat-based applications. The release includes benchmark results that demonstrate the effectiveness of Snorkel's approach to LLM (Large Language Model) alignment.
The key technical changes introduced by Snorkel AI involve a novel training methodology that combines response reranking with Direct Preference Optimization (DPO). This approach aims to refine the model's responses to be more aligned with human preferences, making it particularly suitable for chat applications where nuanced and contextually appropriate responses are crucial.
The Snorkel-Mistral-PairRM-DPO model can be accessed via multiple endpoints, including Hugging Face and Together AI:
https://t1q6ks6fusyg1qq7.us-east-1.aws.endpoints.huggingface.cloudimport requests
API_URL = "https://t1q6ks6fusyg1qq7.us-east-1.aws.endpoints.huggingface.cloud"
headers = {
"Accept" : "application/json",
"Content-Type": "application/json"
}

def query(payload):
response = requests.post(API_URL, headers=headers, json=payload)
return response.json()
output = query({
"inputs": "[INST] Recommend me some Hollywood movies [/INST]",
"parameters": {}
})
```
snorkelai/Snorkel-Mistral-PairRM-DPO.The key premise of Snorkel AI's approach is that specialization in alignment techniques can lead to better performance in specific applications like chat. The team plans to release more detailed results and findings on their blog in the coming weeks, providing deeper insights into the training process and benchmark results.
If you're interested in learning more about the model and its capabilities, check out the [Snorkel AI Blog](https://snorkel.ai/new-benchmark-results-demonstrate-value-of-snorkel-
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
24 January 2024
133 articles
Related Articles
Related Articles
More Stories