
Share
Cohere For AI introduces Aya Expanse, a suite of multilingual models that excel in 23 languages, surpassing competitors in accuracy and efficiency, to significantly narrow the global language divide.
Cohere For AI, the research arm of Cohere, has unveiled Aya Expanse, a groundbreaking family of multilingual models designed to enhance language support and bridge the global language gap. This new suite of models excels across 23 languages and outperforms leading open-weight models in both accuracy and performance.
These models are part of Cohere’s ongoing commitment to multilingual research and aim to accelerate advancements in the field. The 8B model is particularly noteworthy for making cutting-edge research more accessible to a broader range of researchers worldwide.
Win rates for Aya Expanse 8B range from 60.4% to 70.6%, demonstrating significant improvements in multilingual performance.
Aya Expanse builds on Cohere’s extensive work over the past two years, which has involved collaboration with over 3,000 researchers from 119 countries. This collaborative effort has led to several critical milestones:

The improvements in Aya Expanse stem from a sustained focus on expanding AI's language capabilities. Cohere’s research agenda has included:
For practitioners, Aya Expanse represents a significant step forward in addressing the language gap. The models' high performance across multiple languages means they can be used in a wide range of applications, from speech recognition and translation to content generation and understanding. This is particularly important for underrepresented languages, where quality AI support has been lacking.
Aya Expanse is not just another model; it's a significant leap in multilingual AI research. By making these models open and accessible, Cohere For AI continues to drive innovation and inclusivity in the field. Whether you're a researcher looking to push the boundaries of multilingual AI or a developer seeking robust language support, Aya Expanse offers a powerful toolset.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
25 October 2024
88 articles
Related Articles
Related Articles
More Stories