Cohere Labs Unveils Tiny Aya: Compact, Multilingual Speech Recognition Models for Local Deployment

Models & Research

The Engineer

24 May 2024 · 3 min read

Tiny Aya offers compact multilingual speech recognition for local devices, bridging language gaps with strong performance in a slim 3.35 billion parameter model that operates offline.

Cohere Labs, a leading AI research initiative, has announced the release of Tiny Aya, a family of compact multilingual speech recognition models designed to run locally on any device. This new addition to the Aya suite aims to bridge language gaps and enhance accessibility by providing strong performance without cloud dependency.

What Changed Technically?

Compact Size: Tiny Aya's base model has 3.35 billion parameters, making it significantly smaller than many large-scale models.
Local Deployment: The models can run on a variety of devices, from smartphones to edge servers, thanks to their compact size and efficient architecture.
Multilingual Support: Tiny Aya supports over 70 languages with specialized variants for different regions, ensuring balanced performance across a wide range of linguistic contexts.

Why It Matters

For practitioners and developers, Tiny Aya offers several key benefits:

Reduced Latency: Local deployment means faster response times, which is crucial for real-time applications like voice assistants.
Privacy and Security: Data remains on the device, reducing the risk of data breaches and compliance issues.
Cost Efficiency: No cloud dependency translates to lower operational costs, making it a viable option for resource-constrained environments.

Tiny Aya Variants

Tiny Aya Global

Optimized for Balanced Performance: This variant is designed to handle a wide array of languages with consistent performance across all supported regions.
Try the model

Tiny Aya Earth

Strongest for African and West Asian Languages: Specialized tuning for these regions ensures high accuracy in language-specific tasks.
Try the model

Tiny Aya Fire

Strongest for South Asian Languages: Optimized to excel with languages prevalent in South Asia, such as Hindi, Bengali, and Tamil.
Try the model

Tiny Aya Water

Strongest for Asia-Pacific and European Languages: Provides top-tier performance for a diverse set of languages across these regions.
Try the model

Other Aya Models

Aya Vision

Multimodal AI: This research model advances in multilingual multimodal tasks through synthetic data generation and cross-modal merging.
Performance: Achieves state-of-the-art performance across 23 languages while reducing computational overhead by up to 40%.
Try Aya Vision 8B | Try Aya Vision 32B

Aya Expanse

101 Languages: Mastery across a wide range of languages, including both high- and low-resource ones.
Efficiency: Combines a curated open-source dataset with compute-efficient pretraining to reduce infrastructure costs by up to 30%.
Try Aya Expanse 8B | Try Aya Expanse 32B

Aya 101

Instruction-Tuned Proficiency: Developed through a global collaborative effort involving over 3,000 researchers.
Inclusivity: Supports 101 languages, setting a new standard for multilingual AI.
Try Aya 101

Implementation Notes

Training Techniques: Tiny Aya leverages advanced training techniques to optimize performance and reduce computational overhead. This includes data augmentation, curriculum learning, and specialized loss functions.
Model Architecture: The architecture is designed to be modular, allowing for easy integration with existing systems and flexibility in deployment scenarios.
Benchmarking: Cohere