
Share
Apple’s new foundation models, designed for both device and server use, offer unprecedented multilingual support and efficiency, pushing the boundaries of on-device AI capabilities with innovative architecture tailored for Apple silicon.
Apple has recently introduced two new foundation language models designed to enhance the capabilities of Apple Intelligence, a personal intelligence system deeply integrated into iOS 18, iPadOS 18, and macOS Sequoia. These models are tailored for efficient performance on both devices and in a private cloud environment, marking significant advancements in multilingual and multimodal AI.
The first model is a ∼3 billion parameter language model optimized specifically for Apple silicon. Key technical highlights include:
Architectural Innovations:
Performance and Efficiency:
The second model is a large-scale server-based language model designed for Private Cloud Compute. This model introduces several novel architectural and training techniques:
Parallel-Track Mixture-of-Experts (PT-MoE) Transformer:
Scalability and Flexibility:

Both models are trained on a diverse dataset that includes multilingual text, audio, and image data. The training process emphasizes:
The evaluation of these models demonstrates their effectiveness across various tasks:
On-Device Model:
Server-Based Model:
For developers and researchers working on natural language processing (NLP) and speech processing, these new models offer several advantages:
Apple’s focus on Responsible AI ensures that these models are not only technically advanced but also ethically sound, making them valuable tools for building intelligent applications that respect user privacy and reduce bias.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
30 July 2024
88 articles
Related Articles
Related Articles
More Stories