
Share
Apple's OpenELM marks a significant shift by enabling powerful text generation directly on devices, balancing efficiency with privacy concerns in an era dominated by cloud-based AI services.
Apple has just joined the race to bring generative AI capabilities to on-device applications with the release of OpenELM, a new family of open-source large language models (LLMs). Unlike most LLMs that rely on cloud servers for computation, OpenELM is designed to run entirely on a single device, making it ideal for scenarios where low latency and privacy are crucial.
Apple released OpenELM on Hugging Face just a few hours ago. This family of models includes eight variants, divided into four pre-trained and four instruction-tuned versions. The parameter sizes range from 270 million to 3 billion parameters, which is relatively small compared to some of the behemoths in the LLM world (think GPT-4 or PaLM). However, these smaller sizes are precisely what make OpenELM suitable for on-device deployment.
Model Variants:
Parameter Sizes:

Pre-training:
Instruction Tuning:
Apple has provided the weights of its OpenELM models under a sample code license, which allows for both commercial usage and modification. The license only requires that if you redistribute the software without modifications, you must retain the original notice and disclaimers.
OpenELM represents a significant step forward in making generative AI accessible and practical for on-device applications. By providing both pre-trained and instruction-tuned models, Apple is offering developers the flexibility to choose the right variant for their needs. The sample code license further encourages innovation and adoption, making OpenELM a valuable addition to the LLM landscape.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
25 April 2024
133 articles
Related Articles
Related Articles
More Stories