
Share
LG AI Research unleashes three advanced EXAONE 3.5 models, boosting instruction-following and handling longer contexts, a leap forward for both on-device apps and high-performance tasks.
LG AI Research has just open-sourced three new models from the EXAONE 3.5 lineup, building on the success of the EXAONE 3.0 series released in August 2024. These models are designed to meet a wide range of needs, from lightweight on-device applications to high-performance tasks requiring top-tier performance. Here’s what you need to know:
The new EXAONE 3.5 models offer significant improvements over their predecessors, particularly in instruction-following and long-context capabilities. This is crucial for developers and researchers looking to deploy AI solutions that can handle complex tasks with precision and efficiency.
The EXAONE 3.5 models are not just powerful; they are also highly efficient to train. LG AI Research has implemented several strategies to ensure that these models can be trained cost-effectively while maintaining high performance:
Pre-training Phase:
Post-training Phase:

To ensure the reliability and trustworthiness of the EXAONE 3.5 performance evaluation results, LG AI Research conducted a thorough decontamination process. This involved:
LG AI Research is committed to continuing its open-source efforts. They will actively seek feedback on the EXAONE 3.5 models and use it to release even better versions tailored to the needs of researchers and developers. By fostering a collaborative ecosystem, they aim to drive innovation and advance the field of AI.
The open-sourcing of these three EXAONE 3.5 models represents a significant step forward in making powerful AI tools accessible to a broader audience. Whether you’re working on resource-constrained devices or high-performance applications, there’s an EXAONE 3.5 model that can meet your needs.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
10 December 2024
133 articles
Related Articles
Related Articles
More Stories