
Share
Microsoft's latest Phi 3.5 models outshine competitors with advanced capabilities in reasoning and multimodal analysis, offered freely on Hugging Face under a permissive license, sparking new possibilities for AI developers worldwide.
Microsoft has once again upped the ante in the AI landscape with the release of three new models in its evolving Phi series. These models-Phi-3.5-mini-instruct, Phi-3.5-MoE-instruct, and Phi-3.5-vision-instruct-are designed to tackle a range of tasks from fast reasoning to advanced multimodal analysis. Each model is available on Hugging Face under a permissive Microsoft-branded MIT License, allowing for commercial usage and modification.
The Phi-3.5-mini-instruct is a lightweight model with 3.82 billion parameters, tailored for instruction adherence and supporting a 128k token context length. This makes it ideal for environments where memory or compute resources are limited, such as edge devices or low-power servers.
Despite its compact size, the Phi-3.5-mini-instruct demonstrates competitive performance in multilingual and multi-turn conversational tasks. It outperforms similarly-sized models like Llama-3.1-8B-instruct and Mistral-7B-instruct on the RepoQA benchmark, which measures long context code understanding.
The Phi-3.5-MoE (Mixture of Experts) model is a significant addition to Microsoft's AI arsenal. With 41.9 billion parameters, it leverages the Mixture of Experts approach to achieve powerful reasoning capabilities.

This model is designed for scenarios where high computational power is available, making it suitable for research and large-scale applications. It shows near state-of-the-art performance on various benchmarks, often outperforming models from Google, Meta, and OpenAI.
The Phi-3.5-vision-instruct is a 4.15 billion parameter model specifically designed for vision tasks such as image and video analysis.
This model is particularly useful for applications in computer vision, where it can handle both static images and dynamic video content with high accuracy and efficiency.
All three models have been rigorously tested on various benchmarks and have shown impressive results:
Phi-3.5-mini-instruct:
Phi-3.5-MoE:
Phi-3.5-vision-instruct:
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
26 August 2024
88 articles
Related Articles
Related Articles
More Stories