Mistral Small 3.1: A Lightweight, Multimodal, and Multilingual Model with SOTA Performance

Models & Research

The Engineer

18 Mar 2025 · 4 min read

Mistral AI's latest update to its small model lineup brings unparalleled performance and expanded capabilities, making it a standout choice for both text and multimodal tasks with a larger context window.

Mistral AI has announced the release of Mistral Small 3.1, a significant update to their existing small model lineup. This new version not only improves on text performance but also enhances multimodal understanding and extends the context window to up to 128k tokens. Let's dive into what's changed technically and why it matters for practitioners.

Technical Improvements

Text Performance: Mistral Small 3.1 outperforms comparable models like Gemma 3 and GPT-4o Mini in text benchmarks. This is crucial for applications that rely heavily on natural language processing (NLP), such as chatbots, content generation, and document summarization.
Multimodal Understanding: The model now excels in handling multimodal inputs, which include both text and images. This capability is essential for tasks like image captioning, visual question answering, and cross-modal retrieval.
Expanded Context Window: With a context window of up to 128k tokens, Mistral Small 3.1 can process longer documents and maintain coherence over extended sequences. This is particularly useful for applications that require understanding long-form content, such as legal documents, technical manuals, and research papers.

Inference Speed

Mistral Small 3.1 delivers impressive inference speeds of 150 tokens per second, making it suitable for real-time applications where low latency is critical. This performance is achieved without compromising on accuracy or context window size, which is a significant achievement in the field of small models.

Licensing and Availability

The model is released under the Apache 2.0 license, making it freely available for both commercial and non-commercial use. This open-source approach encourages transparency and collaboration within the AI community, allowing researchers and developers to build upon and improve the model.

Performance Benchmarks

Instruct Performance

Text Instruct Benchmarks: Mistral Small 3.1 outperforms leading models in text instruction benchmarks, demonstrating its robustness in understanding and following complex instructions.
Multimodal Instruct Benchmarks (MM-MT-Bench): The model scores between 0 and 100 on MM-MT-Bench, a benchmark that evaluates multimodal task performance. This indicates strong capabilities in handling tasks that require both textual and visual inputs.

Pretrained Performance

All Pretrain: The pretrained base model for Mistral Small 3.1 is also available, providing a solid foundation for further fine-tuning and customization.

Use Cases

Mistral Small 3.1 is designed to handle a wide range of generative AI tasks, making it versatile and adaptable to various applications:

Instruction Following: Ideal for chatbots and virtual assistants that need to understand and execute user commands accurately.
Conversational Assistance: Suitable for real-time conversational systems where quick responses are essential.
Image Understanding: Capable of processing and generating content based on visual inputs, enhancing its utility in image-related tasks.
Function Calling: Supports rapid function execution within automated workflows, making it a valuable tool for agentic AI applications.

Key Features and Capabilities

Lightweight: The model can run on a single RTX 4090 or a Mac with 32GB RAM, making it suitable for on-device use cases.
Fast-response Conversational Assistance: Ideal for virtual assistants and other applications requiring quick, accurate responses.
Low-latency Function Calling: Capable of rapid function execution within automated workflows.
Fine-tuning for Specialized Domains: Can be fine-tuned to specialize in specific domains, such as legal advice, medical diagnostics, and technical support.
Foundation for Advanced Reasoning: The community has already built several advanced reasoning models on top of Mistral Small 3.1, showcasing its potential for further innovation.

Community Impact

The release of both base and instruct checkpoints for Mistral Small 3.1 enables the community to continue building and customizing the model. This open approach fosters a collaborative environment where developers can leverage the strengths of the model to create specialized applications and tools.