
Share
Pixtral Large, once the cornerstone of Mistral AI's offerings with its 124 billion parameters, is being phased out as the company looks to the future. This retrospective explores why it was groundbreaking and what legacy it leaves behind.
Nov 18, 2024
Mistral AI Team
Pixtral Large, once a flagship model in the Mistral AI lineup, has been deprecated. The team now focuses on more advanced vision and multimodal models. However, it's worth revisiting what made Pixtral Large such a significant milestone in the field.
Pixtral Large was built on top of Mistral Large 2, extending its multimodal capabilities without compromising text performance. The model's architecture includes:
The large context window (128K tokens) allowed the model to process multiple high-resolution images simultaneously, which is crucial for tasks like document analysis and chart recognition. This feature was a significant improvement over previous models, which often struggled with handling complex visual data efficiently.
Pixtral Large was evaluated against other leading models on several benchmarks:

Pixtral Large was versatile in its applications:
Prompt: I bought the coffee and the sausage. How much do I owe? Add an 18% tip.
Response: To calculate the total amount you owe, including the 18% tip, follow these steps:
Calculate the cost of the coffee and the sausage:
Total before tip: 9.00 CHF + 6.00 CHF = 15.00 CHF
Calculate the 18% tip:
Total amount you owe: 15.00 CHF + 2.70 CHF = 17.70 CHF
Pixtral Large was available under the Mistral Research License (MRL) for research and educational use. For commercial purposes, including experimentation, testing, and production, the model could be licensed through the Mistral Commercial License.
Despite its deprecation, Pixtral Large remains a testament to the rapid advancements in multimodal AI and sets a high bar for future models.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
19 November 2024
88 articles
Related Articles
Related Articles
More Stories