
Share
Unsloth Studio's new tools allow developers to run Google’s versatile Gemma 3n model locally, enabling fine-tuning on diverse data without cloud dependencies.
Google's Gemma 3n is a powerful multimodal model that can handle image, audio, video, and text inputs. It comes in two sizes-2B (E2B) and 4B (E4B)-and supports 140 languages for both text and multimodal tasks. The latest update from Unsloth Studio makes it possible to run and fine-tune Gemma 3n locally using their tools, which is a significant step forward for practitioners looking to leverage this model without relying on cloud infrastructure.
Unsloth Studio has made it straightforward to run Gemma 3n locally. The model is available in several configurations, each optimized for different use cases:
You can download the pre-trained models from Hugging Face:
2B (E2B) Models:
4B (E4B) Models:

For a comprehensive list of all available Gemma 3n models, including base and other formats, check out the Unsloth collection on Hugging Face.
Fine-tuning is a crucial step to adapt the model to specific tasks or datasets. Unsloth Studio provides a free Colab notebook to get you started:
The notebook includes detailed instructions and code snippets to fine-tune Gemma 3n. Here are the official recommended settings for inference:
Unsloth Studio has addressed several issues with GGUFs not working properly in Ollama. If you are using Ollama, it's recommended to redownload the models to ensure they function correctly.
For a deeper dive into the technical details and fixes, refer to the [Fixes + Technical Analysis](https://unsloth.ai/docs/models/tutorials/gemma-3-how-to-run-and-fine-t
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
4 July 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories