
Share
Google slashes the price of Gemini 1.5 Flash, now offering enhanced tuning options and multilingual support, boosting accessibility for developers needing robust AI solutions at a lower cost.
Google has announced a series of significant updates for its Gemini AI models, particularly focusing on the popular Gemini 1.5 Flash version. These changes include a substantial price reduction, expanded multilingual support, and enhanced tuning capabilities. Here’s what you need to know:
Effective August 12, Google has slashed the costs associated with using Gemini 1.5 Flash, making it more affordable for developers working on high-volume, low-latency applications such as summarization, categorization, and multi-modal understanding.
These reductions apply to prompts under 128K tokens, with similar cuts cascading into the >128K tokens tier. Additionally, features like context caching will further reduce costs for developers.
Google has completed the rollout of tuning capabilities for Gemini 1.5 Flash, making it available to all developers. This allows you to fine-tune the model to better suit your specific use cases, improving performance and relevance.
The Gemini API has been expanded to support queries in over 100 additional languages. This broadens the model's applicability for international projects and multilingual applications.

Google is also expanding access to AI Studio for Google Workspace customers. This move aims to integrate AI capabilities more seamlessly into the workflow of businesses using Google's suite of productivity tools.
To make it easier for developers to get started and navigate the new features, Google has revamped the documentation UI and updated the API reference. These improvements aim to provide clearer guidance and better support.
These updates are significant for several reasons:
If you're a developer looking to leverage these updates, here are some steps to get started:
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
14 August 2024
88 articles
Related Articles
Related Articles
More Stories