
Share
OpenAI's fine-tuning feature for GPT-4o lets developers tailor the AI model to their needs, enhancing performance and accuracy while reducing costs-ideal for businesses looking to integrate customized solutions.
August 20, 2024
OpenAI has announced the availability of fine-tuning for GPT-4o, a highly anticipated feature that allows developers to customize the model for their specific applications. This update comes with significant benefits, including improved performance, accuracy, and cost efficiency. OpenAI is also offering 1M training tokens per day for free until September 23, making it easier for organizations to get started.
Fine-tuning is a powerful tool that can significantly enhance the performance of large language models (LLMs) like GPT-4o. Here are some key benefits:
To start fine-tuning GPT-4o, follow these steps:

create and select gpt-4o-2024-08-06 from the base model drop-down.For those interested in a lighter version, GPT-4o mini is also available for fine-tuning:
gpt-4o-mini-2024-07-18 from the base model drop-down.OpenAI has been working with trusted partners to test fine-tuning on GPT-4o, and the results are promising. Here’s a success story:
Cosine Achieves State-of-the-Art Results on the SWE-bench Benchmark
Cosine’s Genie is an AI software engineering assistant that can autonomously identify and resolve bugs, build features, and refactor code in collaboration with users. By fine-tuning GPT-4o on examples of real software engineers at work, Cosine has achieved state-of-the-art results on the SWE-bench benchmark. This customization allows Genie to reason across complex technical problems more accurately and efficiently.
OpenAI is committed to expanding model customization options for developers. Fine-tuning is just the beginning, and we can expect more features and improvements in the future.
For detailed instructions and best practices, visit OpenAI’s fine-tuning documentation.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
21 August 2024
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories