
Share
While massive language models dominate tech headlines, smaller AI models are stealthily boosting corporate America's productivity by offering practical solutions at a lower cost.
In the world of artificial intelligence, large language models (LLMs) like GPT-4 and Anthropic’s Claude have been grabbing all the headlines. These behemoths are breaking cognitive records by passing legal exams, winning math olympiads, and even generating entire articles. However, for many practical applications in corporate America, it's not these sophisticated models that are making the biggest impact. Instead, smaller, more efficient models are quietly driving productivity and cost efficiency.
The paradox lies in the disconnect between what gets media attention and what actually works in a business context. While LLMs continue to push the boundaries of what AI can do, they often come with significant drawbacks for corporate use:
Small models, on the other hand, offer several advantages:
Small models are being used in a variety of corporate productivity tools:

Small models are also transforming "knowledge factories", environments where large amounts of data need to be processed and analyzed:
When implementing small models, companies often focus on the following aspects:
While large language models continue to capture the public's imagination, it's the smaller, more efficient models that are making a significant impact in corporate America. By focusing on cost efficiency, faster response times, and easier deployment, these models are driving productivity and transforming how businesses operate.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
3 November 2025
133 articles
Related Articles
Related Articles
More Stories