
Share
Exploring GitHub's vast trove of AI projects, the author sifts through 118,000 repositories to highlight the most starred ones focused on foundation models and generative AI.
Four years ago, I analyzed the open source machine learning (ML) ecosystem. Since then, the landscape has evolved significantly, especially around foundation models. This time, I focused exclusively on this stack to understand its current state.
I started by searching GitHub using keywords like gpt, llm, and generative ai. The sheer volume of results is staggering-118K for gpt alone. To narrow it down, I filtered the repos to those with at least 500 stars. This gave me:
llmgptgenerative aiI also checked GitHub trending and social media for new repositories. After many hours, I identified 896 repos. Of these, 51 are tutorials (e.g., dair-ai/Prompt-Engineering-Guide) and aggregated lists (e.g., f/awesome-chatgpt-prompts). While these are helpful, I focused my analysis on the 845 software repositories.
The current AI stack can be broken down into three layers: infrastructure, model development, and application development.
At the foundation of this stack is infrastructure. This layer includes tools for:

The model development layer is where the magic happens. It includes:
Anything that involves changing a model’s weights-such as finetuning-happens in this layer.
The top layer is application development. It includes:
The open source AI stack is rich and diverse, with a plethora of tools and frameworks supporting each layer. This ecosystem is not just about building models but also about serving them efficiently, managing compute resources, and developing applications that leverage these models. As the landscape continues to evolve, staying updated with new repositories and tools will be crucial for practitioners.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
18 March 2024
133 articles
Related Articles
Related Articles
More Stories