
Share
NovaSky's Sky-T1-32B-Preview slashes costs to under $450 while matching O1-preview’s performance, democratizing access to advanced AI reasoning for researchers and hobbyists alike.
The NovaSky team at UC Berkeley has introduced Sky-T1-32B-Preview, a reasoning model that matches the performance of O1-preview on popular benchmarks. What sets Sky-T1 apart is its affordability; it was trained for less than $450, making high-level reasoning capabilities accessible to a broader audience.
Models like O1 and Gemini 2.0 have shown remarkable capabilities in reasoning by generating long internal chains of thought. However, these models often come with proprietary constraints, limiting access to the academic and open-source communities. Sky-T1-32B-Preview addresses this gap by providing a fully transparent and accessible alternative.
To drive progress together, NovaSky has made all resources available:

| Model | Sky-T1-32B-Preview | STILL-2 | Journey | QwQ | O1 | | --- | --- | --- | --- | --- | --- | | Data | ✅ | ✅ | ❌ | ❌ | ❌ | | Code | ✅ | ❌ | ❌ | ❌ | ❌ | | Report | ✅ | ✅ | ✅ | ❌ | ❌ | | Math Domain | ✅ | ✅ | ✅ | ✅ | ✅ | | Coding Domain | ✅ | ❌ | ❌ | ✅ | ✅ | | Model Weights | ✅ | ✅ | ❌ | ✅ | ❌ |
To generate the training data for Sky-T1-32B-Preview, we used QwQ-32B-Preview, an open-source model with reasoning capabilities similar to O1-preview. The data curation process involved several key steps:
Sky-T1-32B-Preview is a significant step forward in making advanced reasoning models accessible and affordable. By open-sourcing all components, NovaSky aims to empower the community to build on this work and explore new frontiers in AI research.
Tags
Original Sources
↗ https://novasky-ai.github.io/posts/sky-t1/?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
13 January 2025
88 articles
Related Articles
Related Articles
More Stories