AI researchers at Stanford and the University of Washington were able to train an AI “reasoning” model for under $50 in cloud compute credits, according to a new research paper released last Friday.
The model, known as s1, performs similarly to cutting-edge reasoning models, such as OpenAI's o1 and DeepSeek's R1, on tests measuring math and coding abilities. The s1 model is available on GitHub, along with the data and code used to train it.
The team behind s1 said they started with an off-the-shelf base model, then fine-tuned it through distillation, a process to extract the “reasoning” capabilities from another AI model by training on its answers.
The researchers say s1 is distilled from one of Google's reasoning models, Gemini 2.0 Flash Thinking Experimental. Distillation is the same approach Berkeley researchers used to create an AI reasoning model for around $450 last month.
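To make the distillation step concrete, here is a minimal sketch of how collecting a teacher model's answers might look, using Google's public google-generativeai Python client. The model ID, API key handling, and record fields are illustrative assumptions, not the s1 authors' actual pipeline:

```python
# Sketch of distillation data collection: query the teacher model and keep
# its answer as a training target for the student. The client library and
# model ID are assumptions based on Google's public Gemini API.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
teacher = genai.GenerativeModel("gemini-2.0-flash-thinking-exp")

def distill_one(question: str) -> dict:
    response = teacher.generate_content(question)
    # The student model is later fine-tuned to reproduce this output.
    return {"question": question, "teacher_answer": response.text}
```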
To some, the idea that a few researchers without millions of dollars behind them can still innovate in the AI space is exciting. But s1 raises real questions about the commoditization of AI models.
Where's the moat if someone can closely replicate a multi-million-dollar model with relative pocket change?
Unsurprisingly, big AI labs aren't happy. OpenAI has accused DeepSeek of improperly harvesting data from its API for the purpose of model distillation.
The researchers behind s1 were looking to find the simplest approach to achieve strong reasoning performance and “test-time scaling,” or allowing an AI model to think more before it answers a question. These were a few of the breakthroughs in OpenAI's o1, which DeepSeek and other AI labs have tried to replicate through various techniques.
The s1 paper suggests that reasoning models can be distilled with a relatively small dataset using a process called supervised fine-tuning (SFT), in which an AI model is explicitly instructed to mimic certain behaviors in a dataset.
SFT tends to be cheaper than the large-scale reinforcement learning method that DeepSeek used to train its competitor to OpenAI's o1 model, R1.
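As a rough illustration, supervised fine-tuning on distilled traces can be expressed in a few lines with Hugging Face's TRL library. This is a sketch, not the authors' code (which is on GitHub); the base model name and the single-field data format are assumptions, and TRL's API varies by version:

```python
# Minimal SFT sketch: train a student model to mimic teacher reasoning
# traces. Assumes Hugging Face's trl and datasets libraries.
from datasets import Dataset
from trl import SFTConfig, SFTTrainer

# Each record flattens a question, reasoning trace, and answer into one string.
train_data = Dataset.from_list([
    {"text": "Question: ...\n<think>...</think>\nAnswer: ..."},
])

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-32B-Instruct",  # assumption: a Qwen base model
    train_dataset=train_data,
    args=SFTConfig(output_dir="s1-sft"),
)
trainer.train()
```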
Google offers free access to Gemini 2.0 Flash Thinking Experimental, albeit with daily rate limits, via its Google AI Studio platform.
Google's terms forbid reverse-engineering its models to develop services that compete with the company's own AI offerings, however. We've reached out to Google for comment.
s1 is based on a small, off-the-shelf AI model from Alibaba-owned Chinese AI lab Qwen, which is available to download for free. To train s1, the researchers created a dataset of just 1,000 carefully curated questions, paired with answers to those questions, as well as the “thinking” process behind each answer from Google's Gemini 2.0 Flash Thinking Experimental.
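For a sense of what one of those 1,000 records might contain, here is a hypothetical example; the field names and the <think> markers are our illustration, not the paper's exact format:

```python
# One illustrative training record: question, teacher "thinking" trace, and
# final answer, flattened into the text the student learns to reproduce.
record = {
    "question": "How many positive divisors does 360 have?",
    "thinking": "360 = 2^3 * 3^2 * 5, so (3+1)(2+1)(1+1) = 24 divisors.",
    "answer": "24",
}

training_text = (
    f"Question: {record['question']}\n"
    f"<think>{record['thinking']}</think>\n"
    f"Answer: {record['answer']}"
)
```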
After training s1, which took less than 30 minutes using 16 Nvidia H100 GPUs, s1 achieved strong performance on certain AI benchmarks, according to the researchers. Niklas Muennighoff, a Stanford researcher who worked on the project, told TechCrunch he could rent the necessary compute today for about $20.
The researchers used a nifty trick to get s1 to double-check its work and extend its “thinking” time: They told it to wait. Adding the word “wait” during s1's reasoning helped the model arrive at slightly more accurate answers, per the paper.
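In code, the trick (the paper calls it “budget forcing”) amounts to intercepting the model's end-of-thinking marker and appending “Wait” instead, so the model keeps reasoning. The sketch below assumes a generic generate function and a </think> stop marker, both stand-ins for whatever inference stack s1 actually runs on:

```python
# Sketch of "budget forcing": when the model tries to stop thinking,
# append "Wait" so it re-examines its reasoning before answering.
def think_with_waits(generate, prompt: str, extra_waits: int = 2) -> str:
    transcript = prompt
    for remaining in range(extra_waits, -1, -1):
        # Generate until the model tries to close its thinking section.
        transcript += generate(transcript, stop="</think>")
        if remaining > 0:
            transcript += "Wait"  # suppress stopping; extend thinking time
        else:
            transcript += "</think>"  # allow the final answer phase
    return transcript + generate(transcript, stop=None)
```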
In 2025, Meta, Google, and Microsoft plan to invest hundreds of billions of dollars in AI infrastructure, which will partially go toward training next-generation AI models.
That level of investment may still be necessary to push the envelope of AI innovation. Distillation has shown to be a good method for cheaply re-creating an AI model's capabilities, but it doesn't create new AI models vastly better than what's available today.