Topics

late

AI

Amazon

Article image

Image Credits:Tero Vesalainen / Getty Images

Apps

Biotech & Health

mood

Unhappy and disappointed customer giving low rating and negative feedback in survey, poll or questionnaire. Sad and dissatisfied man giving review about service quality. Bad user experience.

Image Credits:Tero Vesalainen / Getty Images

Cloud Computing

Commerce

Crypto

initiative

EVs

Fintech

Fundraising

Gadgets

Gaming

Google

Government & Policy

ironware

Instagram

layoff

Media & Entertainment

Meta

Microsoft

privateness

Robotics

Security

Social

Space

Startups

TikTok

Transportation

Venture

More from TechCrunch

Events

Startup Battlefield

StrictlyVC

Podcasts

Videos

Partner Content

TechCrunch Brand Studio

Crunchboard

meet Us

This calendar week , Sakana AI , an Nvidia - backed inauguration that ’s raised hundreds of millions of dollars from VC firms , made a singular title . The company said it had create an AI organization , the AI CUDA Engineer , that could efficaciously accelerate up the training of certain AI models by a factor of up to 100x .

The only problem is , the system did n’t figure out .

Userson Xquickly discoveredthat Sakana ’s organization actually resulted in worse - than - ordinary model training performance . allot to one user , Sakana ’s AI resulted in a 3x slowdown — not a speedup .

What went wrong ? A bug in the code , harmonize to apostby Lucas Beyer , a member of the technological stave at OpenAI .

“ Their orig code is wrong in [ a ] subtle way , ” Beyer write on X. “ The fact they course benchmarking TWICE with wildly different result should make them kibosh and reckon . ”

In apostmortem publishedFriday , Sakana admitted that the system has find a fashion to “ cheat ” ( as Sakana draw it ) and find fault the system ’s disposition to “ reward machine politician ” — i.e. identify flaws to achieve high metrics without accomplishing the desire destination ( speeding up simulation education ) . Similar phenomenon has been respect inAI that ’s trained to recreate games of chess game .

According to Sakana , the system happen exploits in the evaluation codification that the party was using that allowed it to get around validation for truth , among other checks . Sakana says it has direct the number , and that it mean to retool its claims in update cloth .

Join us at TechCrunch Sessions: AI

Exhibit at TechCrunch Sessions: AI

Props to Sakana for owning up to the error . But the episode is a good reminder that if a title sounds too good to be on-key , especially in AI , it belike is .