Image Credits: Andrey Rudakov/Bloomberg / Getty Images


On Thursday, weeks after launching its most powerful AI model yet, Gemini 2.5 Pro, Google published a technical report showing the results of its internal safety evaluations. However, the report is light on details, experts say, making it difficult to determine which risks the model might pose.

Technical reports provide useful (and, at times, unflattering) information that companies don't always widely advertise about their AI. By and large, the AI community sees these reports as good-faith efforts to support independent research and safety evaluations.

Google takes a different safety reporting approach than some of its AI rivals, publishing technical reports only once it considers a model to have graduated from the "experimental" stage. The company also doesn't include findings from all of its "dangerous capability" evaluations in these write-ups; it reserves those for a separate audit.

Several experts TechCrunch spoke with were still disappointed by the sparsity of the Gemini 2.5 Pro report, however, which they noted doesn't mention in great detail Google's proposed Frontier Safety Framework (FSF). Google introduced the FSF last year in what it described as an effort to identify future AI capabilities that could cause "severe harm."

"This [report] is very sparse, contains minimal information, and came out weeks after the model was already made available to the public," Peter Wildeford, co-founder of the Institute for AI Policy and Strategy, told TechCrunch. "It's impossible to verify if Google is living up to its public commitments and thus impossible to assess the safety and security of their models."

Thomas Woodside, co-founder of the Secure AI Project, said that while he's glad Google released a report for Gemini 2.5 Pro, he's not convinced of the company's commitment to delivering timely supplemental safety evaluations. Woodside pointed out that the last time Google published the results of dangerous capability tests was in June 2024, for a model announced in February of that same year.

Not inspiring much confidence, Google hasn't made available a report for Gemini 2.5 Flash, a smaller, more efficient model the company announced last week. A spokesperson told TechCrunch a report for Flash is "coming soon."


"I hope this is a promise from Google to start publishing more frequent updates," Woodside told TechCrunch. "Those updates should include the results of evaluations for models that haven't been publicly deployed yet, since those models could also pose serious risks."

Google may have been one of the first AI labs to propose standardized reports for models, but it's not the only one that's been accused of underdelivering on transparency lately. Meta released a similarly skimpy safety evaluation of its new Llama 4 open models, and OpenAI opted not to publish any report for its GPT-4.1 series.

Hanging over Google's head are assurances the tech giant made to regulators to maintain a high standard of AI safety testing and reporting. Two years ago, Google told the U.S. government it would publish safety reports for all "significant" public AI models "within scope." The company followed up that promise with similar commitments to other countries, pledging to "provide public transparency" around AI products.

Kevin Bankston, a senior adviser on AI governance at the Center for Democracy and Technology, called the trend of sporadic and vague reports a "race to the bottom" on AI safety.

"Combined with reports that competing labs like OpenAI have shaved their safety testing time before release from months to days, this meager documentation for Google's top AI model tells a troubling story of a race to the bottom on AI safety and transparency as companies rush their models to market," he told TechCrunch.

Google has said in statements that, while not detailed in its technical reports, it conducts safety testing and "adversarial red teaming" for models ahead of release.

Updated 4/22 at 12:58 p.m. Pacific: Modified language around the report's reference to Google's FSF.