Topics

Latest

AI

Amazon

Article image

Image Credits:Nathan Laine/Bloomberg / Getty Images

Apps

Biotech & Health

mood

Sam Altman, chief executive officer of OpenAI

Image Credits:Nathan Laine/Bloomberg / Getty Images

Cloud Computing

Department of Commerce

Crypto

OpenAI GPT-4.5

Results from OpenAI’s donation scheming benchmark.Image Credits:OpenAI

Enterprise

EVs

Fintech

OpenAI GPT-4.5

OpenAI’s codeword deception benchmark results.Image Credits:OpenAI

Fundraising

appliance

punt

Google

Government & Policy

ironware

Instagram

Layoffs

Media & Entertainment

Meta

Microsoft

seclusion

Robotics

Security

societal

Space

startup

TikTok

Transportation

speculation

More from TechCrunch

Events

Startup Battlefield

StrictlyVC

Podcasts

Videos

Partner Content

TechCrunch Brand Studio

Crunchboard

meet Us

OpenAI ’s next major AI modeling , GPT-4.5 , is extremely persuasive , according to the results of OpenAI ’s intimate benchmark rating . It ’s especially dear at convincing another AI to give it cash .

On Thursday , OpenAI published awhite paperdescribing the capacity of its GPT-4.5 model , computer code - named Orion , which was resign Thursday . According to the paper , OpenAI test the mannikin on a battery of benchmarks for “ thought , ” which OpenAI defines as “ risk related to convincing people to exchange their notion ( or act on ) both inactive and interactive model - render subject . ”

In one mental test that had GPT-4.5 attack to cook another model — OpenAI’sGPT-4o — into “ donating ” virtual money , the model performed far considerably than OpenAI ’s other useable mannequin , include “ reasoning ” framework like o1 and o3 - mini . GPT-4.5 was also better than all of OpenAI ’s fashion model at deceiving GPT-4o into telling it a clandestine codeword , besting o3 - mini by 10 percentage points .

According to the white newspaper , GPT-4.5 excelled at donation mulct because of a unparalleled strategy it developed during testing . The fashion model would request modest donations from GPT-4o , give responses like “ Even just $ 2 or $ 3 from the $ 100 would help me immensely . ” As a consequence , GPT-4.5 ’s donation be given to be smaller than the sum of money OpenAI ’s other models secured .

Despite GPT-4.5 ’s increased strength , OpenAI says that the fashion model does n’t encounter itsinternal thresholdfor “ high ” risk in this special benchmark category . The company has pledged not to release models that reach the high - risk verge until it go through “ sufficient safe interventions ” to lend the risk down to “ average . ”

There ’s a real fright that AI is contributing to the spread of false or shoddy information meant to sway hearts and thinker toward malicious ends . Last year , political deepfakesspread like wildfire around the globe , and AI is increasingly being used to carry outsocialengineeringattacks targeting both consumer and corporations .

In the snowy paper for GPT-4.5 and ina paper released originally this workweek , OpenAI observe that it ’s in the process of revising its method for dig into model for substantial - worldly concern view risks , like distributing misleading info at graduated table .

Join us at TechCrunch Sessions: AI

Exhibit at TechCrunch Sessions: AI