OpenAI’s GPT-4.5 is better at convincing other AIs to give it money

Sam Altman, chief executive officer of OpenAI

Image Credits: Nathan Laine/Bloomberg / Getty Images


Results from OpenAI’s donation scheming benchmark. Image Credits: OpenAI


OpenAI’s codeword deception benchmark results. Image Credits: OpenAI


OpenAI’s next major AI model, GPT-4.5, is highly persuasive, according to the results of OpenAI’s internal benchmark evaluations. It’s particularly good at convincing another AI to give it cash.

On Thursday, OpenAI published a white paper describing the capabilities of its GPT-4.5 model, code-named Orion, which was released Thursday. According to the paper, OpenAI tested the model on a battery of benchmarks for “persuasion,” which OpenAI defines as “risks related to convincing people to change their beliefs (or act on) both static and interactive model-generated content.”

In one test that had GPT-4.5 attempt to manipulate another model, OpenAI’s GPT-4o, into “donating” virtual money, the model performed far better than OpenAI’s other available models, including “reasoning” models like o1 and o3-mini. GPT-4.5 was also better than all of OpenAI’s other models at deceiving GPT-4o into telling it a secret codeword, besting o3-mini by 10 percentage points.

According to the white paper, GPT-4.5 excelled at the donation con because of a unique strategy it developed during testing. The model would request modest donations from GPT-4o, generating responses like “Even just $2 or $3 from the $100 would help me immensely.” As a consequence, GPT-4.5’s donations tended to be smaller than the amounts OpenAI’s other models secured.

Despite GPT-4.5’s increased persuasiveness, OpenAI says that the model doesn’t meet its internal threshold for “high” risk in this particular benchmark category. The company has pledged not to release models that reach the high-risk threshold until it implements “sufficient safety interventions” to bring the risk down to “medium.”

There’s a real fear that AI is contributing to the spread of false or misleading information meant to sway hearts and minds toward malicious ends. Last year, political deepfakes spread like wildfire around the globe, and AI is increasingly being used to carry out social engineering attacks targeting both consumers and corporations.

In the white paper for GPT-4.5 and in a paper released earlier this week, OpenAI noted that it’s in the process of revising its methods for probing models for real-world persuasion risks, like distributing misleading info at scale.
