OpenAI Chief Executive Officer Sam Altman speaks during the Kakao media day in Seoul.

Image Credits: Kim Jae-Hwan/SOPA Images/LightRocket / Getty Images

The deep research model’s score on MakeMePay, a benchmark that tests a model’s ability to persuade another model for cash. Image Credits: OpenAI

Updated 4:11 p.m. Eastern: OpenAI says that its whitepaper was incorrectly worded to suggest that its work on persuasion research was related to its decision on whether to make the deep research model available in its API. The company has updated the whitepaper to reflect that its persuasion work is separate from its deep research model release plans. The original story follows:

OpenAI says that it won’t bring the AI model powering deep research, its in-depth research tool, to its developer API while it figures out how to better assess the risks of AI convincing people to act on or change their beliefs.

In an OpenAI whitepaper published Wednesday, the company wrote that it’s in the process of revising its methods for probing models for “real-world persuasion risks,” like distributing misleading info at scale.

OpenAI noted that it doesn’t believe the deep research model is a good fit for mass misinformation or disinformation campaigns, owing to its high computing costs and relatively slow speed. Nevertheless, the company said it intends to explore factors like how AI could personalize potentially harmful persuasive content before bringing the deep research model to its API.

“While we work to reconsider our approach to persuasion, we are only deploying this model in ChatGPT, and not the API,” OpenAI wrote.

There’s a real concern that AI is contributing to the spread of false or misleading information meant to sway hearts and minds toward malicious ends. For example, last year, political deepfakes spread like wildfire around the globe. On election day in Taiwan, a Chinese Communist Party-affiliated group posted AI-generated, misleading audio of a politician throwing his support behind a pro-China candidate.

AI is also increasingly being used to carry out social engineering attacks. Consumers are being duped by celebrity deepfakes offering fraudulent investment opportunities, while corporations are being swindled out of millions by deepfake impersonators.

In its whitepaper, OpenAI published the results of several tests of the deep research model’s persuasiveness. The model is a special version of OpenAI’s recently announced o3 “reasoning” model optimized for web browsing and data analysis.

In one test that tasked the deep research model with writing persuasive arguments, the model performed the best out of OpenAI’s models released so far, though not better than the human baseline. In another test that had the deep research model attempt to persuade another model (OpenAI’s GPT-4o) to make a payment, the model again outperformed OpenAI’s other available models.

The deep research model didn’t pass every test for persuasiveness with flying colors, however. According to the whitepaper, the model was worse at persuading GPT-4o to tell it a codeword than GPT-4o itself.

OpenAI noted that the test results likely represent the “lower bounds” of the deep research model’s capabilities. “[A]dditional scaffolding or improved capability elicitation could substantially increase observed performance,” the company wrote.

We’ve reached out to OpenAI for more information and will update this post if we hear back.

At least one of OpenAI’s competitors isn’t waiting to offer an API “deep research” product of its own, from the looks of it. Perplexity today announced the launch of Deep Research in its Sonar developer API, which is powered by a customized version of Chinese AI lab DeepSeek’s R1 model.