[Image: OpenAI unveils ChatGPT Advanced Voice Mode in May 2024. Image Credits: OpenAI]

OpenAI began rolling out ChatGPT's Advanced Voice Mode on Tuesday, giving users their first access to GPT-4o's hyperrealistic audio responses. The alpha version will be available to a small group of ChatGPT Plus users today, and OpenAI says the feature will gradually roll out to all Plus users in the fall of 2024.

When OpenAI first showcased GPT-4o's voice in May, the feature shocked audiences with quick responses and an uncanny resemblance to a real human's voice – one in particular. The voice, Sky, resembled that of Scarlett Johansson, the actress behind the artificial assistant in the movie "Her." Soon after OpenAI's demo, Johansson said she refused multiple inquiries from CEO Sam Altman to use her voice, and after seeing GPT-4o's demo, hired legal counsel to defend her likeness. OpenAI denied using Johansson's voice, but later removed the voice shown in its demo. In June, OpenAI said it would delay the release of Advanced Voice Mode to improve its safety measures.

One month later, and the wait is over (sort of). OpenAI says the video and screen-sharing capabilities showcased during its Spring Update will not be part of this alpha, launching at a "later date." For now, the GPT-4o demo that blew everyone away is still just a demo, but some premium users will now have access to ChatGPT's voice feature shown there.

ChatGPT can now talk and listen

You may have already tried out the Voice Mode currently available in ChatGPT, but OpenAI says Advanced Voice Mode is different. ChatGPT's old solution to audio used three separate models: one to convert your voice to text, GPT-4 to process your prompt, and then a third to convert ChatGPT's text into voice. But GPT-4o is multimodal, capable of processing these tasks without the help of auxiliary models, creating significantly lower-latency conversations. OpenAI also claims GPT-4o can sense emotional intonations in your voice, including sadness, excitement or singing.
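The contrast between the old three-model chain and the single multimodal model can be sketched roughly as below. This is a minimal illustration with stub functions standing in for the real models; the function names and stub behavior are assumptions for the sketch, not OpenAI's actual API.

```python
# Hypothetical sketch of the old three-stage voice pipeline the article
# describes, versus a single multimodal call. All functions are stubs.

def speech_to_text(audio: bytes) -> str:
    # Stage 1: a dedicated model transcribes the user's audio.
    return audio.decode("utf-8")  # stub: treat the "audio" as UTF-8 text

def llm_respond(prompt: str) -> str:
    # Stage 2: a text-only model (GPT-4 in the old pipeline) handles the prompt.
    return f"Echo: {prompt}"  # stub response

def text_to_speech(text: str) -> bytes:
    # Stage 3: a third model synthesizes the reply audio.
    return text.encode("utf-8")  # stub: return the text as bytes

def legacy_voice_mode(audio: bytes) -> bytes:
    # Three hops: each adds latency, and paralinguistic cues such as tone
    # and emotion are lost because only plain text passes between stages.
    return text_to_speech(llm_respond(speech_to_text(audio)))

def multimodal_voice_mode(audio: bytes) -> bytes:
    # GPT-4o-style: one model consumes and emits audio directly, so
    # intonation can survive and latency drops. Stubbed as a single step.
    return b"Echo: " + audio

print(legacy_voice_mode(b"hello"))      # b'Echo: hello'
print(multimodal_voice_mode(b"hello"))  # b'Echo: hello'
```

The outputs match here only because the stubs are trivial; the point of the sketch is the number of model hops, which is what drives the latency difference the article mentions.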

In this pilot, ChatGPT Plus users will get to see firsthand how hyperrealistic OpenAI's Advanced Voice Mode really is. TechCrunch was unable to test the feature before publishing this article, but we will review it when we get access.

OpenAI says it's releasing ChatGPT's new voice gradually to closely monitor its usage. People in the alpha group will get an alert in the ChatGPT app, followed by an email with instructions on how to use it.

In the months since OpenAI's demo, the company says it tested GPT-4o's voice capabilities with more than 100 external red teamers who speak 45 different languages. OpenAI says a report on these safety efforts is coming in early August.


The company says Advanced Voice Mode will be limited to ChatGPT's four preset voices – Juniper, Breeze, Cove and Ember – made in collaboration with paid voice actors. The Sky voice shown in OpenAI's May demo is no longer available in ChatGPT. OpenAI spokesperson Lindsay McCallum says "ChatGPT cannot impersonate other people's voices, both individuals and public figures, and will block outputs that differ from one of these preset voices."

OpenAI is trying to avoid deepfake controversies. In January, AI startup ElevenLabs's voice cloning technology was used to impersonate President Biden, deceiving primary voters in New Hampshire.

OpenAI also says it introduced new filters to block certain requests to generate music or other copyrighted audio. In the last year, AI companies have landed themselves in legal trouble for copyright infringement, and audio models like GPT-4o unlock a whole new category of companies that can file a complaint. Particularly, record labels, which have a history of being litigious, and have already sued AI song generators Suno and Udio.