
One of the more unexpected products to come out of the Microsoft Ignite 2023 event is a tool that can create a photorealistic avatar of a person and animate that avatar saying things the person didn't necessarily say.

Called Azure AI Speech text-to-speech avatar, the new feature, available in public preview as of today, lets users generate videos of an avatar speaking by uploading images of a person they wish the avatar to resemble and writing a script. Microsoft's tool trains a model to drive the animation, while a separate text-to-speech model (either prebuilt or trained on the person's voice) "reads" the script aloud.
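The script handed to the text-to-speech side of the pipeline is ordinarily plain text or SSML (Speech Synthesis Markup Language, the W3C standard that Azure's speech service accepts). As a rough illustration only (the helper function below is our own sketch, not part of Microsoft's avatar API; the voice name is one of Azure's prebuilt neural voices), wrapping a script in minimal SSML looks like this:

```python
def build_ssml(voice_name: str, script: str) -> str:
    """Wrap a plain-text script in a minimal SSML document.

    SSML is the markup format Azure's text-to-speech service accepts
    alongside plain text; the <voice> element selects which neural
    voice "reads" the script aloud.
    """
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">'
        f'<voice name="{voice_name}">{script}</voice>'
        "</speak>"
    )


ssml = build_ssml("en-US-JennyNeural", "Welcome to our product introduction.")
print(ssml)
```

A prebuilt voice just needs a name like the one above; a custom voice trained on a specific person's speech would be referenced the same way, by its deployed voice name.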

"With text-to-speech avatar, users can more efficiently create videos … to build training videos, product introductions, customer testimonials [and so on] simply with text input," writes Microsoft in a blog post. "You can use the avatar to build conversational agents, virtual assistants, chatbots and more."

Avatars can speak in multiple languages. And, for chatbot scenarios, they can tap AI models like OpenAI's GPT-3.5 to respond to off-script questions from customers.

But the feature raises a host of uncomfortable ethical questions.

One of the major sticking points in the recent SAG-AFTRA strike was the use of AI to create digital likenesses. Studios ultimately agreed to pay actors for their AI-generated likenesses. But what about Microsoft and its customers?

I emailed Microsoft about its position on companies using actors' likenesses without, in the actors' view, proper compensation or even notification. The company didn't respond as of publish time, nor did it say whether it would require that companies label avatars as AI-generated, like YouTube and a growing number of other platforms.

In a follow-up email, a spokesperson clarified that Microsoft requires custom avatar customers to obtain "explicit written permission" and consent statements from avatar talent, and to "ensure that the customer's agreement with each individual covers the duration, use and any content limitations." The company also mandates that customers add disclosures stating that the avatars have been created with AI and are AI-generated.

Personal voice

Microsoft appears to have more guardrails around a related generative AI tool, personal voice, that's also launching at Ignite.

Personal voice, a new capability within Microsoft's custom neural voice service, can replicate a user's voice in a few seconds given a one-minute speech sample as an audio prompt. Microsoft pitches it as a way to create personalized voice assistants, dub content into different languages and generate bespoke narrations for stories, audiobooks and podcasts.

To ward off potential legal headaches, Microsoft is banning the use of prerecorded speech, requiring that users give "explicit consent" in the form of a recorded statement, and verifying that this statement matches other, one-time-use training data before a customer can use personal voice to synthesize new speech. Access to the feature is gated behind a registration form for the time being, and customers must agree to use personal voice only in applications "where the voice does not read user-generated or open-ended content."

"Voice model usage must remain within an application, and output must not be publishable or shareable from the application," Microsoft writes in a blog post. "[C]ustomers who meet limited access eligibility criteria maintain sole control over the creation of, access to and use of the voice models and their output [where it concerns] dubbing for film, television, video and audio for entertainment scenarios only."

Microsoft didn't initially answer TechCrunch's questions about how actors might be compensated for their voice contributions, or whether it plans to implement any sort of watermarking tech so that AI-generated voices might be more easily identified.

Later in the day, a spokesperson said via email that watermarks will be automatically added to personal voices, making it possible to identify whether speech was synthesized, and which voice it was synthesized from. But there's a catch: building watermark detection into an app or platform requires gaining approval from Microsoft to use its watermark detection service, which plainly isn't ideal.

This story was originally published at 8 a.m. PT on Nov. 15 and updated at 3:30 p.m. PT.