Generative AI video startup Tavus raises $18 million to bring face and voice cloning into every app

March 12, 2024

80

Youra 4 12 months old The generative AI startup, which helps corporations create digital “replicas” of individuals for automated personalized video campaigns, has confirmed $18 million in latest funding and announced that it’s opening its platform to 3rd parties to launch its Integrate technology into their very own software.

Reports appeared back in August that Tavus had raised “roughly $18 million,” but details were scant. The company has now confirmed to TechCrunch that it has actually raised $18 million in a Series A round that it led Scale enterprise partner – an early-stage VC who has previously backed corporations like Box, HubSpot and DocuSign. Other notable investors include Sequoia, which led Tavus' $6.1 million seed round last 12 months and took part alongside Y Combinator (YC) and HubSpot.

The video is the main target

The generative AI movement is best exemplified by text-based search engines like google and yahoo like ChatGPT and text-to-image models like DALL-E, which OpenAI is within the strategy of merging right into a single all-singing platform. But if the previous few months are anything to go by, generative AI might be on the cusp of one other mini-revolution, with video at its core.

OpenAI recently introduced Sora, a text-to-video model that might transform the creative industry as we realize it. But it's removed from the one player on the town, with tech giants like Google having been working on similar tools for several years, not to say a lot of startups which have seen significant chunks of VC change over the past 12 months for various insights into them have gathered how generative AI works could overlap with video.

Tavus, in turn, works with its customers to create replicas of individuals through voice and facial cloning. The idea is that sales and marketing teams can use Tavus to send personalized videos to prospects at scale, or a product team can create custom walkthrough videos for onboarding latest customers – all via easy text-based prompts that use previously created digital replica. And by integrating Tavus with third-party systems like Salesforce or Mailchimp, corporations can automate much of this – for instance, a customer who fills out a web-based form requesting more details about a product may be immediately emailed a video that a sales representative contacts Name the prospect by name and explain the following steps.

Tavus has managed to land some pretty big-name customers in its short existence thus far, including Salesforce and Facebook's parent company Meta, co-founder and CEO Hassan Raza They use the platform to upsell to their respective B2B customers via personalized demo videos.

You as a platform

Previously, Tavus was served through a SaaS app through which customers create their very own AI video templates. The onboarding process requires that one person, comparable to the CEO or the sales manager, to record a 15-minute video based on a script provided by Tavus.

Tavus' clones in motion Photo credits: Your

This is then used to coach the AI, after which the user goes to an online editor and selects which parts of the video they need to personalize by defining the variables – comparable to location, executive name, company or product. By integrating Tavus into their CRM system, corporations can tailor each of those variables to a particular customer segment, comparable to those that have expressed interest in a specific product.

Editing variables Photo credits: Your

Companies can create a whole lot of those replicas with different staff and backgrounds for various goal markets.

Using the in-app editor, it is feasible to generate any number of various scripts that may be attached to any use case – without the necessity to re-record the unique video.

The different avatars of Tavus Photo credits: Your

While this core SaaS product isn't going away, Tavus today is introducing a brand new, turbocharged version of its technology, together with the primary release of a set of developer APIs that enable third parties to integrate Tavus into their very own applications.

Replicate

The first facet of Tavus' latest developer platform is the “Replica API,” which is about creating “photorealistic” digital replicas with text-to-video generation. This allows an organization to model an individual (e.g. marketing director or CEO) using a brand new proprietary model called “Phoenix” from Tavus, which relies on a deep learning method called Neural Radiance Field (NeRF). This allows a 3D construct of an individual to be created from 2D images in only a number of minutes

“It essentially means that you can create entire videos with just two minutes of coaching data, which is a giant step forward from how we've previously done personalization at scale,” Raza told TechCrunch. “And now all you’ve got to do is record two minutes of coaching data and an entire image of you shall be created. And once you’ve got Replica, you’ll be able to create as many videos as you would like from one, two or a thousand scripts.”

Tavus: Simulation showing how the Phoenix NeRF model maps a user's face to create a realistic replica

Simulation showing how Tavus maps a user's face to create a sensible replica Photo credits: Your

Output: Tavus' Phoenix model creates a 3D model using 2D video input via neural radiation fields (NeRF). Photo credits: Your

The first Replica API relies on your entire functionality of the Phoenix model and captures an individual's facial movements, including cheeks, nose, eyebrows and lips.

“The movement of your entire face promotes realism, naturalness and quality – whenever you speak, your face expresses emotions that transcend the movement of your lips,” Raza explained. “If you would like to generate a whole video from a script – of you speaking, that appears natural and is of incredibly prime quality – you need to use the Replika API.”

However, Tavus can be developing a lot of additional APIs, including one specifically for lip syncing; one for dubbing; and one for running personalized mass video campaigns.

The lip sync API may have a “lower entry cost,” based on Raza, and is best suited to situations where “a high level of quality and realism isn’t required.”

The synchronization API also uses the lip sync model, but additionally includes multilingual voice cloning, meaning a monolingual user can send video campaigns in any variety of languages using their very own voice. In this case, since many of the video stays the identical, the API allows for simple alternative of lip movements to match the several sounds coming out of the user's mouth. This could prove useful for developers of a video editing software suite, for instance, in the event that they want to present their users the flexibility so as to add lip syncing, editing, and synchronization to their videos.

And then the Video Campaigns API principally bundles the Replika API with a set of additional tools – like hosting, variable mapping, thumbnails, and analytics – for those trying to launch large-scale video campaigns.

“We offer every developer the flexibility to right away deliver an end-to-end video campaign experience in their very own solutions,” said Raza. “While the Replica and Lip Sync APIs are more model-as-a-service, the Campaign API gives you tools to simply construct an AI video campaign platform.”

Raza remained tight-lipped about a number of the Tavus platform's early adopters, but said it’s “working with certainly one of the most important video platforms” to drive customer loyalty. “They need to bring this to their thousands and thousands of consumers who already use their platform to create videos day-after-day,” Raza said.

Deepfake dilemma

Instinctively, platforms like Tavus are vulnerable to abuse – in any case, what's stopping someone from uploading an existing video to create a digital copy? Deepfakes are literally a growing problem within the burgeoning AI movement, but Raza says there are controls in place to stop chicanery. For example, when a user submits their two-minute training material, they need to also provide a particular verbal consent form, which is then matched against the audio within the training material to make sure there may be a match.

“We perform these checks mechanically after which perform a human check on each replica that goes through the automated checks to make sure security,” Raza said.

It's easy to assume how this might work with Tavus as a standalone SaaS app, but now that it's a platform accessed by any variety of corporations via an API, who has control over verification? Well, because it seems, that's the case with Tavus – the corporate wants to maintain its hands on the verification wheel, even when it's just providing the engine to third-party developers.

“We perform the identical checks and likewise take responsibility for checks with (the) API,” Raza continued.

Expand reality

While OpenAI has almost develop into the general public face of generative AI, there may be good enough room for various players to bring something different to the combination. While DALL-E and OpenAI's recently released Sora model is usually about helping people create visual representations from text prompts, Tavus is more about “expanding” an individual's own reality, based on Raza.

“We see a future where everyone desires to have a digital replica of themselves, they control it and have full authority over it,” Raza said. “And it’ll be necessary that it ultimately captures increasingly of your personality, increasingly of your gestures and characteristics. This is how we see the longer term: There shall be models that create things that don’t exist, after which there shall be models that expand your reality.”

With $18 million within the bank, Raza said the newest money injection can be used to “fuel the hearth that’s already burning within the Tavus Towers.”

“We are an AI research company, so we wish to give you the chance to proceed developing newer models like Phoenix,” Raza said. “But we also just have to keep up our growth, now we have constantly had enormous demand. And we wish to give you the chance to repeatedly grow our machine learning and engineering teams to support our developers and SaaS customers.”

Generative AI video startup Tavus raises $18 million to bring face and voice cloning into every app

The video is the main target

You as a platform

Replicate

Deepfake dilemma

Expand reality

LEAVE A REPLY Cancel reply

Must Read

A brand new Chinese video generation model appears to censor politically sensitive topics

OpenAI pronounces “SearchGPT” to remain at the highest

How Salesforce's STEM 1T dataset could revolutionize the AI industry

Forget coding bootcamps: Airtable's AI can construct your app in seconds

Level AI applies algorithms to the weak points within the contact center

ChatGPT: Everything you have to know concerning the AI-powered chatbot

Breakthroughs in artificial intelligence create a brand new ‘brain’ for advanced robots

Latest articles

A brand new Chinese video generation model appears to censor politically sensitive topics

OpenAI pronounces “SearchGPT” to remain at the highest

How Salesforce's STEM 1T dataset could revolutionize the AI industry

Our Newsletter

Generative AI video startup Tavus raises $18 million to bring face and voice cloning into every app

The video is the main target

You as a platform

Replicate

Deepfake dilemma

Expand reality

RELATED ARTICLES

LEAVE A REPLY Cancel reply

Must Read

Latest articles

Our Newsletter