Realistic Virtual Voices Through AI

Overview

Resemble AI is a rising company creating compelling, realistic, and unique voice narration using AI techniques. The distributed team of 11, based out of Toronto, is motivated by their founding ideal — creating high-quality voice characters should be an automated, streamlined process. With today’s technological capabilities, it shouldn’t be so time-consuming and inconvenient to create an alternative voice to Amazon’s Alexa.

Background

Traditionally, creating a new AI voice is a manual process that requires recording a person’s speech in a studio setting and repeating this process any time major changes are needed. This is a cumbersome, expensive process that makes it difficult to iterate quickly. Furthermore, differentiating automated voice styles is becoming an increasingly important part of a brand’s identity and marketing: while most brands have always had unique visual symbols, nowadays, they often also want audio identities, much like that of larger brands.

As an initial proof of concept, Resemble AI allows cloning a voice from just 5 minutes of recorded audio data. Adding additional training data allows the voice model to become successively better. 

resemble solution box
resemble ai quote
laptop audio waves

Results

With Spell, Resemble AI was able to cut their costs significantly. Compared with Amazon SageMaker or Google Cloud Platform ML, both of which demanded high costs for the short, yet frequent training runs they depend on, integrating Spell into the team’s workflows was a much more cost-effective solution that enabled the high-powered capabilities required to perform their frequent runs at a low cost.

They noted that with Google or Amazon, users end up structuring code to adapt to how the cloud platform wants it set up, which creates latency, makes it very painful to migrate, and requires much overhead to tackle.

In contrast, Spell alleviates much of the complexity of Cloud development, taking care of many of these moving pieces so that engineers don’t have to themselves. They also save a lot of time through having a much more intuitive UI on Spell, compared to that of other Cloud platforms.

With their streamlined workflow, Resemble AI hopes to work on scaling concurrency and reducing latency so they can continue to get results to their customers as quickly as possible. Their business is growing quickly, and Spell’s MLOps infrastructure has proven to be a pivotal factor to the success of their production jobs.

Streamline Machine Learning Projects with Spell

Schedule an in-depth demonstration with a Spell representative to learn how Spell can help streamline and accelerate your machine learning development.