Now there are lot of AI tools for Audio already, but when it comes to Voice Synthesis and Audio narration, probably one of the best tools right now is Elevenlabs, which just went through couple of updates that I want to talk about in this review today.
So what exactly Elevenlabs can do right now? Well there are couple of key features here what they offer right now both free/paid version.
- Text To Speech – Over 20 different voices with 29 different languages available from English to Chinese to Japanese, there is great selection of things here for lot of people around the world
- Speech to Speech – You can upload your own voice file and it will be narrated by another person this will also clean up the existing audio file so if you have background noise and things like that this will alter it to a more higher quality
- Sound Effects – Give a prompt for any type of object or event and it will try to create sounds close to that anywhere from window breaking, explosions, humming or nature sounds, while this tool is bit more hit and miss sometimes it does come now and then give great results
- Dubbing Studio – Have a video or audio clip that is in certain language like English and want it to translate it to Spanish? Well its possible with this tool, it keeps your sound very closely similar when dubbed, even though there might be some errors on the actual translation depending on the input
- Voice Isolator (NEW) – If you recorded on a bad location or just have bad background noise, this tool just cleans everything up and makes the audio actually great
There are also some features which are behind a paywall like Voiceover Studio and Audio Native, which have their own uses. Luckily most of the essential stuff is free to use with registration credits, so unless your heavy user you can play around plenty for free.
New Video to Sound Feature
The new feature that Elevenlabs recently released was “Video to Sound” which is fully open source that you can check in their GitHub. This perhaps can be considered bit more niche type of a tool, but it has lot of functionality as we are seeing influx of new Video AI tools that obviously do not come with any sound, so instead of generating lot of background music and try to find stock audio for your clips.
Based on the testing i did with this new tool i have to say it gives mixed results out of the 4 prompts it gives out i usually find at least one decent, but the quality of the audio itself may need some post-tuning.
Creating like Ocean waves or even sounds for an abstract intro can yield decent results sometimes, so im quite impressed by that – plus there isn’t that many limitations right now on the length of the clips so you should abuse this situation much as possible, because this is the only tool with this feature that i know.
Let’s talk about Pricing
I would argue that Elevenlabs is quite generous when it comes to their freemium model – first of all you get lot of tools that are included on free version and 10,000 credits which includes around 10 minutes of audio, now if you are like dubbing movies or TV-shows for instance that’s not obviously enough, but lets say a YouTube video that can at least get you 1-2 videos done.
- The next tier starts at 5$/month which is still relatively good price contrasted to services like AI Image generation or AI Video generation which usually start around that 8-12$ ballpark for their cheapest models, but then again do we value these things at same price? Well that’s up to you.
- Creator tier at 11$/month and that comes along with 2h of audio per month which is 4x more than the previous tier, also this gives you more higher quality audio via their API
- Pro Tier is bit expensive I have to say at 99$/month giving you 10h which btw is less than previous tier if you would buy it separately, but this then again offers lot more higher quality of audio then again
- Lastly we have the Scale, which gives you 400h of audio for 300$ and that’s actually lot of time in my opinion for any large scale project out there even for industrial clients.
When it comes to pricing I think the lower tier plans are quite generous and good, but perhaps i find the last two tiers slightly overpriced given out the ratio per dollar to audio minutes is bit off for my taste, but then again there is serious money to be made with technology like this.
Closing Thoughts about Elevenlabs
So there are lot of one trick pony audio services that don’t have much to offer, but Elevenlabs is bit more serious entry in the context that they have vast ratio of different Audio models, prompt types and obviously they keep building up more things on a frequent basis.
If you are looking for more Music generation I probably would recommend you Suno or something else, but in terms of pure Dubbing and Text to speech this tool is probably best options you have.