EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine github: EmotiVoice is a powerful and modern open-source text-to-speech engine. EmotiVoice speaks both English and Chinese, with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including...
312,299 views • 2 years ago • via X (Twitter)
10 comments

Even the demo has low sound quality

I wonder when we'll have singing voice synthesis guided by text and midi notes of a lead sound.

@camenduru, would be awesome to have a Colab available using this Engine 🥹

Would it run on an M1 with 8 GB RAM?

I tried running it locally and didn't get much variation between emotion prompts. I tried different (English) voices, and happy/angry pretty much sounded the same most of the time. Maybe it works better with Chinese?

Author here. Thanks for your interest in the project. We will post a roadmap for future updates shortly.

Does it outperform Bark?

EmotiVoice sounds amazing, especially with its prompt-controlled feature. Gonna give it a try!

@Memdotai mem it

Should try
