Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Smol TTS keeps getting better! Introducing OuteTTS v0.2 - 500M parameters, multilingual with voice cloning! 🔥 > Multilingual - English, Chinese, Korean & Japanese > Cross platform inference w/ llama.cpp > Zero-shot voice cloning > Trained on 5 Billion audio tokens > Qwen 2.5 0.5B LLM backbone > Trained...

44,654 Aufrufe • vor 1 Jahr •via X (Twitter)

11 Kommentare

Profilbild von Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastavvor 1 Jahr

Check out the model weights and inference code base here:

Profilbild von Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastavvor 1 Jahr

llama.cpp compatible GGUFs:

Profilbild von Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastavvor 1 Jahr

OuteTTS GitHub:

Profilbild von Haorui He
Haorui Hevor 1 Jahr

Big Congrats!!! Another SOTA TTS model trained on Emilia after F5-TTS & MaskGCT! Try out:

Profilbild von Tommy Falkowski
Tommy Falkowskivor 1 Jahr

Just tested it out and the quality is very good. More importantly, the fact that you can generate speaker profiles is awesome! Will test it out some more and add it to my growing list of supported tts engines in my app 🤣

Profilbild von SkyTab
SkyTabvor 1 Jahr

Switch to SkyTab and get $5,000! A modern and sleek POS system with commercial-grade durability. 💪 ✅ $0 upfront costs ✅ Best in-class POS ✅ Local service & 24/7 support ✅ And much more! Make the switch today:

Profilbild von Umesh
Umeshvor 1 Jahr

This is improving so fast that I don't want to speak myself anymore. Just use this and get done 🤖

Profilbild von Fronesis
Fronesisvor 1 Jahr

Thank you for your work and for sharing insights! 🙌 Advancements like OuteTTS v0.2 showcase the rapid evolution of AI and its potential to empower global communities. 🚀 The future of #AI is bright, and collaborative innovation is key to unlocking its full potential!

Profilbild von Digital Doctor
Digital Doctorvor 1 Jahr

Are you saying you can voice CLONE on a R-Pi? Is that what you're saying????

Profilbild von 斎藤ただし, Tadashi Saito
斎藤ただし, Tadashi Saitovor 1 Jahr

The font of Japanese characters is wrong, it's for (maybe) Chinese. I hope you'll pay attention and respect to each of them when you are working for multilingual/multicultural things. (like your TTS engine itself does. Brilliant quality✨️)

Profilbild von Ahmed Mansour
Ahmed Mansourvor 1 Jahr

I tried to run it on HF. average inference time for 200 chars is >1 hour running on CPU. Why is this model so heavy?

Ähnliche Videos