
steven
@Tu7uruu • 2,302 subscribers
whispering to neural networks @Huggingface
Shorts
Videos

Meet the new Qwen3-TTS (2025-11-27): a major step forward in lifelike voice generation! > 49+ distinctive voices ranging from playful to authoritative, giving creators precise control over personality and style. > Global language coverage with 10 languages and multiple authentic dialects, including Minnan, Wu, Cantonese, Sichuan, Beijing, Nanjing, Tianjin, and Shaanxi. > Human-level delivery with adaptive rhythm, pacing, and intonation that makes speech sound genuinely performed.
steven43,845 views • 6 months ago

Just released on Hugging Face: Vui, a 100M open-source NotebookLM! 3 models: > Vui.BASE is the base checkpoint trained on 40k hours of audio conversations > Vui.ABRAHAM is a single speaker model that can reply with context awareness. > Vui.COHOST is checkpoint with two speakers that can talk to each other. It clones voices, breathes, uhs, [laughs] — even non-speech sounds. Human-like TTS is here!
steven43,595 views • 1 year ago

Just dropped on HF: Supertonic TTS, a blazing fast speech model. 🤯 > RTF as low as 0.001 on RTX4090 > Runs on-device (no latency, full privacy) > 66M params + 8+ language SDKs > Browser, mobile, edge — it just works. > You can adjust inference steps, batch processing, and other parameters to match your specific needs! > Open-source. Ready to build with!
steven17,010 views • 6 months ago
No more content to load