Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale blog: Large-scale generative models such as GPT and DALL-E have revolutionized natural language processing and computer vision research. These models not only generate high fidelity text or image outputs, but are also generalists which can solve tasks not explicitly taught. In...
429,294 views • 3 years ago • via X (Twitter)
10 Comments

atharva • 3 years ago
meta ai shipping like crazy

Clinton Williams • 3 years ago
@MetaAI cooking over there! Smart not to open source this one even though I want it for videos and my newsletter.

たこゆず🦑 • 3 years ago
The names are similar.

Pranav • 3 years ago
Meta got no chill

pixlflip • 3 years ago
This looks rather promising. Almost makes me like Facebook

SrLOL • 3 years ago
If the community can't compile it, no like.

🕊 • 3 years ago
Is it in the GitHub?

Saquib Mehmood • 3 years ago
GPT Summarize: "Voicebox is a versatile text-guided generative model for speech, trained on 50K hours of unfiltered speech. It can perform tasks like text-to-speech synthesis, noise removal, content editing, and style conversion. Voicebox outperforms VALL-E by up to 20 times."

Yudha Rebel Heart • 3 years ago
Would be very useful for dubbing

Salim Faraji Nyendwa • 3 years ago
@SaveToNotion #tweet #NewStack


