Video wird geladen...
Video konnte nicht geladen werden
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale blog: Large-scale generative models such as GPT and DALL-E have revolutionized natural language processing and computer vision research. These models not only generate high fidelity text or image outputs, but are also generalists which can solve tasks not explicitly taught. In... show more
429,143 Aufrufe • vor 3 Jahren •via X (Twitter)
10 Kommentare

atharvavor 3 Jahren
meta ai shipping like crazy

Clinton Williamsvor 3 Jahren
@MetaAI cooking over there! Smart not to open source this one even though I want it for videos and my newsletter.

たこゆず🦑vor 3 Jahren
名前が似ている。

Pranavvor 3 Jahren
Meta got no chill

pixlflipvor 3 Jahren
This looks rather promising. Almost makes me like Facebook

SrLOLvor 3 Jahren
No compilable para la comunidad no like

🕊vor 3 Jahren
Is it in the GitHub?

Saquib Mehmoodvor 3 Jahren
GPT Summarize: "Voicebox is a versatile text-guided generative model for speech, trained on 50K hours of unfiltered speech. It can perform tasks like text-to-speech synthesis, noise removal, content editing, and style conversion. Voicebox outperforms VALL-E by up to 20 times."

Yudha Rebel Heartvor 3 Jahren
Would be very useful for dubbing

Salim Faraji Nyendwavor 3 Jahren
@SaveToNotion #tweet #NewStack

