Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale blog: Large-scale generative models such as GPT and DALL-E have revolutionized natural language processing and computer vision research. These models not only generate high fidelity text or image outputs, but are also generalists which can solve tasks not explicitly taught. In...

429,143 Aufrufe • vor 3 Jahren •via X (Twitter)

10 Kommentare

Profilbild von atharva
atharvavor 3 Jahren

meta ai shipping like crazy

Profilbild von Clinton Williams
Clinton Williamsvor 3 Jahren

@MetaAI cooking over there! Smart not to open source this one even though I want it for videos and my newsletter.

Profilbild von たこゆず🦑
たこゆず🦑vor 3 Jahren

名前が似ている。

Profilbild von Pranav
Pranavvor 3 Jahren

Meta got no chill

Profilbild von pixlflip
pixlflipvor 3 Jahren

This looks rather promising. Almost makes me like Facebook

Profilbild von SrLOL
SrLOLvor 3 Jahren

No compilable para la comunidad no like

Profilbild von 🕊
🕊vor 3 Jahren

Is it in the GitHub?

Profilbild von Saquib Mehmood
Saquib Mehmoodvor 3 Jahren

GPT Summarize: "Voicebox is a versatile text-guided generative model for speech, trained on 50K hours of unfiltered speech. It can perform tasks like text-to-speech synthesis, noise removal, content editing, and style conversion. Voicebox outperforms VALL-E by up to 20 times."

Profilbild von Yudha Rebel Heart
Yudha Rebel Heartvor 3 Jahren

Would be very useful for dubbing

Profilbild von Salim Faraji Nyendwa
Salim Faraji Nyendwavor 3 Jahren

@SaveToNotion #tweet #NewStack

Ähnliche Videos