Video yükleniyor...
Video Yüklenemedi
Introducing SeamlessM4T, the first all-in-one, multilingual multimodal translation model. This single model can perform tasks across speech-to-text, speech-to-speech, text-to-text translation & speech recognition for up to 100 languages depending on the task. Details ⬇️
592,704 görüntüleme • 2 yıl önce •via X (Twitter)
9 Yorum

Compared to cascaded approaches, SeamlessM4T's single system approach reduces errors & delays, increasing translation efficiency & quality, delivering state-of-the-art results. Want to see it for yourself, try the demo ➡️

We believe SeamlessM4T represents a significant breakthrough and as part of our open approach, today we're publicly releasing this work under a CC BY-NC 4.0 license so that others can continue to build on this important field of study. Get the code ⬇️

thank you!

Oh waifu, next two days timeline gonna be filled with quote tweets by ML Bros now. How will my tweets be able to find space on people's TL? I am sorry I don't know enough ML to quote tweet this and earn Elon buxx. now we are homeress.

SeamlessM4T = incredible 👏 As a reminder, this also sits on top of one of the most comprehensive, cutting-edge translation/language tech as well:

I tried it with Japanese. The results are...extremely bad. It translates everything literally.

Meta might as well turn into a research company honestly Much better. Great engineers and researchers. Open source contribution Everything else including zuck — out

Demo Lab is 🔥

Ffmpeg but for natural language.

