Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

Today we released Meta Spirit LM — our first open source multimodal language model that freely mixes text and speech. Many existing AI voice experiences today use ASR to techniques to process speech before synthesizing with an LLM to generate text — but these approaches compromise the expressive aspects...

351,674 görüntüleme • 1 yıl önce •via X (Twitter)

10 Yorum

AI at Meta profil fotoğrafı
AI at Meta1 yıl önce

More details, including links to the research paper, model weights and code 👇

$@®+#@|= profil fotoğrafı
$@®+#@|=1 yıl önce

Sounds ass ngl

floating point profil fotoğrafı
floating point1 yıl önce

Non commercial and poor quality speech? Sad 😔

Leocifer profil fotoğrafı
Leocifer1 yıl önce

europe

Tech Dev Notes profil fotoğrafı
Tech Dev Notes1 yıl önce

The demo was a bit ...

BensenHsu profil fotoğrafı
BensenHsu1 yıl önce

The study introduces S PI R IT -LM, a model that can generate both speech and text. It is based on continuously pre-training a text language model (L LAMA 2) with a combination of text-only, speech-only, and aligned speech-text datasets. S PI R IT -LM performs well on speech and text comprehension tasks, matching or exceeding the performance of previous speech-only and text-only models. It can also learn new tasks in a few-shot setting, both within and across modalities (speech-to-text and text-to-speech). The S PI R IT -LM-E XPRESSIVE version is the first language model that can preserve the sentiment of text and speech prompts both within and across modalities. full paper:

$Q*🍓on Ethereum profil fotoğrafı
$Q*🍓on Ethereum1 yıl önce

Everything happening at once

Hamza profil fotoğrafı
Hamza1 yıl önce

this preview seems to lack somewhat

Risphere profil fotoğrafı
Risphere1 yıl önce

The quality isn't that good.

Qual profil fotoğrafı
Qual1 yıl önce

I love you, but this demo was... well, let's just say it had a rough start! At first, I thought my speakers were broken because there was no sound for a few seconds. Then, when the sound finally kicked in, I was like, "Yep, my speakers are definitely broken!"

Benzer Videolar