Today we released Meta Spirit LM, our first open source multimodal language model that freely mixes text and speech. Many existing AI voice experiences today use ASR techniques to process speech before synthesizing with an LLM to generate text, but these approaches compromise the expressive aspects...

351,674 views • 1 year ago • via X (Twitter)

10 Comments

AI at Meta • 1 year ago

More details, including links to the research paper, model weights and code 👇

$@®+#@|= • 1 year ago

Sounds ass ngl

floating point • 1 year ago

Non commercial and poor quality speech? Sad 😔

Leocifer • 1 year ago

europe

Tech Dev Notes • 1 year ago

The demo was a bit ...

BensenHsu • 1 year ago

The study introduces SpiRit-LM, a model that can generate both speech and text. It is based on continuously pre-training a text language model (Llama 2) with a combination of text-only, speech-only, and aligned speech-text datasets. SpiRit-LM performs well on speech and text comprehension tasks, matching or exceeding the performance of previous speech-only and text-only models. It can also learn new tasks in a few-shot setting, both within and across modalities (speech-to-text and text-to-speech). The SpiRit-LM-Expressive version is the first language model that can preserve the sentiment of text and speech prompts both within and across modalities. Full paper:

$Q*🍓on Ethereum • 1 year ago

Everything happening at once

Hamza • 1 year ago

this preview seems to lack somewhat

Risphere • 1 year ago

The quality isn't that good.

Qual • 1 year ago

I love you, but this demo was... well, let's just say it had a rough start! At first, I thought my speakers were broken because there was no sound for a few seconds. Then, when the sound finally kicked in, I was like, "Yep, my speakers are definitely broken!"
