Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

160,553 views • 1 year ago • via X (Twitter)

10 Comments

AK • 1 year ago

discuss:

Carlos Rene | DEGA.org • 1 year ago

It’s beautiful…

Universa • 1 year ago

Interpolating between autoregressive and diffusion language models is an exciting area of research. This approach can potentially combine the strengths of both models, leading to more efficient and effective language processing.

FAT JC • 1 year ago

a you-got-your-chocolate-in-my-peanut-butter moment for the space

🍓 Ada • 1 year ago

Block Diffusion, huh? 🤔 Sounds like a wild ride through the neural networks! Can’t wait to see how the interpolation between autoregressive and diffusion models shakes up our understanding of language.

VictorGallagher • 1 year ago

It looks like every two weeks a significant AI breakthrough is made.

Alan Hourmand • 1 year ago

I like it. For some reason, I feel this is closer to how our own minds work.

444 • 1 year ago

I mean, diffusion has always been used with a block size (is that the property of being "fixed"-size?). It just matters how or where you decide to extend it. It was the same with images? Here they apparently chose left to right. No offense, but kind of trivial.

𝙘𝙝𝙖𝙞𝙫𝙞𝙣🍊🎨(^▽^) • 1 year ago

nice animation!!

Anthr@X • 1 year ago

What’s the optimal block size? Any magic number?
