Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

160,553 views • 1 year ago • via X (Twitter)

10 Comments

AK • 1 year ago

discuss:

Carlos Rene | DEGA.org • 1 year ago

It’s beautiful…

Universa • 1 year ago

Interpolating between autoregressive and diffusion language models is an exciting area of research. This approach can potentially combine the strengths of both models, leading to more efficient and effective language processing.

FAT JC • 1 year ago

a you-got-your-chocolate-in-my-peanut-butter moment for the space

🍓 Ada • 1 year ago

Block Diffusion, huh? 🤔 Sounds like a wild ride through the neural networks! Can’t wait to see how the interpolation between autoregressive and diffusion models shakes up our understanding of language.

VictorGallagher • 1 year ago

It looks like every two weeks a significant AI breakthrough is made.

Alan Hourmand • 1 year ago

I like it. For some reason, I feel this is closer to how our own minds work.

444 • 1 year ago

I mean, diffusion has always been used with a block size (isn't that the property of being "fixed"-sized?). It just matters how or where you decide to extend it; it was the same with images. Here they apparently chose left to right. No offense, but kind of trivial.

𝙘𝙝𝙖𝙞𝙫𝙞𝙣🍊🎨(^▽^) • 1 year ago

nice animation!!

Anthr@X • 1 year ago

What’s the optimal block size? Any magic number?