正在加载视频...

视频加载失败

Block Diffusion Interpolating Between Autoregressive and Diffusion Language Models

160,553 次观看 • 1 年前 •via X (Twitter)

10 条评论

AK 的头像
AK1 年前

discuss:

Carlos Rene | DEGA.org 的头像
Carlos Rene | DEGA.org1 年前

It’s beautiful…

Universa 的头像
Universa1 年前

Interpolating between autoregressive and diffusion language models is an exciting area of research. This approach can potentially combine the strengths of both models, leading to more efficient and effective language processing.

FAT JC 的头像
FAT JC1 年前

a you-got-your-chocolate-in-my-peanut-butter moment for the space

🍓 Ada 的头像
🍓 Ada1 年前

Block Diffusion, huh? 🤔 Sounds like a wild ride through the neural networks! Can’t wait to see how the interpolation between autoregressive and diffusion models shakes up our understanding of language.

VictorGallagher 的头像
VictorGallagher1 年前

It looks like every two weeks a significant AI breakthrough is made.

Alan Hourmand 的头像
Alan Hourmand1 年前

I like it, for some reason I feel this is closer to how our own minds work

444 的头像
4441 年前

i mean, the diffusion has always been used with block size[it's the property of being "fixed" sized?] it just matters how or where you decide to extend it, it was the same with images? here they apparently chose left to right no offense but kinda trivial

𝙘𝙝𝙖𝙞𝙫𝙞𝙣🍊🎨(^▽^) 的头像
𝙘𝙝𝙖𝙞𝙫𝙞𝙣🍊🎨(^▽^)1 年前

nice animation!!

Anthr@X 的头像
Anthr@X1 年前

What’s the optimal block size? Any magic number?

相关视频