正在加载视频...
视频加载失败
Most recent diffusion language model research (that I’ve seen) seems to be using masking as the noising process. It looks like, however, most closed-source models (Google Gemini Diffusion and possibly Inception Labs’ Mercury) use a different noising process, where instead of masking tokens, they replace them with different tokens... show more
40,331 次观看 • 4 个月前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里
