Loading video...
Video Failed to Load
Most recent diffusion language model research (that I’ve seen) seems to be using masking as the noising process. It looks like, however, most closed-source models (Google Gemini Diffusion and possibly Inception Labs’ Mercury) use a different noising process, where instead of masking tokens, they replace them with different tokens... show more
40,331 views • 5 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
