We developed a simple, sample-efficient online RL technique for post-training image generation models. We see it as a possible steerable alternative to classifier-free guidance (CFG), driven by any scalar reward, including human preference.
63,255 views • 1 month ago • via X (Twitter)
