
Haotian Ye
@haotian_yeee • 1,064 subscribers
CS PhD student at @Stanford | BS at Peking University @PKU1898. Working on Generative AI | Prev @nvidia @GoogleResearch

🤔Want a principled way to RL your diffusion model? Check Data-regularized Reinforcement Learning (DDRL)! Post-train NVIDIA #Cosmos World Foundation models with a million GPU hours! 🤯 Novel formulation ➡️ Theoretically integrates SFT into RL ➡️ Robust to Reward Hacking 🛑 Details: #DDRL #Diffusion #RL #NVIDIA #Cosmos
Haotian Ye • 77,504 views • 6 months ago