
Xuanchi Ren
@xuanchi13 • 1,409 subscribers
Senior Research Scientist @NVIDIAAI. PhD @UofTCompSci. Working on GenAI and world models
Shorts
Videos

The latent-vs-pixel debate misses the point. GPT Image 2 shows what users notice: pixel-level fidelity. Latent models show what scales: compact semantic structure. We connect them by replacing VAE/RAE decoders with a Pixel Diffusion Decoder. Code and Model available: 🧵(1/N)
Xuanchi Ren667,957 görüntüleme • 10 gün önce

🚀Excited to introduce GEN3C #CVPR2025, a generative video model with an explicit 3D cache for precise camera control. 🎥It applies to multiple use cases, including single-view and sparse-view NVS🖼️ and challenging settings like monocular dynamic NVS and driving simulation🚗. Project page:
Xuanchi Ren59,920 görüntüleme • 1 yıl önce
Daha fazla içerik yok.