
Grace Luo
@graceluo_ • 2,223 subscribers
phd student @berkeley_ai, vision + language
Videos

✨New preprint: Dual-Process Image Generation! We distill *feedback from a VLM* into *feed-forward image generation*, at inference time. The result is flexible control: parameterize tasks as multimodal inputs, visually inspect the images with the VLM, and update the generator.🧵
Grace Luo • 133,297 views • 1 year ago