
Grace Luo
@graceluo_ • 2,223 subscribers
phd student @berkeley_ai, vision + language
Videos

✨New preprint: Dual-Process Image Generation! We distill *feedback from a VLM* into *feed-forward image generation*, at inference time. The result is flexible control: parameterize tasks as multimodal inputs, visually inspect the images with the VLM, and update the generator.🧵
Grace Luo133,297 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar