
Jun-Yan Zhu
@junyanz89 • 13,346 subscribers
Assistant Professor at Generative Intelligence Lab @CMU_Robotics @CarnegieMellon. Understanding and creating pixels.
Videos

We've released the code for LegoGPT. This autoregressive model generates physically stable and buildable designs from text prompts, by integrating physics laws and assembly constraints into LLM training and inference. This work is led by PhD students Ava Pun, Kangle Deng, Ruixuan Liu, and in collaboration with CMU faculty Changliu Liu and Deva Ramanan. LegoGPT is a small first step towards the ultimate goal of generative manufacturing of physical objects. Our implementation is limited to 20x20x20 dimensions, 21 object categories, and simple brick types, but we are working on scaling it up! Code: Website: Demo:
Jun-Yan Zhu38,582 görüntüleme • 1 yıl önce

[1/2] We’ve released the code for #pix2pixturbo and #CycleGANTurbo. These conditional GANs are able to adapt a text-to-image model such as SD-Turbo for both paired and unpaired image translation with a single step (0.11 sec on A100 and 0.29 sec on A6000). Try our code and the Gradio demo. Paper: Code: Demo: This is a joint work with Gaurav Parmar (the leading author), Taesung Park, and Srinivasa Narasimhan. This work shows that a pre-trained one-step model can be easily adapted to conditional GANs frameworks for downstream image editing and synthesis tasks. #Edges2Cats
Jun-Yan Zhu36,473 görüntüleme • 2 yıl önce
Daha fazla içerik yok.