Loading video...
Video Failed to Load
Accepted by #CVPR2023! X-Decoder is the FIRST generalist decoder that supports all segmentation tasks (ins/sem/pano/ref) in OPEN VOCABULARY, both inter- AND intra-image VL tasks, and even helps instruct image inpainting/editing! New demo below and more at
51,930 views • 3 years ago •via X (Twitter)
6 Comments

Jianwei Yang3 years ago
This project was led by our two wonderful interns @xueyanzou1, @ZiYiDou! With joint mentorship from @zhegan4, @LINJIEFUN, @ChunyuanLi, Xiyang Dai, @HarkiratBehl, Jianfeng Wang, and senior advisory from Violet Peng, Lu Yuan, Lijuan Wang, @yong_jae_lee and @JianfengGao0217!

Akarsh G3 years ago
Used your instruct demo. Still not perfect.

Naoto Usuyama3 years ago
Congrats!

Dan Benyamin (Æ)3 years ago
Cc @levelsio

Akarsh G3 years ago
How is it different from pix2pix?

Jianwei Yang3 years ago
We used our x-decoder as a plug into the original pix2pix to make the edit more grounded.

