Loading video...

Video Failed to Load

Go Home

Accepted by #CVPR2023! X-Decoder is the FIRST generalist decoder that supports all segmentation tasks (ins/sem/pano/ref) in OPEN VOCABULARY, both inter- AND intra-image VL tasks, and even helps instruct image inpainting/editing! New demo below and more at

51,930 views • 3 years ago •via X (Twitter)

6 Comments

Jianwei Yang's profile picture
Jianwei Yang3 years ago

This project was led by our two wonderful interns @xueyanzou1, @ZiYiDou! With joint mentorship from @zhegan4, @LINJIEFUN, @ChunyuanLi, Xiyang Dai, @HarkiratBehl, Jianfeng Wang, and senior advisory from Violet Peng, Lu Yuan, Lijuan Wang, @yong_jae_lee and @JianfengGao0217!

Akarsh G's profile picture
Akarsh G3 years ago

Used your instruct demo. Still not perfect.

Naoto Usuyama's profile picture
Naoto Usuyama3 years ago

Congrats!

Dan Benyamin (Æ)'s profile picture
Dan Benyamin (Æ)3 years ago

Cc @levelsio

Akarsh G's profile picture
Akarsh G3 years ago

How is it different from pix2pix?

Jianwei Yang's profile picture
Jianwei Yang3 years ago

We used our x-decoder as a plug into the original pix2pix to make the edit more grounded.

Related Videos