Accepted by #CVPR2023! X-Decoder is the FIRST generalist decoder that supports all segmentation tasks (ins/sem/pano/ref) in OPEN VOCABULARY, both inter- AND intra-image VL tasks, and even helps instruct image inpainting/editing! New demo below and more at
6 comments

Jianwei Yang · 3 years ago
This project was led by our two wonderful interns @xueyanzou1, @ZiYiDou! With joint mentorship from @zhegan4, @LINJIEFUN, @ChunyuanLi, Xiyang Dai, @HarkiratBehl, Jianfeng Wang, and senior advisory from Violet Peng, Lu Yuan, Lijuan Wang, @yong_jae_lee and @JianfengGao0217!

Akarsh G · 3 years ago
Used your instruct demo. Still not perfect.

Naoto Usuyama · 3 years ago
Congrats!

Dan Benyamin (Æ) · 3 years ago
Cc @levelsio

Akarsh G · 3 years ago
How is it different from pix2pix?

Jianwei Yang · 3 years ago
We used our X-Decoder as a plug-in to the original pix2pix to make the edits more grounded.

