Accepted to #CVPR2023! X-Decoder is the FIRST generalist decoder that supports all segmentation tasks (instance/semantic/panoptic/referring) in OPEN VOCABULARY, both inter- AND intra-image VL tasks, and even helps with instruction-based image inpainting/editing! New demo below and more at
51,930 views • 3 years ago • via X (Twitter)
6 comments

Jianwei Yang • 3 years ago
This project was led by our two wonderful interns @xueyanzou1, @ZiYiDou! With joint mentorship from @zhegan4, @LINJIEFUN, @ChunyuanLi, Xiyang Dai, @HarkiratBehl, Jianfeng Wang, and senior advisory from Violet Peng, Lu Yuan, Lijuan Wang, @yong_jae_lee and @JianfengGao0217!

Akarsh G • 3 years ago
Used your instruct demo. Still not perfect.

Naoto Usuyama • 3 years ago
Congrats!

Dan Benyamin (Æ) • 3 years ago
Cc @levelsio

Akarsh G • 3 years ago
How is it different from pix2pix?

Jianwei Yang • 3 years ago
We used our X-Decoder as a plug-in to the original pix2pix to make the edits more grounded.

