Accepted by #CVPR2023! X-Decoder is the FIRST generalist decoder that supports all segmentation tasks (ins/sem/pano/ref) in OPEN VOCABULARY, both inter- AND intra-image VL tasks, and even helps instruct image inpainting/editing! New demo below and more at

51,930 views • 3 years ago • via X (Twitter)

6 comments

Jianwei Yang • 3 years ago

This project was led by our two wonderful interns @xueyanzou1, @ZiYiDou! With joint mentorship from @zhegan4, @LINJIEFUN, @ChunyuanLi, Xiyang Dai, @HarkiratBehl, Jianfeng Wang, and senior advisory from Violet Peng, Lu Yuan, Lijuan Wang, @yong_jae_lee and @JianfengGao0217!

Akarsh G • 3 years ago

Used your instruct demo. Still not perfect.

Naoto Usuyama • 3 years ago

Congrats!

Dan Benyamin (Æ) • 3 years ago

Cc @levelsio

Akarsh G • 3 years ago

How is it different from pix2pix?

Jianwei Yang • 3 years ago

We used our X-Decoder as a plug-in to the original pix2pix to make the edits more grounded.
