正在加载视频...
视频加载失败
MVDiffusion: How to take a pre-trained text2image model for a perspective view (e.g., Stable Diffusion) and retrain to generate multiple consistent views (e.g., a panorama). Project site: Hugging Face demo: Code out in a month.
8 条评论

Yasutaka Furukawa3 年前
I am very sorry. Typo fixing. Had to delete the old one and retweet.

Yasutaka Furukawa3 年前
I had to delete and repost. Highly appreciate it if you could reshare/retweet this post

Ziyu Wan3 年前
Awesome work!!!!! I wonder where we could find the paper😊

Yasutaka Furukawa3 年前
Thank you Ziyu. Did not realize that we haven't uploaded to arxiv yet... Asking students to upload in a day.

Jake Harrison3 年前
I would add to my newsletter

Philipp Tsipman3 年前
🔥

Asriel H3 年前
does it support image prompt as input or only text prompt?

Yasutaka Furukawa3 年前
Only text for this work. But it seems trivial to add image-prompt capability.
