正在加载视频...

视频加载失败

MVDiffusion: How to take a pre-trained text2image model for a perspective view (e.g., Stable Diffusion) and retrain to generate multiple consistent views (e.g., a panorama). Project site: Hugging Face demo: Code out in a month.

33,418 次观看 • 3 年前 •via X (Twitter)

8 条评论

Yasutaka Furukawa 的头像
Yasutaka Furukawa3 年前

I am very sorry. Typo fixing. Had to delete the old one and retweet.

Yasutaka Furukawa 的头像
Yasutaka Furukawa3 年前

I had to delete and repost. Highly appreciate it if you could reshare/retweet this post

Ziyu Wan 的头像
Ziyu Wan3 年前

Awesome work!!!!! I wonder where we could find the paper😊

Yasutaka Furukawa 的头像
Yasutaka Furukawa3 年前

Thank you Ziyu. Did not realize that we haven't uploaded to arxiv yet... Asking students to upload in a day.

Jake Harrison 的头像
Jake Harrison3 年前

I would add to my newsletter

Philipp Tsipman 的头像
Philipp Tsipman3 年前

🔥

Asriel H 的头像
Asriel H3 年前

does it support image prompt as input or only text prompt?

Yasutaka Furukawa 的头像
Yasutaka Furukawa3 年前

Only text for this work. But it seems trivial to add image-prompt capability.

相关视频

New short course: Building Code Agents with Hugging Face smolagents! Learn how to build code agents in this course, created in collaboration with Hugging Face, and taught by Thomas Wolf, its co-founder and CSO, and m_ric, Hugging Face’s Project Lead on Agents. Tool-calling agents use LLMs to generate multiple function calls sequentially to complete a complex sequence of tasks. They generate one function call, execute it, observe, reason, and decide what to do next. Code agents take a different approach. They consolidate all these calls into a single block of code, letting the LLM lay out an entire action plan at once, which can be executed efficiently to provide more reliable results. You’ll learn how to code agents using smolagents, a lightweight agentic framework from Hugging Face. Along the way, you’ll learn how to run LLM-generated code safely and develop an evaluation system to optimize your code agent for production. In detail, you’ll learn: - How agentic systems have evolved, gaining greater levels of agency over time—and why code agents are a next step. - How code agents write their actions in code. - When code agents outperform function-calling agents. - How to run code agents safely in your system using a constrained Python interpreter and sandboxing using E2B. - To trace, debug, and assess the code agent to optimize its behaviours for complex requests. - How to build a research multi-agent system that can find information online and organize it into an interactive report. By the end of this course, you’ll know how to build and run code agents using smolagents, and deploy them safely with a structured evaluation system in your projects. Please sign up here!

Andrew Ng

124,382 次观看 • 1 年前