Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Dreamix: Video Diffusion Models are General Video Editors abs: project page: present diffusion-based method that is able to perform text-based motion and appearance editing of general videos

398,132 Aufrufe • vor 3 Jahren •via X (Twitter)

10 Kommentare

Profilbild von Abi
Abivor 3 Jahren

pre: "You won't believe your eyes!" post: "You can't believe your eyes!"

Profilbild von Rekt Adult (🩸,🩸)
Rekt Adult (🩸,🩸)vor 3 Jahren

where/when/how can someone use it? they didn't include the code with the github page

Profilbild von Minute Movies
Minute Moviesvor 3 Jahren

excuse me what

Profilbild von William Lamkin
William Lamkinvor 3 Jahren

Nice find. 3D + Motion and editing tools seem to be the subject of the next wave of papers behind audio-based GenAI models

Profilbild von Stern - uɹǝʇS
Stern - uɹǝʇSvor 3 Jahren

@CoffeeVectors @TomLikesRobots @GanWeaving as soon as models of this fidelity ad coherence arrive publicly …… wow

Profilbild von Jason Murphy
Jason Murphyvor 3 Jahren

Amazing!

Profilbild von Vic J
Vic Jvor 3 Jahren

@OliverLaufer

Profilbild von Bernard Bontemps
Bernard Bontempsvor 3 Jahren

@Francescu

Profilbild von Leo Enin 🍰
Leo Enin 🍰vor 3 Jahren

Absolutely astonishing! Imagine graphic tools 10 years from now. Now imagine video games and movies in 20 years, given the progress we got over the last 20. And I'm not even talking about other industries here. The world is rapidly changing, and it's beautiful.

Profilbild von Guilherme
Guilhermevor 3 Jahren

@BaixaEssaPorra

Ähnliche Videos

InstantDrag Improving Interactivity in Drag-based Image Editing discuss: Drag-based image editing has recently gained popularity for its interactivity and precision. However, despite the ability of text-to-image models to generate samples within a second, drag editing still lags behind due to the challenge of accurately reflecting user interaction while maintaining image content. Some existing approaches rely on computationally intensive per-image optimization or intricate guidance-based methods, requiring additional inputs such as masks for movable regions and text prompts, thereby compromising the interactivity of the editing process. We introduce InstantDrag, an optimization-free pipeline that enhances interactivity and speed, requiring only an image and a drag instruction as input. InstantDrag consists of two carefully designed networks: a drag-conditioned optical flow generator (FlowGen) and an optical flow-conditioned diffusion model (FlowDiffusion). InstantDrag learns motion dynamics for drag-based image editing in real-world video datasets by decomposing the task into motion generation and motion-conditioned image generation. We demonstrate InstantDrag's capability to perform fast, photo-realistic edits without masks or text prompts through experiments on facial video datasets and general scenes. These results highlight the efficiency of our approach in handling drag-based image editing, making it a promising solution for interactive, real-time applications.

AK

71,201 Aufrufe • vor 1 Jahr