Meta presents Video Editing via Factorized Diffusion Distillation

We introduce Emu Video Edit (EVE), a model that establishes a new state of the art in video editing without relying on any supervised video editing data. To develop EVE we separately train an image editing

AK

504,352 subscribers

115,597 views • 2 years ago • via X (Twitter)

7 comments

AK • 2 years ago

adapter and a video generation adapter, and attach both to the same text-to-image model. Then, to align the adapters towards video editing we introduce a new unsupervised distillation procedure, Factorized Diffusion Distillation. This procedure distills knowledge from one or

AK • 2 years ago

more teachers simultaneously, without any supervised data. We utilize this procedure to teach EVE to edit videos by jointly distilling knowledge to (i) precisely edit each individual frame from the image editing adapter, and (ii) ensure temporal consistency among the

AK • 2 years ago

edited frames using the video generation adapter. Finally, to demonstrate the potential of our approach in unlocking other capabilities, we align additional combinations of adapters
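
A toy sketch of the joint-distillation idea described in this abstract: a student is pulled toward two teachers at once, one judging each frame's edit and one judging consistency between neighbouring frames. Everything here is a hypothetical stand-in (frames as single floats, the functions `edit_teacher_loss` and `temporal_teacher_loss`, the weights, the finite-difference update); the post does not spell out the actual losses or training procedure.

```python
# Toy illustration only: each "frame" is one float and both teachers are
# hand-written quadratic losses, not real diffusion models.

def edit_teacher_loss(frames, target):
    """Stand-in for the image-editing teacher: every frame should match the edit target."""
    return sum((f - target) ** 2 for f in frames) / len(frames)

def temporal_teacher_loss(frames):
    """Stand-in for the video teacher: neighbouring frames should be similar."""
    return sum((a - b) ** 2 for a, b in zip(frames, frames[1:])) / (len(frames) - 1)

def joint_loss(frames, target, w_edit=1.0, w_temporal=1.0):
    """Weighted sum of both teacher signals (weights are assumptions)."""
    return (w_edit * edit_teacher_loss(frames, target)
            + w_temporal * temporal_teacher_loss(frames))

def distill(frames, target, steps=200, lr=0.1, eps=1e-5):
    """Gradient descent on the joint loss using forward finite differences."""
    frames = list(frames)
    for _ in range(steps):
        base = joint_loss(frames, target)
        grads = []
        for i in range(len(frames)):
            bumped = frames.copy()
            bumped[i] += eps
            grads.append((joint_loss(bumped, target) - base) / eps)
        frames = [f - lr * g for f, g in zip(frames, grads)]
    return frames

initial_frames = [0.0, 0.9, 0.1, 0.8]
student_frames = distill(initial_frames, target=0.5)
```

In this toy setting both objectives are satisfied by the same solution (all frames equal to the target), so the student converges toward it; with real models the two teachers can disagree, which is where the weighting would matter.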

AK • 2 years ago

paper page:

Uri Gil • 2 years ago

that is not what the term "video editing" usually refers to. It should be called video manipulation or something

Jing Gu • 2 years ago

Using two adapters to function for editing and video part. Good idea 👍

Simulacra Latens • 2 years ago

What is the edit? All I see is image swapping/IPAdapter style transfer which we already have?

Related videos

Today we’re sharing two new advances in our generative AI research: Emu Video & Emu Edit. Details ➡️ These new models deliver exciting results in high quality, diffusion-based text-to-video generation & controlled image editing w/ text instructions. 🧵

AI at Meta

798,183 views • 2 years ago

🕹️We are excited to introduce "ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation" ChronoEdit reframes image editing as a video generation task to encourage temporal consistency. It leverages a temporal reasoning stage that denoises with “video reasoning tokens” to "reason" on physically plausible edits. See the attached video for results. Project Page: Arxiv: Code and model are coming.

Huan Ling

36,802 views • 8 months ago

TurboEdit Instant text-based image editing discuss: We address the challenges of precise image inversion and disentangled image editing in the context of few-step diffusion models. We introduce an encoder based iterative inversion technique. The inversion network is conditioned on the input image and the reconstructed image from the previous step, allowing for correction of the next reconstruction towards the input image. We demonstrate that disentangled controls can be easily achieved in the few-step diffusion model by conditioning on an (automatically generated) detailed text prompt. To manipulate the inverted image, we freeze the noise maps and modify one attribute in the text prompt (either manually or via instruction based editing driven by an LLM), resulting in the generation of a new image similar to the input image with only one attribute changed. It can further control the editing strength and accept instructive text prompt. Our approach facilitates realistic text-guided image edits in real-time, requiring only 8 number of functional evaluations (NFEs) in inversion (one-time cost) and 4 NFEs per edit. Our method is not only fast, but also significantly outperforms state-of-the-art multi-step diffusion editing techniques.
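
The cost figures quoted in this abstract (8 NFEs for the one-time inversion, 4 NFEs per edit) imply a simple total when several edits are applied to the same inverted image. A small sketch of that arithmetic; the function name `total_nfes` is made up for illustration and is not part of TurboEdit.

```python
INVERSION_NFES = 8  # one-time cost per input image, as quoted in the abstract
EDIT_NFES = 4       # cost of each edit, as quoted in the abstract

def total_nfes(num_edits: int) -> int:
    """Total function evaluations for one inversion followed by num_edits edits."""
    if num_edits < 0:
        raise ValueError("num_edits must be non-negative")
    return INVERSION_NFES + EDIT_NFES * num_edits
```

For example, `total_nfes(3)` gives 20: the inversion is paid once, so the per-edit cost dominates as the number of edits grows.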

AK

16,062 views • 1 year ago

We do a little video editing

Lee (Greater)

13,243 views • 4 months ago

video editing is stuck in 2005, time for something new introducing diffusion the first infinite canvas for video and motion graphics like figma, but for editing

konstantinpaulus

128,573 views • 1 month ago

Exciting milestones in our generative AI research: Emu Video, which lets you create high quality videos from a text prompt, and Emu Edit, which enables detailed image editing based on your instructions. These new models are built on Emu, our foundation model for image generation and technology from them will underpin new creative features across our apps next year. Try it out: Emu Video: Emu Edit:

Boz

110,713 views • 2 years ago

is this the end of video editing 🤯 i found AI-powered video editing tool that do most of my videos creation with ease.... it's cursor for video editing

Fakhr

29,070 views • 8 months ago

Grok Imagine API just released A world-class video generation + video editing model Text-to-Video: Turn simple prompts into rich video clips with audio Image Generation + Editing: Bring ideas to life with visuals from scratch Video Editing Tools: Restyle scenes, add/remove props, control motion Best-in-Class Quality + Low Latency: Designed to deliver fast, cost-efficient results API pricing: Image input: $0.002 Video input : $0.01 Video output : $0.05

X Freeze

15,078 views • 4 months ago

we built Cursor for video editing

Timothy Wang

911,583 views • 1 year ago

Here’s what we are working on at Adobe Layered image editing 🤯 the future of AI image editing

Kris Kashtanova

59,769 views • 7 months ago

InstantDrag Improving Interactivity in Drag-based Image Editing discuss: Drag-based image editing has recently gained popularity for its interactivity and precision. However, despite the ability of text-to-image models to generate samples within a second, drag editing still lags behind due to the challenge of accurately reflecting user interaction while maintaining image content. Some existing approaches rely on computationally intensive per-image optimization or intricate guidance-based methods, requiring additional inputs such as masks for movable regions and text prompts, thereby compromising the interactivity of the editing process. We introduce InstantDrag, an optimization-free pipeline that enhances interactivity and speed, requiring only an image and a drag instruction as input. InstantDrag consists of two carefully designed networks: a drag-conditioned optical flow generator (FlowGen) and an optical flow-conditioned diffusion model (FlowDiffusion). InstantDrag learns motion dynamics for drag-based image editing in real-world video datasets by decomposing the task into motion generation and motion-conditioned image generation. We demonstrate InstantDrag's capability to perform fast, photo-realistic edits without masks or text prompts through experiments on facial video datasets and general scenes. These results highlight the efficiency of our approach in handling drag-based image editing, making it a promising solution for interactive, real-time applications.

AK

71,201 views • 1 year ago

Google announces Dreamix: a model that generates videos when given: - video + prompt (Video editing) - input images + prompt (Subject Driven Generation) - input image + prompt (Image-to-Video)

bleedingedge.ai

1,323,744 views • 3 years ago

we just built git for video editing.

Lucas Jin

2,035,550 views • 2 months ago

Introducing Higgsfield Canvas: a state-of-the-art image editing model. Paint products directly onto your image with pixel-perfect control. Say hi to your new go-to for product placement, editing, and layout! 👋🏻 Comment Canvas to get the full guide in the DM.

Higgsfield AI 🧩

2,627,127 views • 11 months ago

Bytedance drops an open-source Gemini Omni!!! Bernini is a new AI video generation + editing framework. > Edit videos with text prompts > Image/video references > Code available

⚡AI Search⚡

42,745 views • 10 days ago

A video editing tool, made by a successful YouTuber. We love to see it. 👏

Product Hunt 😸

12,113 views • 2 years ago

The editing of this video 👌

Interesting AF

725,367 views • 10 months ago

Introducing ChatGPT Images 2.0 A state-of-the-art image model that can take on complex visual tasks and produce precise, immediately usable visuals, with sharper editing, richer layouts, and thinking-level intelligence. Video made with ChatGPT Images

OpenAI

12,859,073 views • 1 month ago

We built Cursor for video editing (and it's live)

Sabba Keynejad

160,946 views • 1 year ago