Grok Imagine API just released: a world-class video generation + video editing model.
• Text-to-Video: Turn simple prompts into rich video clips with audio
• Image Generation + Editing: Bring ideas to life with visuals from scratch
• Video Editing Tools: Restyle scenes, add/remove props, control motion
• Best-in-Class Quality + Low Latency:...

X Freeze

135,596 subscribers

15,078 views • 4 months ago • via X (Twitter)

Related videos

Grok Imagine API now supports built-in video editing on top of powerful video generation. You can feed in your own videos and edit them directly with Grok Imagine, transforming or enhancing footage without traditional timeline editing. Developers can generate high-quality videos at scale with fast iteration and simple integration into existing workflows.
One API does it all:
• Text-to-video
• Image-to-video
• Edit existing videos directly
• Replace, remove, or add objects inside a video
• Built-in audio sync for every clip
Turn your ideas into finished videos at scale, through a single API.

X Freeze

19,054 views • 4 months ago

Google announces Dreamix: a model that generates videos when given:
- video + prompt (Video editing)
- input images + prompt (Subject Driven Generation)
- input image + prompt (Image-to-Video)

bleedingedge.ai

1,323,774 views • 3 years ago

Bring ideas to life with Grok Imagine. Introducing the fastest video and image generation experience.

Grok

1,053,169 views • 21 days ago

Bring ideas to life with Grok Imagine. Introducing the fastest video and image generation experience.

Grok

338,585 views • 11 days ago

Bring ideas to life with Grok Imagine. Introducing the fastest video and image generation experience.

Grok

260,862 views • 11 days ago

Bring ideas to life with Grok Imagine. Introducing the fastest video and image generation experience.

Grok

940,725 views • 16 days ago

Grok Imagine is now LIVE in Hedra on DAY 0! 🚀
Stunning photorealistic images & videos.
Lightning-fast generation.
Easy edits for elements, styles & more.
Text to Image. Image Editing. Text to Video. Image to Video.
All inside Hedra. Try it now.

Hedra

749,171 views • 4 months ago

1/2 Meet Wan2.7-Video: The Comprehensive Model for Controllable Video Storytelling! From single clips to full-scale narrative direction, we’ve built more than just a generator. We’ve built a director’s suite:
• Multimodal control over performance and style via text, image, audio, and video.
• Character customization with up to 5 reference inputs and voice profiles.
• Video editing with simple, intuitive instructions.
• Full-stack creative toolkit: generation, editing, cloning, restyling, continuation, and more.
• Sustained improvements in visual fidelity, motion stability, and prompt adherence.

Wan

25,555,427 views • 2 months ago

Wan2.1-VACE 14B & 1.3B are now natively supported in ComfyUI! This model from Wan brings all-in-one editing capability to your video generation:
🔹 Text-to-Video & Image-to-Video
🔹 Video-to-video (Pose & depth control)
🔹 Inpainting & Outpainting
🔹 Character + object reference

ComfyUI

21,517 views • 1 year ago

TLDR: Meet ✨Lumiere✨ our new text-to-video model from Google AI! Lumiere is designed to create entire clips in just one go! Seamlessly opening up possibilities for many applications: Image-to-video 🖼️ Stylized generation 🖌️ Video editing 🪩 and beyond. See 🧵👇

Hila Chefer

252,577 views • 2 years ago

Complete story created entirely with Grok Imagine 4.20:
- Image, image editing, image reference
- Video, images for video reference, video extension
I did in a few hours what used to take me weeks, and at much better quality!

Déborah

57,945 views • 3 months ago

Today we’re sharing two new advances in our generative AI research: Emu Video & Emu Edit. Details ➡️ These new models deliver exciting results in high quality, diffusion-based text-to-video generation & controlled image editing w/ text instructions. 🧵

AI at Meta

798,204 views • 2 years ago

Wan2.5: Let Sound Take the Director’s Chair! 🎬 Today, we’re excited to unveil another major feature in our powerful Wan 2.5 Preview: Native Audio-Driven Video Generation. ✨ Now you can use audio input directly for both text-to-video and image-to-video generation. Combine audio with text prompts or a reference image to shape your video's narrative. ✨ With support for videos up to 10 seconds and enhanced video quality, unlock a richer visual space where more engaging stories come to life.

Wan

52,088 views • 8 months ago

Bytedance drops an open-source Gemini Omni!!! Bernini is a new AI video generation + editing framework.
> Edit videos with text prompts
> Image/video references
> Code available

⚡AI Search⚡

43,132 views • 20 days ago

Thanks to the Quality mode of Grok Imagine, I was able to make this video, which I couldn't do before.
Image: Quality mode of Grok Imagine
Editing image: Grok Imagine
Animation with extension: Grok Imagine

Déborah

10,719 views • 2 months ago

🕹️We are excited to introduce "ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation" ChronoEdit reframes image editing as a video generation task to encourage temporal consistency. It leverages a temporal reasoning stage that denoises with “video reasoning tokens” to "reason" on physically plausible edits. See the attached video for results. Project Page: Arxiv: Code and model are coming.

Huan Ling

36,841 views • 8 months ago

Grok Imagine also lets you edit video by adding, removing, or swapping objects with precision. Change props, remove distractions, or refine a scene while keeping everything consistent. No reshoots, no complicated editing tools. Edit video as easily as editing text.

DogeDesigner

61,587 views • 4 months ago

The first truly open-source audio-video model. LTX-2 is a DiT-based foundation model with all core video generation capabilities in one unified model. Designed to run locally on consumer GPUs.
- text-to-video
- image-to-video
- and video-to-video modes
100% open-source.

Akshay 🚀

66,012 views • 5 months ago

Most AI video tools still feel like traditional editors with a few AI features added in. You still end up adjusting timelines, fixing frames, and spending time editing. But Hailuo AI felt different to me. I gave it a simple prompt and one image, and it generated a full video on its own. No complicated workflow, no manual editing, just idea to video in minutes. The text-to-video results are surprisingly good, image-to-video transitions look smooth, and the overall output feels cinematic. What I liked most is the speed. You can quickly test ideas without getting stuck in editing. Try it here: #Hailuo

Markandey Sharma

32,326 views • 1 month ago

InstantDrag: Improving Interactivity in Drag-based Image Editing
Drag-based image editing has recently gained popularity for its interactivity and precision. However, despite the ability of text-to-image models to generate samples within a second, drag editing still lags behind due to the challenge of accurately reflecting user interaction while maintaining image content. Some existing approaches rely on computationally intensive per-image optimization or intricate guidance-based methods, requiring additional inputs such as masks for movable regions and text prompts, thereby compromising the interactivity of the editing process. We introduce InstantDrag, an optimization-free pipeline that enhances interactivity and speed, requiring only an image and a drag instruction as input. InstantDrag consists of two carefully designed networks: a drag-conditioned optical flow generator (FlowGen) and an optical flow-conditioned diffusion model (FlowDiffusion). InstantDrag learns motion dynamics for drag-based image editing in real-world video datasets by decomposing the task into motion generation and motion-conditioned image generation. We demonstrate InstantDrag's capability to perform fast, photo-realistic edits without masks or text prompts through experiments on facial video datasets and general scenes. These results highlight the efficiency of our approach in handling drag-based image editing, making it a promising solution for interactive, real-time applications.

AK

71,201 views • 1 year ago