Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

Testing LCM LORAs in an AnimateDiff & multi-controlnet workflow in ComfyUI. I was able to process this entire Black Pink music video as a single .mp4 input. The LCM lets me render at 6 steps (vs 20+) on my 4090 and uses up only 10.5 GB of VRAM. Here's... show more

CoffeeVectors

42,382 subscribers

182,419 просмотров • 2 лет назад •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

Комментарии: 10

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

Entire thing took 81 minutes to render 2,467 frames, so about 2 seconds per frame. This isn't including the time to extract the img sequence from video and gen the ControlNet maps. Used Zoe Depth and Canny ControlNets in SD 1.5 at 910 x 512. [2/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

Improving the output to give it a stronger style, more details & feel less rotoscope-ish, will require adjusting individual shots. But doing the entire video in one go lays down a rough draft for you to iterate on—build on fun surprises, troubleshoot problem areas. [3/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

For the input video I used every other frame in order to target 12 fps. [4/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

Here's a screen shot of how I added the LCM LORA. I went with the baked in VAE from the checkpoint. [5/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

Kept the prompt pretty generic to see how it would apply to all the various shots. [6/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

In the K Sampler, I used the LCM Sampler. You need to update to the latest version of ComfyUI to access it. [7/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

And here's how I arranged the nodes for multi-control net. [8/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

If you want to learn more about LCM LORAs, I mainly referred to @NerdyRodent’s tutorial. Go check it out! It speeds up all rendering in SD. It's not just for videos! [9/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

If you want to learn more about Animate Diff, go check @PurzBeats’ live stream videos! [10/11]

Фото профиля CoffeeVectors

CoffeeVectors2 лет назад

Lastly, shout out to @rainisto for giving me the idea to try this on a full music video, and @PurzBeats again for answering some of my questions about AnimateDiff! [11/11]

Похожие видео

A range of motion test applying LCM LORA-HotshotXL-ControlNet to a Metahuman input. Here I used Zoe Depth and Realistic Line Control Nets, but was also experimenting with OpenPose, etc. Still trying to get more expression in the mouth for visemes & stabilize the hairline.

A range of motion test applying LCM LORA-HotshotXL-ControlNet to a Metahuman input. Here I used Zoe Depth and Realistic Line Control Nets, but was also experimenting with OpenPose, etc. Still trying to get more expression in the mouth for visemes & stabilize the hairline.

CoffeeVectors

42,959 просмотров • 2 лет назад

6 hours of editing. 1 click. That’s the difference between doing it manually vs. building a reusable AI workflow in Magnific Spaces. I built my own video factory in 20 minutes. Here’s the breakdown 🧵

6 hours of editing. 1 click. That’s the difference between doing it manually vs. building a reusable AI workflow in Magnific Spaces. I built my own video factory in 20 minutes. Here’s the breakdown 🧵

Perks

19,448 просмотров • 1 месяц назад

Video to 70's Cartoon AI exploration. Created with #AnimateDiff and #IPAdapter to stylize input video for a retro cartoon look. 🎥 Result (10fps) vs input (30fps iPhone footage) #aivideo #stablediffusion #ComfyUI More imagery (and result video by itself) in thread!🧵

Video to 70's Cartoon AI exploration. Created with #AnimateDiff and #IPAdapter to stylize input video for a retro cartoon look. 🎥 Result (10fps) vs input (30fps iPhone footage) #aivideo #stablediffusion #ComfyUI More imagery (and result video by itself) in thread!🧵

Nathan Shipley

233,679 просмотров • 2 лет назад

Made this video with iPhone photos I took of my friend Stephanie that I used as keyframes in Luma! With the camera controls I can gen transitions between shots. I also built a custom web app in Next.js to help me speedramp and edit all the clips! Breakdown 🧵(1/18)

Made this video with iPhone photos I took of my friend Stephanie that I used as keyframes in Luma! With the camera controls I can gen transitions between shots. I also built a custom web app in Next.js to help me speedramp and edit all the clips! Breakdown 🧵(1/18)

CoffeeVectors

338,209 просмотров • 1 год назад

Built this free ComfyUI workflow using Qwen Image Edit to recreate the process of storyboarding. Direct entire scenes shot by shot with prompts, move cameras in 360° environments, and maintain character likeness without training LoRAs. Full tutorial + free workflow below!👇

Built this free ComfyUI workflow using Qwen Image Edit to recreate the process of storyboarding. Direct entire scenes shot by shot with prompts, move cameras in 360° environments, and maintain character likeness without training LoRAs. Full tutorial + free workflow below!👇

Mickmumpitz

25,447 просмотров • 7 месяцев назад

This video was made almost entirely by AI. I used ChatGPT to write a script, Midjourney to create reference images, Runway Gen-1 to apply the style of the images to my source video, and Boomy AI for the music. Workflow breakdown w/ comparisons in thread. 🧵

This video was made almost entirely by AI. I used ChatGPT to write a script, Midjourney to create reference images, Runway Gen-1 to apply the style of the images to my source video, and Boomy AI for the music. Workflow breakdown w/ comparisons in thread. 🧵

Nick St. Pierre

2,279,504 просмотров • 3 лет назад

Made this video (🎶) with a Midjourney v6 image! Started by upscaling/refining with Magnific.ai, pulled a Marigold Depth Map from that in ComfyUI, then used as a displacement map in Blender where I animated this camera pass with some relighting and narrow depth of field.🧵1/12

Made this video (🎶) with a Midjourney v6 image! Started by upscaling/refining with Magnific.ai, pulled a Marigold Depth Map from that in ComfyUI, then used as a displacement map in Blender where I animated this camera pass with some relighting and narrow depth of field.🧵1/12

CoffeeVectors

135,658 просмотров • 2 лет назад

Creator seungho__yeo ( IG ) built an entire anime-inspired 2D motion sequence - storyboard to final render in one hour. Inside Comfy. seungho__yeo created this entirely in a single workflow pipeline: → Poster art → Storyboard frames → Transitions + animation → Edit + final render What usually takes multiple apps was built in one tool.

Creator seunghoyeo ( IG ) built an entire anime-inspired 2D motion sequence - storyboard to final render in one hour. Inside Comfy. seunghoyeo created this entirely in a single workflow pipeline: → Poster art → Storyboard frames → Transitions + animation → Edit + final render What usually takes multiple apps was built in one tool.

ComfyUI

28,740 просмотров • 8 дней назад

Turn a multi-step process into a single workflow, powered by NVIDIA RTX Spark. See how an agent like Nous Research Hermes Agent helps transform a concept sketch into a photoreal render, connecting Rhino, ComfyUI, and Blender 🔶 into one seamless workflow so designers can stay focused on their vision.

Turn a multi-step process into a single workflow, powered by NVIDIA RTX Spark. See how an agent like Nous Research Hermes Agent helps transform a concept sketch into a photoreal render, connecting Rhino, ComfyUI, and Blender 🔶 into one seamless workflow so designers can stay focused on their vision.

NVIDIA RTX Spark

84,297 просмотров • 19 дней назад

Kling 3.0's Multi Shot feature is underrated. I built this ComfyUI workflow to have balance between creative control & automation. Here's how it works: Input images of product/character → Select total duration and # of scenes → LLM gens all prompts and timing → You refine manually → Generate multi scene video Workflow attached 👇

Kling 3.0's Multi Shot feature is underrated. I built this ComfyUI workflow to have balance between creative control & automation. Here's how it works: Input images of product/character → Select total duration and # of scenes → LLM gens all prompts and timing → You refine manually → Generate multi scene video Workflow attached 👇

rob - comfyui

15,191 просмотров • 3 месяцев назад

added sliding sampling [as in comfyui] to animatediff and zeroscope on

added sliding sampling [as in comfyui] to animatediff and zeroscope on

vadim epstein

62,214 просмотров • 2 лет назад

Eze vs Rice Arsenal's new LCM Notice: - The zones of reception & occupation - Difference in attacking intention

Eze vs Rice Arsenal's new LCM Notice: - The zones of reception & occupation - Difference in attacking intention

Wasi

102,377 просмотров • 10 месяцев назад

This is probably the most complex workflow I’ve ever built, only with open-source tools. It took my 4 days. It takes four inputs: author, title, and style; and generates a full visual animated story in one click in ComfyUI . I worked on it for four days. There are still some bugs, but here’s the first preview. Here’s a quick breakdown: - The four inputs are sent to LLMs with precise instructions to generate: first, prompts for images and image modifications; second, prompts for animations; third, prompts for generating music. - All voices are generated from the text and timed precisely, as they determine the length of each animation segment. - The first image and video are generated to serve as the title, but also as the guide for all other images created for the video. - Titles and subtitles are also added automatically in Comfy. - I also developed a lot of custom nodes for minor frame calculations, mostly to match audio and video. - The full system is a large loop that, for each line of text, generates an image and then a video from that image. The loop was the hardest part to build in this workflow, so it can process either a 20-second video or a 2-minute video with the same input. - There are multiple combinations of LLMs that try to understand the text in the best way to provide the best prompts for images and video. - The final video is assembled entirely within ComfyUI. - The music is generated based on the LLM output and matches the exact timing of the full animation. - Done! For reference, this workflow uses a lot of models and only works on an RTX 6000 Pro with plenty of RAM. My goal is not to replace humans, as I’ll try to explain later, this workflow is highly controlled and can be adapted or reworked at any point by real artists! My aim was to create a tool that can animate text in one go, allowing the AI some freedom while keeping a strict flow. I don’t know yet how I’ll share this workflow with people, I still need to polish it properly, but maybe through Patreon. Anyway, I hope you enjoy my research, and let’s always keep pushing further! :)

This is probably the most complex workflow I’ve ever built, only with open-source tools. It took my 4 days. It takes four inputs: author, title, and style; and generates a full visual animated story in one click in ComfyUI . I worked on it for four days. There are still some bugs, but here’s the first preview. Here’s a quick breakdown: - The four inputs are sent to LLMs with precise instructions to generate: first, prompts for images and image modifications; second, prompts for animations; third, prompts for generating music. - All voices are generated from the text and timed precisely, as they determine the length of each animation segment. - The first image and video are generated to serve as the title, but also as the guide for all other images created for the video. - Titles and subtitles are also added automatically in Comfy. - I also developed a lot of custom nodes for minor frame calculations, mostly to match audio and video. - The full system is a large loop that, for each line of text, generates an image and then a video from that image. The loop was the hardest part to build in this workflow, so it can process either a 20-second video or a 2-minute video with the same input. - There are multiple combinations of LLMs that try to understand the text in the best way to provide the best prompts for images and video. - The final video is assembled entirely within ComfyUI. - The music is generated based on the LLM output and matches the exact timing of the full animation. - Done! For reference, this workflow uses a lot of models and only works on an RTX 6000 Pro with plenty of RAM. My goal is not to replace humans, as I’ll try to explain later, this workflow is highly controlled and can be adapted or reworked at any point by real artists! My aim was to create a tool that can animate text in one go, allowing the AI some freedom while keeping a strict flow. I don’t know yet how I’ll share this workflow with people, I still need to polish it properly, but maybe through Patreon. Anyway, I hope you enjoy my research, and let’s always keep pushing further! :)

Lovis Odin

58,571 просмотров • 9 месяцев назад

I thought it was impossible to generate a 23-minute TV episode in 4 days until I saw the workflow. This crazy approach completely changes the way you create. I broke down the entire process into 10 simple steps. Bookmark this thread 🧵👇

I thought it was impossible to generate a 23-minute TV episode in 4 days until I saw the workflow. This crazy approach completely changes the way you create. I broke down the entire process into 10 simple steps. Bookmark this thread 🧵👇

PJ Ace

190,722 просмотров • 2 месяцев назад

Generate image, 3D, and video from a single sketch in one click in ComfyUI with AI! I’m excited to share a cutting-edge workflow I’ve developed that combines inside ComfyUI: Runway with fal. ai api (yes there is a node for it :) ), Stable AI’s stable fast 3D , powerful Flux, ControlNet, IPAdapter, Gemini LLm in ComfyUI. This innovation allows you to generate high-quality visuals and 3D models, reposition objects, and create videos—completely automated from a simple sketch input. Would you link a tutorial for it ? #AI #ComfyUI #3DModeling #Automation #CreativeTech

Generate image, 3D, and video from a single sketch in one click in ComfyUI with AI! I’m excited to share a cutting-edge workflow I’ve developed that combines inside ComfyUI: Runway with fal. ai api (yes there is a node for it :) ), Stable AI’s stable fast 3D , powerful Flux, ControlNet, IPAdapter, Gemini LLm in ComfyUI. This innovation allows you to generate high-quality visuals and 3D models, reposition objects, and create videos—completely automated from a simple sketch input. Would you link a tutorial for it ? #AI #ComfyUI #3DModeling #Automation #CreativeTech

Lovis Odin

36,789 просмотров • 1 год назад

Finally got #gaussiansplatting working! I’m really impressed by how well it captured the fine details of the sneaker fabric and shoelaces, as well as the text. This was generated from 75 DSLR photos I shot on a 24mm lens, took about 30 minutes to process on an RTX 3090, and is running at a smooth 60fps on the SIBR Viewer. Going to run more tests to see how this compares to some of my experiments in photogrammetry. I’m hoping a plugin for #UnrealEngine5 comes out soon! #nerf

Finally got #gaussiansplatting working! I’m really impressed by how well it captured the fine details of the sneaker fabric and shoelaces, as well as the text. This was generated from 75 DSLR photos I shot on a 24mm lens, took about 30 minutes to process on an RTX 3090, and is running at a smooth 60fps on the SIBR Viewer. Going to run more tests to see how this compares to some of my experiments in photogrammetry. I’m hoping a plugin for #UnrealEngine5 comes out soon! #nerf

CoffeeVectors

183,486 просмотров • 2 лет назад

Here's how to instantly render your animation as an .MP4 in Blender (and how to render transparent animations/images) (full video with more tips linked below)

Here's how to instantly render your animation as an .MP4 in Blender (and how to render transparent animations/images) (full video with more tips linked below)

Poole

45,840 просмотров • 2 месяцев назад

brain rotting, but the thoughts still racing // Testing img2vid lipsyncing with the InfiniteTalk model on WaveSpeedAI and a song I made in Suno v5. Love how it handles images with multiple faces and how it animates even when it doesn’t see a mouth. Quick tutorial in 🧵👇 (1/6)

brain rotting, but the thoughts still racing // Testing img2vid lipsyncing with the InfiniteTalk model on WaveSpeedAI and a song I made in Suno v5. Love how it handles images with multiple faces and how it animates even when it doesn’t see a mouth. Quick tutorial in 🧵👇 (1/6)

CoffeeVectors

29,987 просмотров • 7 месяцев назад

ComfyUI MCP server - A project from Pixelle AI Shoutout to Pixelle-MCP - an amazing open-source project that bridges ComfyUI with LLMs through the MCP protocol. Zero-code, talk to your LLM to iterate ComfyUI workflow! Highlights: - Full TISV (Text, Image, Sound/Speech, Video) support - Flexible deployment options - Easy setup with Docker or one-click scripts - Custom workflow tools in just a few steps 🧵👇

ComfyUI MCP server - A project from Pixelle AI Shoutout to Pixelle-MCP - an amazing open-source project that bridges ComfyUI with LLMs through the MCP protocol. Zero-code, talk to your LLM to iterate ComfyUI workflow! Highlights: - Full TISV (Text, Image, Sound/Speech, Video) support - Flexible deployment options - Easy setup with Docker or one-click scripts - Custom workflow tools in just a few steps 🧵👇

ComfyUI

27,873 просмотров • 10 месяцев назад

A trick to easily solve any HCF and LCM questions in CSAT

A trick to easily solve any HCF and LCM questions in CSAT

Legacy IAS

18,677 просмотров • 1 месяц назад