This is probably the most complex workflow I’ve ever built using only open-source tools. It takes four inputs: author, title, and style; and generates a full animated visual story in one click in ComfyUI. I worked on it for four days. There are...

56,518 views • 8 months ago • via X (Twitter)

Related videos

Seedance 2.0 is allowing us to enter a new era of music video creation. Here is how I created HONEY. It was a quick test to see how well this workflow holds up. 🐝
1 - Write your song and generate the music with Suno 5.5.
2 - Use an image generator of your choice. For HONEY I combined Grok Imagine for aesthetics and Nano Banana Pro for refined editing.
3 - In CapCut, I import my audio and save out a blank video containing the audio. This step is important because this video file will now be used with Seedance 2.0 as a video reference with Omni. This allows the AI to apply automatic, realistic lipsync and movement to the music. It's extremely powerful!
4 - Once I have both my image and my video with audio as references, I use Seedance 2.0 Omni and upload my starting image and then the video reference with the audio.
5 - From here I'm simply prompting like normal: specifying what's happening in my scene with detailed instructions, mentioning multi-shots and camera angle changes, and then specifying that the person is singing along to the song. I type out the lyrics that are present to get better lipsync accuracy.
6 - Once I have generated a video and like the result, I do video-to-video: I upload the video that was just generated, type "The scene continues", and prompt new actions to take place. This allows you to expand on a narrative. These new shots can be used as B-roll, and since I uploaded my video as a reference, I have full consistency with everything it saw in the video. This is also extremely powerful.
7 - This is actually the most difficult part: edit in CapCut. This is where you need to understand pacing and shot selection across all the scenes you generated to bring it all together. You must be strategic with the editing.
Good luck! I'll probably record a video tutorial at some point, as it's easier to see what is being done.
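Step 3 (a blank video that only carries the audio track) does not have to be done in CapCut. Here is a minimal sketch, assuming ffmpeg is installed and that a black 1280x720 picture at 24 fps is acceptable as the blank video. The helper name `blank_video_command`, the file names, the resolution, and the codecs are illustrative assumptions, not settings from the original post.

```python
import shlex


def blank_video_command(audio_path, output_path, size="1280x720", fps=24):
    """Build an ffmpeg command that wraps an audio file in a black video.

    Hypothetical helper: size, fps, and codecs are assumptions,
    not values taken from the original post.
    """
    return [
        "ffmpeg",
        "-f", "lavfi",                            # read from a synthetic source
        "-i", f"color=c=black:s={size}:r={fps}",  # endless stream of black frames
        "-i", audio_path,                         # the generated song
        "-shortest",                              # stop when the audio ends
        "-c:v", "libx264",
        "-pix_fmt", "yuv420p",                    # widely compatible pixel format
        "-c:a", "aac",
        output_path,
    ]


cmd = blank_video_command("honey.mp3", "honey_reference.mp4")
print(shlex.join(cmd))
# To actually render the file: subprocess.run(cmd, check=True)
```

The function only builds the argument list, so it can be inspected before anything is rendered. The resulting file would then be uploaded as the video reference described in step 4.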

Travis Davids

18,334 views • 1 month ago

🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in entirely new possibilities for casual creators and creative professionals alike. More details and examples of what Movie Gen can do ➡️
🛠️ Movie Gen models and capabilities
Movie Gen Video: A 30B parameter transformer model that can generate high-quality, high-definition images and videos from a single text prompt.
Movie Gen Audio: A 13B parameter transformer model that can take a video input along with optional text prompts for controllability to generate high-fidelity audio synced to the video. It can generate ambient sound, instrumental background music and foley sound, delivering state-of-the-art results in audio quality, video-to-audio alignment and text-to-audio alignment.
Precise video editing: Using a generated or existing video and accompanying text instructions as input, it can perform localized edits such as adding, removing or replacing elements, or global changes like background or style changes.
Personalized videos: Using an image of a person and a text prompt, the model can generate a video with state-of-the-art results on character preservation and natural movement in video.
We’re continuing to work closely with creative professionals from across the field to integrate their feedback as we work towards a potential release. We look forward to sharing more on this work and the creative possibilities it will enable in the future.

AI at Meta

2,263,572 views • 1 year ago

I asked Garry Tan how to use meta prompting to get better at AI:
"My partners at YC, Jared Friedman and Pete Koomen, showed me how to do this. You can take almost anything that you do all the time and just drop it into a context window. And then say, 'Here’s a bunch of inputs and outputs.' And maybe you also add a bunch of notes. And then you tell it, 'Write me a prompt that can act as an agent that takes this input and makes this output over here.' You can do this for almost any type of knowledge work. And you can even introspect: 'What are things you notice that I did to convert this from the input to the output?' And then you can just start using the prompt. Initially, it’s going to suck, because it’s just not that smart yet.
But what’s funny is now, I also use it to iterate on my writing. You can be very direct: 'I would never say that', 'Don’t say it like this', or 'Oh, you used the long word there, use the short word'. Just speak to it conversationally. And then when you're happy with the output, you can use that new output to make a new prompt: 'Based on this conversation, give me a better initial prompt that incorporates all the things we talked about.' And you can do this with literally everything. In theory, it applies to so much of what people do day-to-day. You could use it for tweets. You could use it for editing podcasts. You can use it for pretty much everything.
I have a folder of prompts that I use all the time. My YouTube prompt is on v27 or something. I'll go through this process with all the different max models. I'll use GPT 5.2 Pro. I’ll use Grok. I'll use Claude. Then, I’ll take all the outputs from all the models and put them into Claude and say, 'Here’s my prompt, here’s the output from four LLMs, including yourself. Rate each response and tell me what the pros and cons of each approach are.' And I usually say, 'Give it to me in numbered form.' And then you can agree with one, disagree with two, tell it three is this or that.
And then after that, you say, 'Given all of this, synthesize it.'"
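The two prompts described in the quote (the meta-prompt built from input/output examples, and the cross-model comparison prompt) can be assembled with plain string formatting. This is a rough sketch: the function names `build_meta_prompt` and `build_comparison_prompt` and the exact section labels are assumptions for illustration, and no model API is called.

```python
def build_meta_prompt(examples, notes=""):
    """Assemble a meta-prompt from (input, output) example pairs.

    Hypothetical helper; the section labels are an illustrative choice.
    """
    parts = ["Here's a bunch of inputs and outputs."]
    for i, (source, result) in enumerate(examples, start=1):
        parts.append(f"Input {i}:\n{source}\nOutput {i}:\n{result}")
    if notes:
        parts.append(f"Notes:\n{notes}")
    parts.append(
        "Write me a prompt that can act as an agent that takes this kind "
        "of input and makes this kind of output."
    )
    return "\n\n".join(parts)


def build_comparison_prompt(prompt, responses):
    """Ask one model to rate the answers several models gave to a prompt."""
    parts = [
        f"Here's my prompt:\n{prompt}",
        f"Here's the output from {len(responses)} LLMs, including yourself.",
    ]
    for model_name, text in responses.items():
        parts.append(f"Response from {model_name}:\n{text}")
    parts.append(
        "Rate each response and tell me what the pros and cons of each "
        "approach are. Give it to me in numbered form."
    )
    return "\n\n".join(parts)


meta = build_meta_prompt(
    [("raw interview transcript", "edited tweet thread")],
    notes="Use the short word, not the long word.",
)
print(meta)
```

The returned string is what would be pasted into the context window. Feedback from the conversation can be folded into `notes` on the next iteration, which mirrors the versioned-prompt habit mentioned in the quote.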

The Peel

51,632 views • 3 months ago

Here's my first series of Anime created by Generative AI. Vidu 2.0 is a groundbreaking advancement for storytelling in the Anime style. #vidu @Viduforhuman
Hi everyone 😊 😉 Here's my first AI-created Anime series. The name of my anime is Kuro & Yuki. I won't spoil the story for you, but it begins with these two boys locked up in what appears to be a highly secure institute/prison. Yuki has reached the age where he has awakened his power. While wandering the corridors of the institute, he meets Kuro, a boy with autism. It's a fleeting encounter as they end up being separated. As the story unfolds, you'll understand who they really are and why they are locked up. I don't plan on making episode 2 yet, even though I've grown attached to my characters' story. I'll continue the story only if a lot of people are interested and want to know what happens next. Otherwise, I'll explore other universes and styles of Anime 😉
Tools:
- Epidemic Sound (for sound effects)
- Vidu AI (to transform frames into animation)
- ElevenLabs (for certain expressions)
When I say Vidu 2.0 is a game-changer, I mean it. I had already made my animation with the previous version of Vidu, but when they gave me access to the beta, I urgently changed my plans and recreated all the animations. Let me tell you, it's a whole new level. My first animation was ultra frustrating to bring my ideas to life! Really! And since Vidu couldn't handle many image styles, I clearly couldn't bring my ideas to life. But with Vidu 2.0, I enjoy it much more! For the first time, I really get to bring my ideas to life! Until now, I always had to make compromises. Of course, Vidu is still not perfect, and there are still many obstacles, but it is a truly magnificent advancement! (And it surpasses all the Anime-style AIs I've tested recently.) Moreover, Vidu 2.0 is fantastic for special effects, but also for embedding elements into the video (like a hand that appears and interacts with the character, or another character; and the best part is that they perfectly match the style of the original images!)
* The animations/images and voices are AI-generated
* The SoundDesign is traditional and done by me
* The story was entirely written by me (I don't use AI to create my stories, it's important to me to write them myself)
The voices were generated thanks to Nijivoice (for secondary characters) and Hailuo Audio (for main characters). I thank Yachimat (yachimat - AI Short Anime) for introducing me to Nijivoice, it's very generous of him! During the beta, we didn't consume credits, otherwise, this animation would have cost me around 20,000 credits, ha ha.
Vidu 2.0, with its superb stability and fidelity to the style of images, offers brand new horizons for storytelling! One problem with Vidu 2.0 is that for now, you have to manually extend videos (by exporting the last frame). As you can see, this allowed me to create long scenes with different actions, and everything fits together perfectly! There are still many obstacles to storytelling, such as character consistency (I use the image to video function; I produce my images with Nijijourney) and it's always laborious to have a consistent character! The same goes for backgrounds.
Here are the points I've identified that would facilitate storytelling:
For Vidu:
- Function to extend videos (you can already do this manually by exporting the last frame)
- Function to invent the beginning or end of a video using a single frame (you can already do this manually by putting a completely white or completely black image as the first or last frame; this technique also allows you to have very dynamic results with Vidu 2.0!)
- It's difficult to obtain facial expressions for certain image styles (the same image style that the previous version couldn't handle at all).
- More dynamism
There's always room for improvement, but Vidu 2.0 has truly opened up exciting new possibilities for creative storytelling.
ai aiart aianimation aianime ainews anime animenews aitools aitool
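The two manual workarounds mentioned in this post (exporting the last frame to extend a clip, and using a completely white or black image as the first or last frame) can be scripted outside of Vidu. This is a minimal sketch that assumes ffmpeg is installed. The helper names, file names, and the 1280x720 size are illustrative assumptions, and nothing here uses a Vidu API.

```python
def last_frame_command(video_path, image_path):
    """Build an ffmpeg command that saves the final frame of a video.

    Hypothetical helper. It seeks to one second before the end and keeps
    overwriting the same image, so the last decoded frame is what remains.
    """
    return [
        "ffmpeg",
        "-sseof", "-1",   # start reading 1 second before the end of the input
        "-i", video_path,
        "-update", "1",   # write every frame to the same output file
        "-q:v", "2",      # high JPEG quality
        image_path,
    ]


def solid_frame_command(color, image_path, size="1280x720"):
    """Build an ffmpeg command that renders one solid-color image."""
    return [
        "ffmpeg",
        "-f", "lavfi",
        "-i", f"color=c={color}:s={size}",  # synthetic solid-color source
        "-frames:v", "1",                   # keep a single frame
        image_path,
    ]


print(" ".join(last_frame_command("scene_01.mp4", "scene_01_last.jpg")))
print(" ".join(solid_frame_command("white", "white.png")))
# To run either command: subprocess.run(command, check=True)
```

The exported last frame would then serve as the starting image for the next generation, and the solid frame as the placeholder first or last frame.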

Naegiko

65,610 views • 1 year ago

Sora 2 has the capability to make an entire anime episode, given time and effort. This is a turning point for generative AI models, and it is terrifying to think what is next to come. The difference between Sora and Sora 2 is absolutely staggering.
Yes, this looks cool. I put the time into it to make it as close to art as possible. This was not simple to make, and it was not just prompting and throwing it in some editor. Every frame is generated from another scene, and even the reference images for characters I used to start this project were AI-generated. This was all created from 10-second clips pulled apart and chunked out into the final product. Regardless of the imagery you see, I did not directly use any artist's art for this video.
This 10-minute video was generated over the course of a week from more than 700 text prompts. It was built on technology trained by scraping the uncredited, uncompensated contributions of countless human artists and animators. The creation of this single video consumed an enormous amount of energy and water, equivalent to powering a home for days and requiring hundreds of liters of fresh water for cooling. But that is nothing. In the 10 minutes you spend watching this, the global network of AI video generators will create over 6,250 more short videos. The combined energy required for that 10 minutes of global creation is enough to power an average household for nearly 2 years.
- It's a double-edged sword. While AI uses an immense amount of energy and has clear moral issues with scraping the hard work of artists, it also provides a medium for those who have potentially spent their lives attempting to draw out the ideas in their heads and failing to grasp them.
My sister is an artist. I grew up always attempting to draw but never could get the image in my head onto paper. I've spent the better part of 20 years trying to teach myself through watching videos and practice; however, I don't have steady hands and frankly have just been unable to make any vision come to life. Which brings me to the conflict here. It would have been a dream of mine to be able to get frames created and flesh out the story in my head. To make this video professionally would cost an absurd amount of money. A small studio by itself would be over a million dollars just to start up. To hire a studio would most likely be over 200k. I would love to make this story I have in my head through legit and traditional means. I would love to start a Kickstarter to get funding and hire artists, voice actors, and a production team. However, I know that would most likely be an impossible reach that would further fuel the hate.
Regardless, I understand the worry this causes. The fear this produces. However, we can't just ignore where AI is currently at and just tell people not to use it or, even worse, threaten people who use it. People will always use the shiny new toy in front of them, so the real question is how do we either make it work for us and work alongside it, or how do we ACTUALLY implement a method to protect art rather than tell people not to use it.
Respect your artists. Review their ToS and don't upload their hard work to a model without permission.

LUͦʷCͦK 🔜 ??

17,966 views • 7 months ago