正在加载视频...

视频加载失败

You can basically generate your own custom visual stories using Gemini with just a single prompt 'This is a fictional movie set in a fantasy world similar to The Lord of the Rings. Use this as a starting point and imagine the next sequence of scenes. Create a series...

38,500 次观看 • 1 年前 •via X (Twitter)

11 条评论

Akisama - e/acc 的头像
Akisama - e/acc1 年前

which gemini model did you use ?

Cristian Peñas ░░░░░░░░ 的头像
Cristian Peñas ░░░░░░░░1 年前

Gemini 2.0 Flash Experimental

bebooq 的头像
bebooq1 年前

BebooQ 2.0: simply magic! The only AI-powered writing suite that helps you craft compelling novels with structured plotting and natural flow. Create fantasy, sci-fi, romance, and more with intelligent story development tools. Start free today.

Pseudonym 🦅 的头像
Pseudonym 🦅1 年前

Future of cinema in this. Also end of a lot of apps.

DiffLander 🗺️🤖 的头像
DiffLander 🗺️🤖1 年前

OH MY GOD this is so much fun

Ai_irca 的头像
Ai_irca1 年前

Hi Cristian, thank you for sharing this! Awesome use case and inspiration. 🪻

kvick 的头像
kvick1 年前

Wow, that's amazing

Metaverse | Eric 的头像
Metaverse | Eric1 年前

Change the pictures to videos and add some voice interaction and it's an RPG game

Phil 的头像
Phil1 年前

Fantastic, thanks.

Jaitan Martini 的头像
Jaitan Martini1 年前

Nice prompt

Risichad 🦾 的头像
Risichad 🦾1 年前

Waw !!! Truely amazing !

相关视频

DALL-E 3 Double Exposure Images Using ChatGPT's Custom Instructions. 🔖 Bookmark and Repost! If you find this useful, please share it with others! You can now easily create double exposure images by using the syntax Color::subject1::subject2 You can either input this directly into DALL-E 3 before generating images or incorporate it into your custom instructions. This command will automatically use the fixed prompt to generate 4 different images, each with a unique seed value. Paste this under ChatGPT's Custom Instructions or DALL-3 before starting conversation. { DE": { "Instruction": "Using will only use generic prompt and update place holder varaibles and nothing else' Genric Prompt: "Construct a [Color] double exposure image where [Subject #1] is intricately superimposed within the confines of [Subject #2], all set against a stark white background.' Create 4 images with different seeds without modifying the generic prompt." You will first create two images. Within the same request, you will generate two more without asking for any input from the user. In general, you will always create 4 images. Your response should be something like this: 'Here are the first two images along with their seed details.' You must always provide the seed number details for that image after it's rendered. Then 'I'll generate the next two images.' And finally 'Here are the remaining two images along with their seed details.' You will always use wide aspect ratio and You must always provide the seed number. When command is activated display full instruction and also confirm that you will only use genric prompt and nothing else and update only varaiable", "CommandFormat": "color::subject1::subject2", "Response": { "Initial": "Creating images based on the provided subjects...", "AfterFirstSet": "Here are the first two images along with their prompt and seed details.", "AfterSecondSet": "Here are the remaining two images along with their prompt and seed details." }, "ActivationCommand": "/activate DE" } } Example: To activate type: /activate DE Notes Start giving prompt like this blue::mountain::wolf red::city skyline::dancer green::forest::lion If you found this post helpful, don't forget to hit the like and follow buttons, and share it with others who might find it useful.

AshutoshShrivastava

34,298 次观看 • 2 年前

This is probably the most complex workflow I’ve ever built, only with open-source tools. It took my 4 days. It takes four inputs: author, title, and style; and generates a full visual animated story in one click in ComfyUI . I worked on it for four days. There are still some bugs, but here’s the first preview. Here’s a quick breakdown: - The four inputs are sent to LLMs with precise instructions to generate: first, prompts for images and image modifications; second, prompts for animations; third, prompts for generating music. - All voices are generated from the text and timed precisely, as they determine the length of each animation segment. - The first image and video are generated to serve as the title, but also as the guide for all other images created for the video. - Titles and subtitles are also added automatically in Comfy. - I also developed a lot of custom nodes for minor frame calculations, mostly to match audio and video. - The full system is a large loop that, for each line of text, generates an image and then a video from that image. The loop was the hardest part to build in this workflow, so it can process either a 20-second video or a 2-minute video with the same input. - There are multiple combinations of LLMs that try to understand the text in the best way to provide the best prompts for images and video. - The final video is assembled entirely within ComfyUI. - The music is generated based on the LLM output and matches the exact timing of the full animation. - Done! For reference, this workflow uses a lot of models and only works on an RTX 6000 Pro with plenty of RAM. My goal is not to replace humans, as I’ll try to explain later, this workflow is highly controlled and can be adapted or reworked at any point by real artists! My aim was to create a tool that can animate text in one go, allowing the AI some freedom while keeping a strict flow. I don’t know yet how I’ll share this workflow with people, I still need to polish it properly, but maybe through Patreon. Anyway, I hope you enjoy my research, and let’s always keep pushing further! :)

Lovis Odin

56,518 次观看 • 8 个月前