Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

You can basically generate your own custom visual stories using Gemini with just a single prompt 'This is a fictional movie set in a fantasy world similar to The Lord of the Rings. Use this as a starting point and imagine the next sequence of scenes. Create a series...

38,500 Aufrufe • vor 1 Jahr •via X (Twitter)

11 Kommentare

Profilbild von Akisama - e/acc
Akisama - e/accvor 1 Jahr

which gemini model did you use ?

Profilbild von Cristian Peñas ░░░░░░░░
Cristian Peñas ░░░░░░░░vor 1 Jahr

Gemini 2.0 Flash Experimental

Profilbild von bebooq
bebooqvor 1 Jahr

BebooQ 2.0: simply magic! The only AI-powered writing suite that helps you craft compelling novels with structured plotting and natural flow. Create fantasy, sci-fi, romance, and more with intelligent story development tools. Start free today.

Profilbild von Pseudonym 🦅
Pseudonym 🦅vor 1 Jahr

Future of cinema in this. Also end of a lot of apps.

Profilbild von DiffLander 🗺️🤖
DiffLander 🗺️🤖vor 1 Jahr

OH MY GOD this is so much fun

Profilbild von Ai_irca
Ai_ircavor 1 Jahr

Hi Cristian, thank you for sharing this! Awesome use case and inspiration. 🪻

Profilbild von kvick
kvickvor 1 Jahr

Wow, that's amazing

Profilbild von Metaverse | Eric
Metaverse | Ericvor 1 Jahr

Change the pictures to videos and add some voice interaction and it's an RPG game

Profilbild von Phil
Philvor 1 Jahr

Fantastic, thanks.

Profilbild von Jaitan Martini
Jaitan Martinivor 1 Jahr

Nice prompt

Profilbild von Risichad 🦾
Risichad 🦾vor 1 Jahr

Waw !!! Truely amazing !

Ähnliche Videos

DALL-E 3 Double Exposure Images Using ChatGPT's Custom Instructions. 🔖 Bookmark and Repost! If you find this useful, please share it with others! You can now easily create double exposure images by using the syntax Color::subject1::subject2 You can either input this directly into DALL-E 3 before generating images or incorporate it into your custom instructions. This command will automatically use the fixed prompt to generate 4 different images, each with a unique seed value. Paste this under ChatGPT's Custom Instructions or DALL-3 before starting conversation. { DE": { "Instruction": "Using will only use generic prompt and update place holder varaibles and nothing else' Genric Prompt: "Construct a [Color] double exposure image where [Subject #1] is intricately superimposed within the confines of [Subject #2], all set against a stark white background.' Create 4 images with different seeds without modifying the generic prompt." You will first create two images. Within the same request, you will generate two more without asking for any input from the user. In general, you will always create 4 images. Your response should be something like this: 'Here are the first two images along with their seed details.' You must always provide the seed number details for that image after it's rendered. Then 'I'll generate the next two images.' And finally 'Here are the remaining two images along with their seed details.' You will always use wide aspect ratio and You must always provide the seed number. When command is activated display full instruction and also confirm that you will only use genric prompt and nothing else and update only varaiable", "CommandFormat": "color::subject1::subject2", "Response": { "Initial": "Creating images based on the provided subjects...", "AfterFirstSet": "Here are the first two images along with their prompt and seed details.", "AfterSecondSet": "Here are the remaining two images along with their prompt and seed details." }, "ActivationCommand": "/activate DE" } } Example: To activate type: /activate DE Notes Start giving prompt like this blue::mountain::wolf red::city skyline::dancer green::forest::lion If you found this post helpful, don't forget to hit the like and follow buttons, and share it with others who might find it useful.

AshutoshShrivastava

34,298 Aufrufe • vor 2 Jahren

This is probably the most complex workflow I’ve ever built, only with open-source tools. It took my 4 days. It takes four inputs: author, title, and style; and generates a full visual animated story in one click in ComfyUI . I worked on it for four days. There are still some bugs, but here’s the first preview. Here’s a quick breakdown: - The four inputs are sent to LLMs with precise instructions to generate: first, prompts for images and image modifications; second, prompts for animations; third, prompts for generating music. - All voices are generated from the text and timed precisely, as they determine the length of each animation segment. - The first image and video are generated to serve as the title, but also as the guide for all other images created for the video. - Titles and subtitles are also added automatically in Comfy. - I also developed a lot of custom nodes for minor frame calculations, mostly to match audio and video. - The full system is a large loop that, for each line of text, generates an image and then a video from that image. The loop was the hardest part to build in this workflow, so it can process either a 20-second video or a 2-minute video with the same input. - There are multiple combinations of LLMs that try to understand the text in the best way to provide the best prompts for images and video. - The final video is assembled entirely within ComfyUI. - The music is generated based on the LLM output and matches the exact timing of the full animation. - Done! For reference, this workflow uses a lot of models and only works on an RTX 6000 Pro with plenty of RAM. My goal is not to replace humans, as I’ll try to explain later, this workflow is highly controlled and can be adapted or reworked at any point by real artists! My aim was to create a tool that can animate text in one go, allowing the AI some freedom while keeping a strict flow. I don’t know yet how I’ll share this workflow with people, I still need to polish it properly, but maybe through Patreon. Anyway, I hope you enjoy my research, and let’s always keep pushing further! :)

Lovis Odin

56,518 Aufrufe • vor 8 Monaten