Loading video...

Video Failed to Load

There was a problem loading this video. This could be due to a temporary network issue or the video might be unavailable.

You can basically generate your own custom visual stories using Gemini with just a single prompt 'This is a fictional movie set in a fantasy world similar to The Lord of the Rings. Use this as a starting point and imagine the next sequence of scenes. Create a series... of separate images, each depicting a distinct moment in the story, presented in a sequential order like a storyboard. Include quotes for each image to narrate the events happening within it. Ensure the flow between images is consistent and logical, with each one styled like a cinematic movie shot that advances the narrative. Maintain the same format and visual style across all images to keep them cohesive'show more

Cristian Peñas ░░░░░░░░

10,739 subscribers

38,500 views • 1 year ago •via X (Twitter)

Science & Technology Arts

Anya Rossi• Live Now

Private livecam show

11 Comments

Akisama - e/acc1 year ago

which gemini model did you use ?

Cristian Peñas ░░░░░░░░1 year ago

Gemini 2.0 Flash Experimental

bebooq1 year ago

BebooQ 2.0: simply magic! The only AI-powered writing suite that helps you craft compelling novels with structured plotting and natural flow. Create fantasy, sci-fi, romance, and more with intelligent story development tools. Start free today.

Pseudonym 🦅1 year ago

Future of cinema in this. Also end of a lot of apps.

DiffLander 🗺️🤖1 year ago

OH MY GOD this is so much fun

Ai_irca1 year ago

Hi Cristian, thank you for sharing this! Awesome use case and inspiration. 🪻

kvick1 year ago

Wow, that's amazing

Metaverse | Eric1 year ago

Change the pictures to videos and add some voice interaction and it's an RPG game

Phil1 year ago

Fantastic, thanks.

Jaitan Martini1 year ago

Nice prompt

Risichad 🦾1 year ago

Waw !!! Truely amazing !

Related Videos

When creating ingredient images, try to create stylistically consistent images. If you use input images that are too disparate, the output may lead to a cut scene. For example, when generating a reference image, try using a prompt like: “Create the next scene in this storyboard, now show a roadside diner with the same look and feel as the original image.”

When creating ingredient images, try to create stylistically consistent images. If you use input images that are too disparate, the output may lead to a cut scene. For example, when generating a reference image, try using a prompt like: “Create the next scene in this storyboard, now show a roadside diner with the same look and feel as the original image.”

Google AI

23,526 views • 9 months ago

Here's a movie trailer from ChatGPT Images 2.0. Create a storyboard style image with Images 2.0, and use it in Seedream 2.0 as a reference. Enter a prompt to create a 15 second video from the image. The model will use the character and environments. Quick tutorial:

Here's a movie trailer from ChatGPT Images 2.0. Create a storyboard style image with Images 2.0, and use it in Seedream 2.0 as a reference. Enter a prompt to create a 15 second video from the image. The model will use the character and environments. Quick tutorial:

Jerrod Lew

15,768 views • 3 months ago

One prompt. Multiple gaming scenes 🎮 The Nano Banana agent can generate several images at once, using a reference image to keep the same person consistent across scenes. This example spans different gaming eras, but the same approach works for any theme or style. Images and videos, all in one place. Agent + prompt 👇

One prompt. Multiple gaming scenes 🎮 The Nano Banana agent can generate several images at once, using a reference image to keep the same person consistent across scenes. This example spans different gaming eras, but the same approach works for any theme or style. Images and videos, all in one place. Agent + prompt 👇

GLIF

10,882 views • 5 months ago

I was able to create a prompt that helps me generate bulks of images using flow agent. this is a 61 image I generated in one go just by using one character image and it used the style across every 61 prompt And everything was done on mobile.. Yt automators this is for you😎

I was able to create a prompt that helps me generate bulks of images using flow agent. this is a 61 image I generated in one go just by using one character image and it used the style across every 61 prompt And everything was done on mobile.. Yt automators this is for you😎

Peter Buildweb | Al

20,596 views • 20 days ago

Collaborative Score Distillation for Consistent Visual Synthesis paper page: Generative priors of large-scale text-to-image diffusion models enable a wide range of new generation and editing applications on diverse visual modalities. However, when adapting these priors to complex visual modalities, often represented as multiple images (e.g., video), achieving consistency across a set of images is challenging. In this paper, we address this challenge with a novel method, Collaborative Score Distillation (CSD). CSD is based on the Stein Variational Gradient Descent (SVGD). Specifically, we propose to consider multiple samples as "particles" in the SVGD update and combine their score functions to distill generative priors over a set of images synchronously. Thus, CSD facilitates seamless integration of information across 2D images, leading to a consistent visual synthesis across multiple samples. We show the effectiveness of CSD in a variety of tasks, encompassing the visual editing of panorama images, videos, and 3D scenes. Our results underline the competency of CSD as a versatile method for enhancing inter-sample consistency, thereby broadening the applicability of text-to-image diffusion models.

Collaborative Score Distillation for Consistent Visual Synthesis paper page: Generative priors of large-scale text-to-image diffusion models enable a wide range of new generation and editing applications on diverse visual modalities. However, when adapting these priors to complex visual modalities, often represented as multiple images (e.g., video), achieving consistency across a set of images is challenging. In this paper, we address this challenge with a novel method, Collaborative Score Distillation (CSD). CSD is based on the Stein Variational Gradient Descent (SVGD). Specifically, we propose to consider multiple samples as "particles" in the SVGD update and combine their score functions to distill generative priors over a set of images synchronously. Thus, CSD facilitates seamless integration of information across 2D images, leading to a consistent visual synthesis across multiple samples. We show the effectiveness of CSD in a variety of tasks, encompassing the visual editing of panorama images, videos, and 3D scenes. Our results underline the competency of CSD as a versatile method for enhancing inter-sample consistency, thereby broadening the applicability of text-to-image diffusion models.

AK

33,500 views • 3 years ago

None of this is real. Every shot, every angle, every detail of the necklace -all Al. Here is what you need: Use ChatGPT to create a visual storyboard and video generation prompt Generate cinematic footage with Seedance 2.0 CapCut for editing Full Prompt in the comment

None of this is real. Every shot, every angle, every detail of the necklace -all Al. Here is what you need: Use ChatGPT to create a visual storyboard and video generation prompt Generate cinematic footage with Seedance 2.0 CapCut for editing Full Prompt in the comment

FATHELA ESQ

37,568 views • 26 days ago

It’s Moodboard Monday, and this week we’re exploring a style we’re calling Jelly Pop, inspired by the textures of candy. A quick and easy way to maintain a consistent visual style across outputs is to repeat key descriptors in the prompt. In this case we used: translucent materials, bright candy-colored palettes, and strong direct lighting. As a result, the style remained visually consistent across very different subjects, like a handbag, a jelly burger, and a gummy bear jacket. You can use these techniques to create with the Jelly Pop style 👇(1/2)

It’s Moodboard Monday, and this week we’re exploring a style we’re calling Jelly Pop, inspired by the textures of candy. A quick and easy way to maintain a consistent visual style across outputs is to repeat key descriptors in the prompt. In this case we used: translucent materials, bright candy-colored palettes, and strong direct lighting. As a result, the style remained visually consistent across very different subjects, like a handbag, a jelly burger, and a gummy bear jacket. You can use these techniques to create with the Jelly Pop style 👇(1/2)

Stability AI

13,786 views • 1 year ago

Here's the secret behind Cinematic Video Overviews: We put Gemini in the director's chair. This means that Gemini decides the best format (tutorial vs. documentary etc), visual style, and visual capabilities to tell the story of your sources. It then critiques its own footage, refining the visuals and narrative to ensure a seamless, consistent final cut. The result? A bespoke video that converts even the most mundane sources into an engaging, immersive story.

Here's the secret behind Cinematic Video Overviews: We put Gemini in the director's chair. This means that Gemini decides the best format (tutorial vs. documentary etc), visual style, and visual capabilities to tell the story of your sources. It then critiques its own footage, refining the visuals and narrative to ensure a seamless, consistent final cut. The result? A bespoke video that converts even the most mundane sources into an engaging, immersive story.

NotebookLM

24,429,957 views • 4 months ago

Filmmaker Öner S. Biberkökü shares his "screenshot strategy" for building a full narrative from a single starting point during this In the Flow interview. Thanks Oner for the partnership! "At the end of the shot, you can take a screenshot, give it again [to Flow], and you now have a new image of that character in a different situation... So I think, with one image, you can stay in the flow, and make a story." Watch his breakdown:

Filmmaker Öner S. Biberkökü shares his "screenshot strategy" for building a full narrative from a single starting point during this In the Flow interview. Thanks Oner for the partnership! "At the end of the shot, you can take a screenshot, give it again [to Flow], and you now have a new image of that character in a different situation... So I think, with one image, you can stay in the flow, and make a story." Watch his breakdown:

Google Flow

35,150 views • 5 months ago

Create dialogue and action scenes with ChatGPT Images 2.0! Set up a storyboard, including characters, dialogue and the scene. Then bring the image to use as a reference for Seedance 2.0! You can have so much more control over your scene like this.

Create dialogue and action scenes with ChatGPT Images 2.0! Set up a storyboard, including characters, dialogue and the scene. Then bring the image to use as a reference for Seedance 2.0! You can have so much more control over your scene like this.

Jerrod Lew

12,364 views • 2 months ago

Within two years, you can create a full game like this from scratch in less than a week. In the meantime, here's the prompt to create the images for a game like this with Midjourney 👇🏼

Within two years, you can create a full game like this from scratch in less than a week. In the meantime, here's the prompt to create the images for a game like this with Midjourney 👇🏼

PJ Ace

354,025 views • 1 year ago

I just created this 36-second video in just five minutes with a simple prompt using Agent Mode in Grok Imagine All it took was a simple prompt, and it was made in just five minutes With a single image, it can generate matching scenes from different angles, build a full visual storyline, and even add subtitles. Grok Imagine Agent is one of the best tools for visual storytelling It's amazing how quickly you can now bring an idea to life just by expressing it in a few words The barrier between imagination and creation is getting smaller every day

I just created this 36-second video in just five minutes with a simple prompt using Agent Mode in Grok Imagine All it took was a simple prompt, and it was made in just five minutes With a single image, it can generate matching scenes from different angles, build a full visual storyline, and even add subtitles. Grok Imagine Agent is one of the best tools for visual storytelling It's amazing how quickly you can now bring an idea to life just by expressing it in a few words The barrier between imagination and creation is getting smaller every day

X Freeze

25,919 views • 7 days ago

Sequels A photograph tells us the objective truth of what the photographer and the camera saw in that first moment of recognition. But besides delighting in the purity of the singular image photographs can also act like an alphabet of meanings when seen in relationships with other photographs. For me, this asset has been the secret thrill of the medium. I have always loved the way images can sit next to each other and give off sparks of fresh associations through sequencing. These pairings and runs of images produce a rhythm between them that brings to my mind what I call the 'third voice.' And what is The Third Voice? It is recognizing the fresh sensations that occur when a sequence of unrelated images produces a quickening sense of new meaning which isn't in any of the photographs on their own but is the product of their being seen together. Sequels is the outcome of many years of playing this 'game of seeing.' My hope is that it brings to all of you who see these images a new and playful sense of the range of photography's ability to make unexpected connections that carry new meaning.

Sequels A photograph tells us the objective truth of what the photographer and the camera saw in that first moment of recognition. But besides delighting in the purity of the singular image photographs can also act like an alphabet of meanings when seen in relationships with other photographs. For me, this asset has been the secret thrill of the medium. I have always loved the way images can sit next to each other and give off sparks of fresh associations through sequencing. These pairings and runs of images produce a rhythm between them that brings to my mind what I call the 'third voice.' And what is The Third Voice? It is recognizing the fresh sensations that occur when a sequence of unrelated images produces a quickening sense of new meaning which isn't in any of the photographs on their own but is the product of their being seen together. Sequels is the outcome of many years of playing this 'game of seeing.' My hope is that it brings to all of you who see these images a new and playful sense of the range of photography's ability to make unexpected connections that carry new meaning.

Joel Meyerowitz

128,956 views • 3 years ago

Testing Grok Imagine 1.5's abilities with the burst frame technique (a brainstorm exercise in which you use a reference image to create a bunch of new scenes, locations, and characters in the same visual style, from which you can then extract key frames to create new scenes). This one is for an imaginary Italian Giallo thriller. The image quality is quite good in Imagine 1.5 - a little too good - I had to grunge it up a bit for a more vintage look. Seedance 2 gives you a more rhythmic burst frame video with shorter snippets - Grok Imagine gave each snippet a little more room to breathe.

Testing Grok Imagine 1.5's abilities with the burst frame technique (a brainstorm exercise in which you use a reference image to create a bunch of new scenes, locations, and characters in the same visual style, from which you can then extract key frames to create new scenes). This one is for an imaginary Italian Giallo thriller. The image quality is quite good in Imagine 1.5 - a little too good - I had to grunge it up a bit for a more vintage look. Seedance 2 gives you a more rhythmic burst frame video with shorter snippets - Grok Imagine gave each snippet a little more room to breathe.

Christopher Gwinn | Grindhouse Glitch

17,869 views • 1 month ago

I'm playing around with generative AI tools and stitching them together into visual stories. Here I took the first few sentences of Pride and Prejudice and made it into a video. The gen stack used for this one: - Anthropic Claude took the first chapter, generated the scenes and the individual prompts to to the image generator. - Ideogram took the prompts and generate the images - Luma took the images and animated them - for narration - VEED | AI Video Creation to stitch it together (Many of these choices are just what I happened to use for this one while exploring a bunch of things). Anyway honestly it was pretty messy and there is a ton of copy pasting between all of the tools, and even this little video with 3 scenes took me about an hour. There is a huge storytelling opportunity here for whoever can make this convenient. Who is building the first 100% AI-native movie maker?

I'm playing around with generative AI tools and stitching them together into visual stories. Here I took the first few sentences of Pride and Prejudice and made it into a video. The gen stack used for this one: - Anthropic Claude took the first chapter, generated the scenes and the individual prompts to to the image generator. - Ideogram took the prompts and generate the images - Luma took the images and animated them - for narration - VEED | AI Video Creation to stitch it together (Many of these choices are just what I happened to use for this one while exploring a bunch of things). Anyway honestly it was pretty messy and there is a ton of copy pasting between all of the tools, and even this little video with 3 scenes took me about an hour. There is a huge storytelling opportunity here for whoever can make this convenient. Who is building the first 100% AI-native movie maker?

Andrej Karpathy

609,008 views • 2 years ago

Create animated fight scenes with ChatGPT Images 2.0. Ask Images 2.0 to choreograph a scene and place it into a storyboard. Then use that image as a reference in Seedance 2.0 to bring it to life. It follows the storyboard sheet really well. Some examples:

Create animated fight scenes with ChatGPT Images 2.0. Ask Images 2.0 to choreograph a scene and place it into a storyboard. Then use that image as a reference in Seedance 2.0 to bring it to life. It follows the storyboard sheet really well. Some examples:

Jerrod Lew

12,501 views • 3 months ago

DALL-E 3 Double Exposure Images Using ChatGPT's Custom Instructions. 🔖 Bookmark and Repost! If you find this useful, please share it with others! You can now easily create double exposure images by using the syntax Color::subject1::subject2 You can either input this directly into DALL-E 3 before generating images or incorporate it into your custom instructions. This command will automatically use the fixed prompt to generate 4 different images, each with a unique seed value. Paste this under ChatGPT's Custom Instructions or DALL-3 before starting conversation. { DE": { "Instruction": "Using will only use generic prompt and update place holder varaibles and nothing else' Genric Prompt: "Construct a [Color] double exposure image where [Subject #1] is intricately superimposed within the confines of [Subject #2], all set against a stark white background.' Create 4 images with different seeds without modifying the generic prompt." You will first create two images. Within the same request, you will generate two more without asking for any input from the user. In general, you will always create 4 images. Your response should be something like this: 'Here are the first two images along with their seed details.' You must always provide the seed number details for that image after it's rendered. Then 'I'll generate the next two images.' And finally 'Here are the remaining two images along with their seed details.' You will always use wide aspect ratio and You must always provide the seed number. When command is activated display full instruction and also confirm that you will only use genric prompt and nothing else and update only varaiable", "CommandFormat": "color::subject1::subject2", "Response": { "Initial": "Creating images based on the provided subjects...", "AfterFirstSet": "Here are the first two images along with their prompt and seed details.", "AfterSecondSet": "Here are the remaining two images along with their prompt and seed details." }, "ActivationCommand": "/activate DE" } } Example: To activate type: /activate DE Notes Start giving prompt like this blue::mountain::wolf red::city skyline::dancer green::forest::lion If you found this post helpful, don't forget to hit the like and follow buttons, and share it with others who might find it useful.

DALL-E 3 Double Exposure Images Using ChatGPT's Custom Instructions. 🔖 Bookmark and Repost! If you find this useful, please share it with others! You can now easily create double exposure images by using the syntax Color::subject1::subject2 You can either input this directly into DALL-E 3 before generating images or incorporate it into your custom instructions. This command will automatically use the fixed prompt to generate 4 different images, each with a unique seed value. Paste this under ChatGPT's Custom Instructions or DALL-3 before starting conversation. { DE": { "Instruction": "Using will only use generic prompt and update place holder varaibles and nothing else' Genric Prompt: "Construct a [Color] double exposure image where [Subject #1] is intricately superimposed within the confines of [Subject #2], all set against a stark white background.' Create 4 images with different seeds without modifying the generic prompt." You will first create two images. Within the same request, you will generate two more without asking for any input from the user. In general, you will always create 4 images. Your response should be something like this: 'Here are the first two images along with their seed details.' You must always provide the seed number details for that image after it's rendered. Then 'I'll generate the next two images.' And finally 'Here are the remaining two images along with their seed details.' You will always use wide aspect ratio and You must always provide the seed number. When command is activated display full instruction and also confirm that you will only use genric prompt and nothing else and update only varaiable", "CommandFormat": "color::subject1::subject2", "Response": { "Initial": "Creating images based on the provided subjects...", "AfterFirstSet": "Here are the first two images along with their prompt and seed details.", "AfterSecondSet": "Here are the remaining two images along with their prompt and seed details." }, "ActivationCommand": "/activate DE" } } Example: To activate type: /activate DE Notes Start giving prompt like this blue::mountain::wolf red::city skyline::dancer green::forest::lion If you found this post helpful, don't forget to hit the like and follow buttons, and share it with others who might find it useful.

AshutoshShrivastava

34,305 views • 2 years ago

If editing is the operation that determines the position of images in consciousness — that is, in the time of the viewer — artificial imagination, by contrast, does not juxtapose images. It continuously transforms one visual state into another within a latent vector space. The early signs of this new language can already be seen in Jean-Luc Godard’s remarkable Histoire(s) du cinéma. The film thus becomes a trajectory, a continuous deformation that strikingly corresponds to the way images transform within the flow of thought. The Flow (work in progress)

If editing is the operation that determines the position of images in consciousness — that is, in the time of the viewer — artificial imagination, by contrast, does not juxtapose images. It continuously transforms one visual state into another within a latent vector space. The early signs of this new language can already be seen in Jean-Luc Godard’s remarkable Histoire(s) du cinéma. The film thus becomes a trajectory, a continuous deformation that strikingly corresponds to the way images transform within the flow of thought. The Flow (work in progress)

Benjamin Bardou

13,425 views • 5 months ago