ilker

@ailker • 6,385 subscribers

Creative Engineer at @FAL

Shorts

📷 a new GPT Image 2 use case: text-to-360 turn text or an image into a 360° panorama.

📷 a new GPT Image 2 use case: text-to-360 turn text or an image into a 360° panorama.

80,670 görüntüleme

made the sports fan cam trend into fal workflow. just add your photos and type something like “lakers vs celtics finals game 7” or “wimbledon final center court”

made the sports fan cam trend into fal workflow. just add your photos and type something like “lakers vs celtics finals game 7” or “wimbledon final center court”

25,585 görüntüleme

the best way to create the cute AI videos is seedance 2.0

the best way to create the cute AI videos is seedance 2.0

31,517 görüntüleme

my inbox these days

my inbox these days

15,504 görüntüleme

I've prepared a workflow for those struggling with Kling v3 multi-shot. Just type in what you want, and it will automatically create multi-shot prompts and generate a multi-shot video for you.

I've prepared a workflow for those struggling with Kling v3 multi-shot. Just type in what you want, and it will automatically create multi-shot prompts and generate a multi-shot video for you.

11,780 görüntüleme

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Found a way to make nano-banana-pro insanely cost-effective on fal I built a workflow that generates 9 4K images for a total of $0.21. That’s just $0.023 per image. That is a ~92% cost reduction. The key is grid generation. I added a 'Crop Node' as a workflow utility specifically for this. It takes the grid output, splits it, and then we upscale everything using SeedVR

Found a way to make nano-banana-pro insanely cost-effective on fal I built a workflow that generates 9 4K images for a total of $0.21. That’s just $0.023 per image. That is a ~92% cost reduction. The key is grid generation. I added a 'Crop Node' as a workflow utility specifically for this. It takes the grid output, splits it, and then we upscale everything using SeedVR

148,277 görüntüleme • 7 ay önce

🎨 I trained 999 Style LoRAs with the fal Krea 2 Trainer. Every single image in this video is a different LoRA. That's 999 unique Style LoRAs, each trained for just 100 steps at 0.00035 LR, taking an average of ~30 seconds to train. As I said before, Krea 2 continues to prove that it's an incredible model for open-source Style LoRAs. Next week I'll be sharing even more LoRAs, along with a few surprises.

🎨 I trained 999 Style LoRAs with the fal Krea 2 Trainer. Every single image in this video is a different LoRA. That's 999 unique Style LoRAs, each trained for just 100 steps at 0.00035 LR, taking an average of ~30 seconds to train. As I said before, Krea 2 continues to prove that it's an incredible model for open-source Style LoRAs. Next week I'll be sharing even more LoRAs, along with a few surprises.

12,346 görüntüleme • 24 gün önce

MEOWPOCALYPSE I’m honestly considering making a short film for this with Seedance 2.0 (even if I'm the only one who ever watches it)

MEOWPOCALYPSE I’m honestly considering making a short film for this with Seedance 2.0 (even if I'm the only one who ever watches it)

48,181 görüntüleme • 5 ay önce

Soon everyone will be able to make their own cartoon with Seedance 2.0 Here's one I made for noah

Soon everyone will be able to make their own cartoon with Seedance 2.0 Here's one I made for noah

24,438 görüntüleme • 5 ay önce

As I promised yesterday, I'll briefly explain LoRA training and share a workflow I made so you can do it quickly. First, let me answer a very common question: 'Why train LoRAs when we have such advanced models?' Even though we have incredibly advanced models now (like NBP), we still can't always get them to do specific things we want. Simplest example: the spritesheet LoRA I made the other day. I generated 1000 images with Nano Banana and only 100 were what I wanted. The LoRA I trained using those 100 images gives me nearly 100% consistent results. Second point is cost and speed. With LoRA, we can cut costs by 4-5x. And while doing that, we're generating 4-5x faster. How many images do you need for a good LoRA? This depends on your LoRA's complexity. For example, when I training the spritesheet LoRA, even though I used 100 images, I didn't include buildings in the training data, so this LoRA doesn't work for buildings. So think about your LoRA's use cases and add examples for as many use cases as possible to improve quality. What are paired images and how to train LoRAs for image-editing? When training LoRAs for image editing on fal, we call each edit example paired images - one with _start suffix, one with _end suffix. For example, if you're training a background remove LoRA, the unedited original photo will be your '_start' image. The image with background removed will be the '_end' image. Simply put: images we want to edit or use as reference get _start, target images we want to achieve get '_end'. Important: save both images with the same name. Like image332_start.jpg and image332_end.jpg. This way the system knows which images pair together. What about training LoRAs for models with multiple image inputs? Same logic. We still use _start and _end suffixes, but with one difference. Since there are multiple input images, we can number them: _start, _start1, _start2. Example: start images, 1st image = Woman portrait (image35_start.jpg) 2nd image = Glasses photo (image35_start1.jpg) 3rd image = Hat photo (image35_start2.jpg) Output image = portrait of woman wearing glasses and hat (image35_end.jpg) Can we do more detailed captioning? Yes. Similarly, you can improve training quality by creating a txt file for each set with the caption inside. Example: create image35.txt and write: 'Recreate the image by putting the glasses from the second image and the hat from the third image on the woman in the first image.' What are Steps? How many should I use? What's Learning Rate? Steps determines how many times the model sees and processes your training data (your images). Each step, the model learns a bit more. But as steps increase, so does the risk of overfitting. So there's no real default. But for a simpler LoRA with 20 paired images, 1000 steps is ideal. Here's a metaphor for the Steps and Learning Rate relationship: Imagine you have a balloon. Our goal is to inflate it to the optimal size. Steps = How many times we blow into the balloon Learning rate = How hard we blow each time If we blow too softly, we need to blow many more times. If we blow too hard, we risk popping it quickly and can't reach optimal size. Of course training won't explode, but it won't work as intended because it wasn't trained optimally. Training's done, now what? Once training's complete, you'll have a safetensors file. Every model you train on fal has a LoRA inference endpoint. In that inference, add your safetensors file link to the LoRA url input, and you can use your LoRA. Thanks for the read! The workflow in the video: If I forgot anything, let me know in the replies.

As I promised yesterday, I'll briefly explain LoRA training and share a workflow I made so you can do it quickly. First, let me answer a very common question: 'Why train LoRAs when we have such advanced models?' Even though we have incredibly advanced models now (like NBP), we still can't always get them to do specific things we want. Simplest example: the spritesheet LoRA I made the other day. I generated 1000 images with Nano Banana and only 100 were what I wanted. The LoRA I trained using those 100 images gives me nearly 100% consistent results. Second point is cost and speed. With LoRA, we can cut costs by 4-5x. And while doing that, we're generating 4-5x faster. How many images do you need for a good LoRA? This depends on your LoRA's complexity. For example, when I training the spritesheet LoRA, even though I used 100 images, I didn't include buildings in the training data, so this LoRA doesn't work for buildings. So think about your LoRA's use cases and add examples for as many use cases as possible to improve quality. What are paired images and how to train LoRAs for image-editing? When training LoRAs for image editing on fal, we call each edit example paired images - one with _start suffix, one with _end suffix. For example, if you're training a background remove LoRA, the unedited original photo will be your '_start' image. The image with background removed will be the '_end' image. Simply put: images we want to edit or use as reference get _start, target images we want to achieve get '_end'. Important: save both images with the same name. Like image332_start.jpg and image332_end.jpg. This way the system knows which images pair together. What about training LoRAs for models with multiple image inputs? Same logic. We still use _start and _end suffixes, but with one difference. Since there are multiple input images, we can number them: _start, _start1, _start2. Example: start images, 1st image = Woman portrait (image35_start.jpg) 2nd image = Glasses photo (image35_start1.jpg) 3rd image = Hat photo (image35_start2.jpg) Output image = portrait of woman wearing glasses and hat (image35_end.jpg) Can we do more detailed captioning? Yes. Similarly, you can improve training quality by creating a txt file for each set with the caption inside. Example: create image35.txt and write: 'Recreate the image by putting the glasses from the second image and the hat from the third image on the woman in the first image.' What are Steps? How many should I use? What's Learning Rate? Steps determines how many times the model sees and processes your training data (your images). Each step, the model learns a bit more. But as steps increase, so does the risk of overfitting. So there's no real default. But for a simpler LoRA with 20 paired images, 1000 steps is ideal. Here's a metaphor for the Steps and Learning Rate relationship: Imagine you have a balloon. Our goal is to inflate it to the optimal size. Steps = How many times we blow into the balloon Learning rate = How hard we blow each time If we blow too softly, we need to blow many more times. If we blow too hard, we risk popping it quickly and can't reach optimal size. Of course training won't explode, but it won't work as intended because it wasn't trained optimally. Training's done, now what? Once training's complete, you'll have a safetensors file. Every model you train on fal has a LoRA inference endpoint. In that inference, add your safetensors file link to the LoRA url input, and you can use your LoRA. Thanks for the read! The workflow in the video: If I forgot anything, let me know in the replies.

15,133 görüntüleme • 6 ay önce

I really like this model. Are there any specific use cases where you think Avatar models struggle? Let's discuss it together

15,747 görüntüleme • 7 ay önce

xAI Grok Imagine exclusively on fal! You should definitely try out the edit feature.

xAI Grok Imagine exclusively on fal! You should definitely try out the edit feature.

12,138 görüntüleme • 5 ay önce

Osimhen's journey is one of the most powerful stories I've ever come across. That's why I made this video for Victor Osimhen

Osimhen's journey is one of the most powerful stories I've ever come across. That's why I made this video for Victor Osimhen

10,652 görüntüleme • 5 ay önce

Daha fazla içerik yok.