Deep Dive Video: Complex image editing used to take hours — now Google's Gemini 2.0 turns advanced ComfyUI & Photoshop workflows into simple text prompts. Here's exactly how to try it (completely free). Chapters: 00:00 Conversational Editing with Google's Multimodal AI 00:53 Image Generation w/ LLM World Knowledge 02:12...
16 comments

For those who prefer YT (w/ chapters):

Keep up the great content. You are my most valued follow this year.

Appreciate it!

Love it! Thanks for featuring Hacky Experiments! 🙏

My pleasure! Keep hacking, and lean into some wildness — the failure cases were almost more fun than the utilitarian ones lol

Nice, I look forward to checking it out! Meanwhile, in case you and @oliver_wang2 don’t yet know one another, let’s fix that. 😌

@oliver_wang2 Thanks dude. We’re mutuals on X but we should def chat sometime Oliver!

Thanks for this breakdown!

Sweet!

how to show all the x accounts you mentioned in the videos?

Check out the video on YouTube — links to the x posts are in the description:

I’ve noticed the output quality isn’t ideal, so a few other things would have to happen in post to fix this unless Google begins to natively output HQ images. Their other models can, but this one is not based on Imagen 3, or so it has told me.

bilaw imagine just WHAT DEY HAVE HIDDEN

Dude I bet there’s some really advanced tech in a few narrow domains but I legit think as far as gen ai goes we’re all on the same roller coaster together

Thank you for this @bilawalsidhu !!
