正在加载视频...

视频加载失败

Deep Dive Video: Complex image editing used to take hours — now Google's Gemini 2.0 turns advanced ComfyUI & Photoshop workflows into simple text prompts. Here's exactly how to try it (completely free). Chapters: 00:00 Conversational Editing with Google's Multimodal AI 00:53 Image Generation w/ LLM World Knowledge 02:12...

34,755 次观看 • 1 年前 •via X (Twitter)

16 条评论

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

For those who prefer YT (w/ chapters):

Boxem 的头像
Boxem1 年前

It's simple. The faster your Amazon business is, the more money you make And Boxem makes your shipping faster than ever & our custom 2D barcodes have led to faster check-in times Get a free trial today:

TacticalRNDR ⭕️ 的头像
TacticalRNDR ⭕️1 年前

Keep up the great content. You are my most valued follow this year.

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

Appreciate it!

Bilal 的头像
Bilal1 年前

Love it! Thanks for featuring Hacky Experiments! 🙏

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

My pleasure! Keep hacking, and lean into some wildness — the failure cases were almost more fun that the utilitarian ones lol

John Nack 的头像
John Nack1 年前

Nice, I look forward to checking it out! Meanwhile, in case you and @oliver_wang2 don’t yet know one another, let’s fix that. 😌

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

@oliver_wang2 Thanks dude. We’re mutuals on X but we should def chat sometime Oliver!

VentureMind AI 的头像
VentureMind AI1 年前

Thanks for this breakdown!

Neville Medhora 的头像
Neville Medhora1 年前

Sweet!

Dexter | FeelDesign AI, Comfy UI, Interior Design 的头像
Dexter | FeelDesign AI, Comfy UI, Interior Design1 年前

how to show all the x accounts you mentioned in the videos?

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

Check out the video on YouTube — links to the x posts are in the description:

A T Wilkinson 的头像
A T Wilkinson1 年前

I’ve noticed the output quality to not be ideal, so a few other things would have to happen in post to fix this unless Google begins to natively output hq images. They are able in their other models but this one is not based on Imagen 3, or so it has told me.

BowtiedWhitebat + Read Pinned Tweet or NGMI 的头像
BowtiedWhitebat + Read Pinned Tweet or NGMI1 年前

bilaw imagine just WHAT DEY HAVE HIDDEN

Bilawal Sidhu 的头像
Bilawal Sidhu1 年前

Dude I bet there’s some really advanced tech in a few narrow domains but I legit think as far as gen ai goes we’re all on the same roller coaster together

Bill Platt 的头像
Bill Platt1 年前

Thank you for this @bilawalsidhu !!

相关视频

InstantDrag Improving Interactivity in Drag-based Image Editing discuss: Drag-based image editing has recently gained popularity for its interactivity and precision. However, despite the ability of text-to-image models to generate samples within a second, drag editing still lags behind due to the challenge of accurately reflecting user interaction while maintaining image content. Some existing approaches rely on computationally intensive per-image optimization or intricate guidance-based methods, requiring additional inputs such as masks for movable regions and text prompts, thereby compromising the interactivity of the editing process. We introduce InstantDrag, an optimization-free pipeline that enhances interactivity and speed, requiring only an image and a drag instruction as input. InstantDrag consists of two carefully designed networks: a drag-conditioned optical flow generator (FlowGen) and an optical flow-conditioned diffusion model (FlowDiffusion). InstantDrag learns motion dynamics for drag-based image editing in real-world video datasets by decomposing the task into motion generation and motion-conditioned image generation. We demonstrate InstantDrag's capability to perform fast, photo-realistic edits without masks or text prompts through experiments on facial video datasets and general scenes. These results highlight the efficiency of our approach in handling drag-based image editing, making it a promising solution for interactive, real-time applications.

AK

71,232 次观看 • 1 年前