Video wird geladen...
Video konnte nicht geladen werden
How does an AI model actually learn to see? 🤖 Learn about the tech behind native multimodality, how models reason over visual data like documents and video, and the future of proactive AI assistants with Logan Kilpatrick and Gemini Model Behavior Product Lead, Ani Baddepudi. ↓ Timestamps: 01:12 Why... show more
58,703 Aufrufe • vor 11 Monaten •via X (Twitter)
11 Kommentare

@AniBaddepudi Watch the full episode here:

Scan any documents, convert images into text, PDF files, etc. 👍

@OfficialLoganK @AniBaddepudi AI’s ability to process multimodal data is captivating. It transforms how we interact with technology, bridging gaps between visual perception and reasoning. Excited for the insights from this discussion. #AIFuture

@OfficialLoganK @AniBaddepudi @GoogleAI, the blending of ai and visual data opens incredible possibilities for innovation.

@OfficialLoganK @AniBaddepudi @GoogleAI, the evolution of ai vision is fascinating – excited to dive deeper into this topic.

@OfficialLoganK @AniBaddepudi "AI models learn to see through a combination of advanced technology and continuous learning, paving the way for proactive AI assistants in the future."

@OfficialLoganK @AniBaddepudi Can’t wait for AI to start critiquing my interior design choices: ‘I can see this is a living room, but why did you choose that couch?’ 😅

@OfficialLoganK @AniBaddepudi this ain’t just code, it’s a glimpse at us living next to ai not just staring at screens but actually vibing with the damn thing

@OfficialLoganK @AniBaddepudi Neat. #RoarkSyntax

@OfficialLoganK @AniBaddepudi Like so i can come back

@OfficialLoganK @AniBaddepudi Fascinating topic—just remember that when a model “sees,” it also remembers unless we design for ephemerality. Teaching AI vision should come with equal lessons in how to forget.



