Loading video...
Video Failed to Load
Explore state-of-the-art multimodal prompting in our new short course Large Multimodal Model Prompting with Gemini, taught by Erwin Huizenga in collaboration with Google Cloud. One interesting insight from this course: with multimodal models, prompt structure matters significantly. Placing text inputs, such as a patient's medical history, before image inputs,... show more
73,915 views • 1 year ago •via X (Twitter)
10 Comments

@googlecloud When are we going to get personal Ai agents that can help our daily lives like scheduling dentist appointments bc it knows which insurance I have and finds the top rated dentist in my area and blocks my work calendar so I can go. People need help with daily life

@googlecloud thanks ❤️

@googlecloud This course on Large Multimodal Model Prompting with Gemini seems fascinating.

@googlecloud Thank you for your continued leadership in transferring capacities at the bleeding edge of educational innovation. 🙏🏼 🟢@privatecli ⚫️@CLILLCTX

@googlecloud Super 😍

@googlecloud This course seems incredibly useful for anyone looking to enhance their skills in multimodal AI. I think it’s a great opportunity to learn how to effectively use Gemini’s advanced features in real-world applications.

@googlecloud Intriguing course! How does multimodal prompting compare to text-only approaches? Do visual inputs significantly influence model outputs?

@googlecloud Multimodal fusion paradigm shifting digital realm. Fascinating course. Thoughts?

@googlecloud @AndrewYNg Curious how multimodal prompting differs from text-only? What unique challenges arise when combining visuals and language? Insightful course

@googlecloud Hi Andrew, how do we get data from Google Maps using GenAi?

