
Sriraam
@27upon2 • 1,886 subscribers
post-training research @chakra_ai prev harvard
Videos

🔥 Google Gemini 2.0 Flash is crazy good at pointing. I was over engineering before but now I'm just gonna bet on model capabilities. This is a demo of an AI cursor explaining a diagram on tldraw with just a prompt and an image. Streaming is also simple with Vercel AI SDK.
Sriraam186,978 Aufrufe • vor 1 Jahr

Introducing Gemini Cursor ✨ – a second multimodal AI cursor for your desktop that's open-source and free! Link below 👇 This experiment 🧪 reimagines how we interact with our computers because visual cues 👀 help us make sense of what we see on a screen. In this demo, I had my friend test it out by trying to add a payment method 💳 to Amazon. The cursor walks through the entire process 💬 while talking and pointing 🖱️ to the right parts of the website. Powered by Gemini 2.0 Flash (Experimental)⚡ from Google and their live multimodal API. Shoutout to Alexander Chen for sharing the starter code that powers most of this app 🙌🔥
Sriraam171,409 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar