Google Gemini 2.0 realtime AI is insane. Watch me turn it into a live code tutor just by sharing my screen and talking to it. We’re living in the future. I’m speechless.

624,751 views • 1 year ago • via X (Twitter)

11 comments

Mckay Wrigley • 1 year ago

This is why the “Is it AGI???” conversations are so silly. 90% of people would’ve said this was AGI if you showed this to them 2 years ago. The goalposts will keep moving… And it won’t matter. Because it’s already magic.

Mckay Wrigley • 1 year ago

Now I’m REALLY hoping one of OpenAI’s 12 days is AVM with video. Give me alllll the realtime products you can feed me. Feeling so lucky to be living in a genuine technological revolution. Imagine what this can do for education alone! So happy rn 🥲

Mckay Wrigley • 1 year ago

Predictably, OpenAI launched their version the next day. The race is on and I’m here for it.

Mckay Wrigley • 1 year ago

Prompt engineering 101.

Pulze.ai • 1 year ago

Seamlessly switch between AI models like ChatGPT and Gemini. Why limit yourself? Explore the power of choice with #AI #Flexibility #AdvancedFeatures

kwindla • 1 year ago

I had early access to this and have been building APIs/SDKs for the realtime/multimodal things that Google launched today. The voices are great and the video and spatial reasoning are super-impressive.

If you want to build your own app that has conversational, multimodal features, there are open source client SDKs with Gemini 2.0 multimodal support: Web, React, Android, iOS, and C++. They are part of the @pipecat_ai ecosystem and officially blessed by Google.

These SDKs have device management, echo cancellation, and noise reduction built in, plus lots of other features including hooks for function calling and tool use. They support both WebSocket and WebRTC network transport.

Here’s a full-featured starter kit built on the React SDK, a chat application with:
- a voice-to-voice WebSocket mode,
- an HTTP mode for text and image input, and
- a WebRTC mode with text, voice, camera video and screenshare video.

Mckay Wrigley • 1 year ago

Excited to see what you and others have built 👀

Dan Mac • 1 year ago

Google beat OpenAI to the punch with this one. This is a ridiculously powerful capability. You’re right that we have arrived in the future.

Mckay Wrigley • 1 year ago

Hoping for AVM + video as one of the remaining 12 days.

Shawn Chauhan • 1 year ago

Live coding assistance in real-time is going to revolutionize how we learn and work with code.

Mckay Wrigley • 1 year ago

An education revolution, and nothing short of it.
