Google Gemini 2.0 realtime AI is insane. Watch me turn it into a live code tutor just by sharing my screen and talking to it. We’re living in the future. I’m speechless.

624,751 views • 1 year ago • via X (Twitter)

Comments: 11

Mckay Wrigley • 1 year ago

This is why the “Is it AGI???” conversations are so silly. 90% of people would’ve said this was AGI if you showed this to them 2 years ago. The goalposts will keep moving… And it won’t matter. Because it’s already magic.

Mckay Wrigley • 1 year ago

Now I’m REALLY hoping one of OpenAI’s 12 days is AVM with video. Give me alllll the realtime products you can feed me. Feeling so lucky to be living in a genuine technological revolution. Imagine what this can do for education alone! So happy rn 🥲

Mckay Wrigley • 1 year ago

Predictably, OpenAI launched their version the next day. The race is on and I’m here for it.

Mckay Wrigley • 1 year ago

Prompt engineering 101.

Pulze.ai • 1 year ago

Seamlessly switch between AI models like ChatGPT and Gemini. Why limit yourself? Explore the power of choice with #AI #Flexibility #AdvancedFeatures

kwindla • 1 year ago

I had early access to this and have been building APIs/SDKs for the realtime/multimodal things that Google launched today. The voices are great and the video and spatial reasoning are super-impressive.

If you want to build your own app that has conversational, multimodal features, there are Open Source client SDKs with Gemini 2.0 multimodal support. Web, React, Android, iOS, and C++: part of the @pipecat_ai ecosystem and officially blessed by Google.

These SDKs have device management, echo cancellation, and noise reduction built in. Plus lots of other features including hooks for function calling and tool use. They support both WebSocket and WebRTC network transport.

Here’s a full-featured starter kit built on the React SDK, a chat application with:
- a voice-to-voice WebSocket mode,
- an HTTP mode for text and image input, and
- a WebRTC mode with text, voice, camera video and screenshare video.

Mckay Wrigley • 1 year ago

Excited to see what you and others have built 👀

Dan Mac • 1 year ago

Google beat OpenAI to the punch with this one. This is a ridiculously powerful capability. You’re right that we have arrived in the future.

Mckay Wrigley • 1 year ago

Hoping for AVM + video as one of the remaining 12 days

Shawn Chauhan • 1 year ago

Live coding assistance in real time is going to revolutionize how we learn and work with code.

Mckay Wrigley • 1 year ago

Education revolution and nothing short of it
