Загрузка видео...
Не удалось загрузить видео
I've been trying Meta smart glasses' new multimodal AI - while it's pretty basic right now, it's still sick to see it combine what it sees from the camera with the language model to describe what it's seeing! Already solid for accessibility Full episode:
1,128,329 просмотров • 2 лет назад •via X (Twitter)
Комментарии: 10

We are getting there 👀

Multi-modality is the next step. We're moving from LLMs to world models, that will be even more helpful for practical reasons.

Um, we’re already here

When he says that phrase and ask questions, Glass takes photo and takes question from users using OpenAI Whisper API, Upload to Gta-4 Vision API with your prompt Get back the results and make it speak again using API. Is there anything i am missing here?

This is the way Marques !

Lower hanging use cases I could use this for already: reading while walking.

It is going to be the feature of AR

Epic 👏🔥🔥

I feel like we are seeing history repeat itself when Apple released the Newton and Palm made the Pilot.

Here for the David stares

