正在加载视频...
视频加载失败
I've been trying Meta smart glasses' new multimodal AI - while it's pretty basic right now, it's still sick to see it combine what it sees from the camera with the language model to describe what it's seeing! Already solid for accessibility Full episode:
10 条评论

Ben Geskin2 年前
We are getting there 👀

Everett World2 年前
Multi-modality is the next step. We're moving from LLMs to world models, that will be even more helpful for practical reasons.

Karthik Kannan2 年前
Um, we’re already here

Average Engineer2 年前
When he says that phrase and ask questions, Glass takes photo and takes question from users using OpenAI Whisper API, Upload to Gta-4 Vision API with your prompt Get back the results and make it speak again using API. Is there anything i am missing here?

David2 年前
This is the way Marques !

Rahul2 年前
Lower hanging use cases I could use this for already: reading while walking.

Pawel2 年前
It is going to be the feature of AR

Newtonian2 年前
Epic 👏🔥🔥

Joseph Bella2 年前
I feel like we are seeing history repeat itself when Apple released the Newton and Palm made the Pilot.

Ellie MacQueen2 年前
Here for the David stares

