
Norman Di Palo
@normandipalo • 1,518 subscribers
deep learning + robots @ deepmind
Shorts
Videos

Last month we announced Gemini Robotics (GR) and Gemini Robotics-ER (GR-ER). GR-ER is a powerful VLM specialised for spatial understanding, including detecting object poses in 2D/3D, pointing, and even *predicting grasp poses*. Take a look at this demo. Details below. 🧵
Norman Di Palo56,437 просмотров • 1 год назад

✨ Introducing Keypoint Action Tokens. 🤖 We translate visual observations and robot actions into a "language" that off-the-shelf LLMs can ingest and output. This transforms LLMs into *in-context, low-level imitation learning machines*. 🚀 Let me explain. 👇🧵
Norman Di Palo23,088 просмотров • 2 лет назад
Больше нет контента для загрузки