Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

How can robots autonomously handle ambiguous situations that require commonsense reasoning? VLM-PC provides adaptive high-level planning, so robots can get unstuck by exploring multiple strategies. Paper:

Annie Chen

1,133 subscribers

24,112 Aufrufe • vor 1 Jahr •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

5 Kommentare

Profilbild von Annie Chen

Annie Chenvor 1 Jahr

Robots face a long tail of different possible situations. Handling this breadth of tasks typically requires heavy human supervision.

Profilbild von Annie Chen

Annie Chenvor 1 Jahr

How to effectively enable VLMs as a high-level policy with locomotion? Issue: Prompting a VLM naively can fail: - VLM can misinterpret the scene or the robot’s capabilities - Easy for the robot to get stuck

Profilbild von Annie Chen

Annie Chenvor 1 Jahr

We find 2 components important for eliciting on-the-fly, adaptive behavior selection with VLMs: 1) In-context reasoning over the history of interactions and (2) Outputting a multi-step plan

Profilbild von Annie Chen

Annie Chenvor 1 Jahr

Leveraging VLMs in this way allows a robot to handle (zero-shot!) a wide range of complex real-world situations that wide range of complex scenarios that would otherwise require environment-specific engineering or human guidance

Profilbild von Annie Chen

Annie Chenvor 1 Jahr

Thanks to wonderful collaborators @AlecLessing, @tangerinecoder, Govind Chada, @smithlaura1028, @svlevine, @chelseabfinn! I’ll be presenting this on Thursday at #ICRA2025 in Atlanta! Let me know if you’re around :)

Ähnliche Videos

How can #robots remember? 🤖 💭 For robots to understand and respond to questions that require complex multi-step reasoning in scenarios over long periods of time, we built ReMEmbR, a retrieval-augmented memory for embodied robots. 👀 Technical deep dive from #NVIDIAResearch ➡️

How can #robots remember? 🤖 💭 For robots to understand and respond to questions that require complex multi-step reasoning in scenarios over long periods of time, we built ReMEmbR, a retrieval-augmented memory for embodied robots. 👀 Technical deep dive from #NVIDIAResearch ➡️

NVIDIA AI Developer

30,485 Aufrufe • vor 1 Jahr

#NVIDIACosmos Reason, an open, customizable, 7-billion-parameter reasoning VLM for #PhysicalAI, enables robots, autonomous vehicles and visual AI agents to: 👀 See, reason, and act in the physical world. 🛠️ Solve multistep tasks and handle ambiguous or new experiences. Get started ➡️ #SIGGRAPH2025

#NVIDIACosmos Reason, an open, customizable, 7-billion-parameter reasoning VLM for #PhysicalAI, enables robots, autonomous vehicles and visual AI agents to: 👀 See, reason, and act in the physical world. 🛠️ Solve multistep tasks and handle ambiguous or new experiences. Get started ➡️ #SIGGRAPH2025

NVIDIA AI Developer

32,772 Aufrufe • vor 10 Monaten

Google DeepMind introduces Gemini Robotics 1.5, enabling robots to perceive, plan, think, use tools, and act on complex tasks. The agentic framework comprises: ⦿ Gemini Robotics-ER 1.5 (VLM): Orchestrates high-level embodied reasoning and planning. ⦿ Gemini Robotics 1.5 (VLA): Converts visuals and instructions provided by ER 1.5 into actions.

Google DeepMind introduces Gemini Robotics 1.5, enabling robots to perceive, plan, think, use tools, and act on complex tasks. The agentic framework comprises: ⦿ Gemini Robotics-ER 1.5 (VLM): Orchestrates high-level embodied reasoning and planning. ⦿ Gemini Robotics 1.5 (VLA): Converts visuals and instructions provided by ER 1.5 into actions.

The Humanoid Hub

65,928 Aufrufe • vor 8 Monaten

How robots are learning human skills, like knitting. What if robots could take on tasks requiring human-level dexterity, like knitting, decorating cakes, or installing drill bits? Acumino’s 2024 showreel demonstrates exactly that. These AI-powered robots are designed to handle intricate tasks with precision, opening up new possibilities for industries and everyday applications. Why does it matter? Acumino’s technology showcases how robots can go beyond repetitive jobs to tackle tasks that require delicate, humanlike movements—potentially transforming fields like manufacturing, food preparation, and even home services.

How robots are learning human skills, like knitting. What if robots could take on tasks requiring human-level dexterity, like knitting, decorating cakes, or installing drill bits? Acumino’s 2024 showreel demonstrates exactly that. These AI-powered robots are designed to handle intricate tasks with precision, opening up new possibilities for industries and everyday applications. Why does it matter? Acumino’s technology showcases how robots can go beyond repetitive jobs to tackle tasks that require delicate, humanlike movements—potentially transforming fields like manufacturing, food preparation, and even home services.

Circuit

23,226 Aufrufe • vor 1 Jahr

Roboticists from Robotic Systems Lab and NVIDIA Embedded are teaching four-legged robots climb and jump. After training in simulation, the robots can autonomously decide how to scramble over and under obstacles, which will help them do dangerous jobs so that humans don't have to.

Roboticists from Robotic Systems Lab and NVIDIA Embedded are teaching four-legged robots climb and jump. After training in simulation, the robots can autonomously decide how to scramble over and under obstacles, which will help them do dangerous jobs so that humans don't have to.

Evan Ackerman

2,317,927 Aufrufe • vor 2 Jahren

Chatbots like ChatGPT can be jailbroken to output harmful text. But what about robots? Can AI-controlled robots be jailbroken to perform harmful actions in the real world? Our new paper finds that jailbreaking AI-controlled robots isn't just possible. It's alarmingly easy. 🧵

Chatbots like ChatGPT can be jailbroken to output harmful text. But what about robots? Can AI-controlled robots be jailbroken to perform harmful actions in the real world? Our new paper finds that jailbreaking AI-controlled robots isn't just possible. It's alarmingly easy. 🧵

Alex Robey

111,136 Aufrufe • vor 1 Jahr

Robots in america: show how they can be dogs and slaves Robots in china: Dance and produced TikTok’s Robots in Europe:

Robots in america: show how they can be dogs and slaves Robots in china: Dance and produced TikTok’s Robots in Europe:

Michael 🇪🇺🌺

43,691 Aufrufe • vor 1 Monat

No jobs are safe. 🙂 With its frying cobots, Doosan Robotics is changing how commercial kitchens operate. These robots can handle high-temperature frying with precision and quality consistency.

No jobs are safe. 🙂 With its frying cobots, Doosan Robotics is changing how commercial kitchens operate. These robots can handle high-temperature frying with precision and quality consistency.

Rohan Paul

29,554 Aufrufe • vor 4 Monaten

🇩🇪 - Germany is BACK! Agile Robots launched Agile One, a Humanoid Robot that is designed for industry. It can handle many tasks autonomously. It learns quickly by doing the tasks, weighs 69kg, and is powered by their own AI foundation models! Europe will win 🇪🇺

🇩🇪 - Germany is BACK! Agile Robots launched Agile One, a Humanoid Robot that is designed for industry. It can handle many tasks autonomously. It learns quickly by doing the tasks, weighs 69kg, and is powered by their own AI foundation models! Europe will win 🇪🇺

NXT EU

28,619 Aufrufe • vor 6 Monaten

Scientists have developed a neural network-based planning framework that enables robots to navigate complex environments and mazes autonomously. Learn more in Science Robotics:

Scientists have developed a neural network-based planning framework that enables robots to navigate complex environments and mazes autonomously. Learn more in Science Robotics:

Science Magazine

25,783 Aufrufe • vor 11 Monaten

Living robots that can reproduce. Probably nothing.

Living robots that can reproduce. Probably nothing.

illuminatibot

49,266 Aufrufe • vor 11 Monaten

Some jobs can not be replaced by robots😂

Some jobs can not be replaced by robots😂

Tansu Yegen

3,794,021 Aufrufe • vor 1 Jahr

For robots to be actually useful, they need to be reliable. We’re sharing an RL recipe for VLA models that takes a step in this direction, allowing robots to operate autonomously for hours at a time. Blog & paper:

For robots to be actually useful, they need to be reliable. We’re sharing an RL recipe for VLA models that takes a step in this direction, allowing robots to operate autonomously for hours at a time. Blog & paper:

Chelsea Finn

74,377 Aufrufe • vor 7 Monaten

Robots autonomously preparing and serving fresh baguette sandwiches.

Robots autonomously preparing and serving fresh baguette sandwiches.

Massimo

56,499 Aufrufe • vor 11 Monaten

U.S. company Figure AI says its humanoid robots can now autonomously work full eight-hour shifts using its Helix-02 AI system.

U.S. company Figure AI says its humanoid robots can now autonomously work full eight-hour shifts using its Helix-02 AI system.

NEXTA

74,266 Aufrufe • vor 1 Monat

Chain-of-thought reasoning is a powerful tool to enable language models to work through complex problems. Can we use this with robots? With embodied chain-of-thought, vision-language-action (VLA) models can think through perception and planning! A 🧵👇

Chain-of-thought reasoning is a powerful tool to enable language models to work through complex problems. Can we use this with robots? With embodied chain-of-thought, vision-language-action (VLA) models can think through perception and planning! A 🧵👇

Sergey Levine

30,388 Aufrufe • vor 1 Jahr

UBTECH's Swarm Intelligence, powered by the 'BrainNet' framework, enables Walker S1 humanoid robots to collaborate across multiple tasks and scenarios. At Zeekr's car factory, these robots showcase their ability to handle collaborative tasks.

UBTECH's Swarm Intelligence, powered by the 'BrainNet' framework, enables Walker S1 humanoid robots to collaborate across multiple tasks and scenarios. At Zeekr's car factory, these robots showcase their ability to handle collaborative tasks.

The Humanoid Hub

102,243 Aufrufe • vor 1 Jahr

Flow reversal steering allows "steering" diffusion-based VLAs with high-level actions, for example from VLM reasoning. This also lets us run RL in the diffusion noise space with exploration guided by high-level reasoning: think through a task, then practice it! 👇

Flow reversal steering allows "steering" diffusion-based VLAs with high-level actions, for example from VLM reasoning. This also lets us run RL in the diffusion noise space with exploration guided by high-level reasoning: think through a task, then practice it! 👇

Sergey Levine

62,094 Aufrufe • vor 5 Tagen

🤖 New paper: MobileVLA-R1 A unified VLA system that brings real reasoning + continuous control to quadruped robots. CoT dataset, 2-stage training, real-world deployment. 📄paper & code & demo:

🤖 New paper: MobileVLA-R1 A unified VLA system that brings real reasoning + continuous control to quadruped robots. CoT dataset, 2-stage training, real-world deployment. 📄paper & code & demo:

Hao Tang (hiring postdocs)

27,232 Aufrufe • vor 6 Monaten