正在加载视频...

视频加载失败

Excited to share RoCo: Dialectic Multi-Robot Collaboration with Large Language Models. We propose a novel approach to multi-robot collaboration that leverages LLMs for both high-level communication and low-level path planning. w/ Shreeya Jain, Shuran Song

88,691 次观看 • 2 年前 •via X (Twitter)

11 条评论

Mandi Zhao 的头像
Mandi Zhao2 年前

Much of the recent work in LLM+robotics has been task planning for a single robot - in this work, we venture into multi-robot settings, and introduce a new benchmark, RoCoBench: 6 multi-robot collaboration tasks that span a variety of scenarios and robot capabilities.

Mandi Zhao 的头像
Mandi Zhao2 年前

These tasks require efficient high-level allocation, coordinated low-level execution, and generalization to variations. Our key idea: use LLMs to simulate a multi-agent dialog that facilitates collaborative reasoning, then propose sub-task plans to guide motion planning.

Mandi Zhao 的头像
Mandi Zhao2 年前

We show our system 1) benefits from the generality of LLMs in understanding varying task semantics, and 2) the multi-agent dialog offers flexibility to easily incorporate one human participation - see this human-robot collaboration task in our real-world experiments:

Mandi Zhao 的头像
Mandi Zhao2 年前

We also curate a text-based dataset that focuses on evaluating LLMs' agent representation and task reasoning ability, without requiring environment interactions. It contains open-ended questions about the RoCoBench tasks, going beyond simply finding the best action plan.

Mandi Zhao 的头像
Mandi Zhao2 年前

For more details, check out: arxiv: website: code & RoCoBench:

AssemblyAI 的头像
AssemblyAI1 年前

Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

Ted Xiao 的头像
Ted Xiao2 年前

@shreeyaajain @SongShuran Nice work! The field's been searching for years for effective "robot languages" (usually in some learned latent space) to enable emergent communication in MARL settings. ...maybe the best "robot language" is just English?

Mandi Zhao 的头像
Mandi Zhao2 年前

@shreeyaajain @SongShuran Thanks Ted! I suspect even if there is some kind of optimal latent communication, it just might be more appealing to have robots talk in language that human can also understand & supervise

Avi Singh 的头像
Avi Singh2 年前

@shreeyaajain @SongShuran Exciting stuff!

Alper Canberk 的头像
Alper Canberk2 年前

@shreeyaajain @SongShuran RoCo's basilisk...

Mandi Zhao 的头像
Mandi Zhao2 年前

@AlperCanberk1 @shreeyaajain @SongShuran I did Not think of that lol

相关视频