Heng Yang's banner

Heng Yang

@hankyang94 • 5,005 subscribers

Assistant Professor @Harvard SEAS @hseas, Lead the Harvard Computational Robotics Lab. #Robotics, #Optimization, #Control, #Vision, #Learning

Shorts

How robust can model predictive control be if we can solve each trajectory optimization to global optimality? On the contact-rich push-T problem, we show that model-based global optimization is so robust that it never fails, even if the model is not even correct! We achieve global optimality via sparse Moment and SOS relaxations. -- Yes, we managed to solve SDPs online on a robot. Amazing work by Shucheng Kang and Guorui Liu.

How robust can model predictive control be if we can solve each trajectory optimization to global optimality? On the contact-rich push-T problem, we show that model-based global optimization is so robust that it never fails, even if the model is not even correct! We achieve global optimality via sparse Moment and SOS relaxations. -- Yes, we managed to solve SDPs online on a robot. Amazing work by Shucheng Kang and Guorui Liu.

28,718 次观看

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Glad that our work “Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling”, led by Han Qi, has been accepted to IEEE Robotics and Automation Letters! 🎉 We propose Generative Predictive Control (GPC): sample action proposals from a pretrained diffusion policy (“look back”), roll them out with a diffusion-based action-conditioned video world model (“look forward”), then rank or optimize the actions using either a learned reward model or VLM preferences. Conceptually, this is trajectory optimization / MPC with hybrid sampling + gradient optimization, interpreted through modern diffusion priors and video world models. Interestingly, we first posted the paper on arXiv in Feb 2025, when action-conditioned video world models for planning were still rare—now this direction is rapidly gaining traction. Still many open questions, e.g., • how to avoid local minima in planning • what representations work best for world models • how to balance physics priors vs. data-driven learning Paper:

Glad that our work “Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling”, led by Han Qi, has been accepted to IEEE Robotics and Automation Letters! 🎉 We propose Generative Predictive Control (GPC): sample action proposals from a pretrained diffusion policy (“look back”), roll them out with a diffusion-based action-conditioned video world model (“look forward”), then rank or optimize the actions using either a learned reward model or VLM preferences. Conceptually, this is trajectory optimization / MPC with hybrid sampling + gradient optimization, interpreted through modern diffusion priors and video world models. Interestingly, we first posted the paper on arXiv in Feb 2025, when action-conditioned video world models for planning were still rare—now this direction is rapidly gaining traction. Still many open questions, e.g., • how to avoid local minima in planning • what representations work best for world models • how to balance physics priors vs. data-driven learning Paper:

18,994 次观看 • 4 个月前

Diffusion has shown great promise for generating robot **actions**, can it act as a **world model** to generate the future conditioned on actions? In our work led by han qi Haocheng Yin and in collaboration with Yilun Du, we show a **controllable** action-conditioned video diffusion model can produce photorealistic and (near) physics-accurate future predictions. This ability strengthens the policy via: - ranking different action proposals and selecting the best, or - **visual** trajectory optimization by optimizing the action proposals using gradient ascent. Learn more about Generative Predictive Control (GPC) at:

Diffusion has shown great promise for generating robot actions, can it act as a world model to generate the future conditioned on actions? In our work led by han qi Haocheng Yin and in collaboration with Yilun Du, we show a controllable action-conditioned video diffusion model can produce photorealistic and (near) physics-accurate future predictions. This ability strengthens the policy via: - ranking different action proposals and selecting the best, or - visual trajectory optimization by optimizing the action proposals using gradient ascent. Learn more about Generative Predictive Control (GPC) at:

38,428 次观看 • 1 年前

"Building Rome with Convex Optimization" has been accepted to #RSS2025! Try XM, our new structure from motion pipeline powered by GPU-accelerated convex semidefinite optimization: XM solves large-scale (nonconvex) global bundle adjustment problem via learned depth and a tight convex semidefinite relaxation. By implementing the Burer-Monteiro low-rank factorization algorithm in CUDA, XM can solve bundle adjustment problems with more than 10,000 images/views. Technical details in the paper: Kudos to Haoyu Han

"Building Rome with Convex Optimization" has been accepted to #RSS2025! Try XM, our new structure from motion pipeline powered by GPU-accelerated convex semidefinite optimization: XM solves large-scale (nonconvex) global bundle adjustment problem via learned depth and a tight convex semidefinite relaxation. By implementing the Burer-Monteiro low-rank factorization algorithm in CUDA, XM can solve bundle adjustment problems with more than 10,000 images/views. Technical details in the paper: Kudos to Haoyu Han

27,486 次观看 • 1 年前

没有更多内容可加载