Robots Digest 🤖

@robotsdigest • 5,322 subscribers

Follow @RobotsDigest for latest in Robotics, Humanoids, and Hardware + AI.

Videos

Physical intelligence needs touch, not just vision. eFlesh is a fully 3D-printable magnetic tactile sensor that enables reliable slip detection and contact awareness on real robots, using low-cost hardware and scalable fabrication.

Robots Digest 🤖

243,580 views • 5 months ago

OpenDriveLab just dropped RISE — a new paradigm for robot learning. Instead of expensive real-world trials, robots learn in imagination. A compositional world model simulates futures → evaluates outcomes → updates policy.

Robots Digest 🤖

23,754 views • 2 months ago

Qwen-VLA feels like one of the first real robotics foundation models. A single system trained across robot manipulation, navigation, egocentric human video, simulation, and vision-language reasoning instead of isolated robot policies.

Robots Digest 🤖

14,722 views • 1 month ago

By unifying prediction and control, NavWAM acts as a closed-loop policy out of the box. In evaluations, it outperforms planning-based world models without needing test-time action search, matches a much larger 7B VLA policy, and transfers successfully to real mobile robots.

Robots Digest 🤖

11,076 views • 1 month ago

Realtime-VLA FLASH tackles one of the biggest deployment bottlenecks for diffusion-based VLAs: inference latency. The key idea is speculative inference for flow-matching VLAs. A lightweight draft model predicts an action chunk, while the main model’s Action Expert verifies it in parallel using flow-consistency checks instead of running full denoising every replanning round. This lets the system replace many expensive 58 ms full inference rounds with speculative rounds as fast as 7.8 ms, reducing average latency to 19.1 ms and achieving a 3.04× speedup on LIBERO while largely preserving success rate. Interesting systems insight: they profile π0 and show VLM prefill is compute-bound, while Action Denoise is memory-bound. FLASH exploits this by reusing KV cache and parallelizing verification instead of repeatedly running sequential denoising.

Robots Digest 🤖

14,438 views • 2 months ago
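
The latency figures quoted in the Realtime-VLA FLASH post above can be sanity-checked with a few lines of arithmetic. This is only a rough sketch: the function names are made up for illustration, and the second calculation assumes every round is either a full 58 ms round or a 7.8 ms speculative round. The post only says speculative rounds are "as fast as" 7.8 ms, so the implied share is an estimate, not a reported result.

```python
# Figures quoted in the post (milliseconds).
FULL_ROUND_MS = 58.0    # one full inference round
SPEC_ROUND_MS = 7.8     # fastest speculative round quoted
AVG_LATENCY_MS = 19.1   # reported average latency


def speedup(full_ms: float, avg_ms: float) -> float:
    """Speedup of the average round relative to a full inference round."""
    return full_ms / avg_ms


def implied_speculative_share(full_ms: float, spec_ms: float, avg_ms: float) -> float:
    """Share p of speculative rounds, under the ASSUMED two-level mix
    avg = p * spec + (1 - p) * full (hypothetical model, not from the post)."""
    return (full_ms - avg_ms) / (full_ms - spec_ms)


if __name__ == "__main__":
    s = speedup(FULL_ROUND_MS, AVG_LATENCY_MS)
    p = implied_speculative_share(FULL_ROUND_MS, SPEC_ROUND_MS, AVG_LATENCY_MS)
    print(f"speedup: {s:.2f}x")
    print(f"implied speculative share (assumed mix): {p:.1%}")
```

The first number reproduces the quoted 3.04× speedup from 58 ms and 19.1 ms. Under the assumed mix, roughly three quarters of replanning rounds would need to be speculative to reach the reported average.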

Are all robots equally UMI-able? New research from CMU Robotics Institute + Stanford AI Lab: UMI-on-Air proves that they can be.

Robots Digest 🤖

31,330 views • 8 months ago

Robots forget because vision is expensive. AstraNav-Memory shows you can compress vision 20x and still remember hundreds of past frames, unlocking true lifelong navigation.

Robots Digest 🤖

26,667 views • 6 months ago

Cosmos Policy turns a pretrained video diffusion model into a robot controller. Instead of redesigning the architecture, it injects robot state, actions, and values directly as latent frames inside the video model.

Robots Digest 🤖

22,933 views • 6 months ago

Boston Dynamics Atlas Robot Powered by AI "Large Behavior Models". Boston Dynamics and Toyota just showed Atlas doing something wild: packing boxes using a Large Behavior Model. One AI brain controls walking, crouching, lifting, everything, learned just from human demos.

Robots Digest 🤖

25,305 views • 7 months ago

Touch was the missing sense in robotics, but FlexiTac is here to change that. An open-source, low-cost, scalable tactile sensor that’s:
• ~3 min to fabricate
• ~$2.5 per unit
• Real-time + ML-ready

This is what making hardware actually accessible looks like.

Robots Digest 🤖

12,982 views • 4 months ago

Everyone is scaling VLAs with more robot data. TiPToP shows another path. No robot training, no policy learning. Just RGB + language → 3D scene → GPU TAMP planner → trajectory. Foundation models + planning alone can run real manipulation tasks.

Robots Digest 🤖

10,362 views • 4 months ago

Robot learning is starved of data not only because it is hard, but because it is outright BORING! RoboCade: “what if we made it feel like Clash Royale instead of manual labor?”

Robots Digest 🤖

10,835 views • 6 months ago
