
Robots Digest 🤖
@robotsdigest • 4,897 subscribers
Follow @RobotsDigest for latest in Robotics, Humanoids, and Hardware + AI.
Videos

Realtime-VLA FLASH tackles one of the biggest deployment bottlenecks for diffusion-based VLAs: inference latency. The key idea is speculative inference for flow-matching VLAs. A lightweight draft model predicts an action chunk, while the main model’s Action Expert verifies it in parallel using flow-consistency checks instead of running full denoising every replanning round. This lets the system replace many expensive 58 ms full inference rounds with speculative rounds as fast as 7.8 ms, reducing average latency to 19.1 ms and achieving a 3.04× speedup on LIBERO while largely preserving success rate. Interesting systems insight: they profile π0 and show VLM prefill is compute-bound, while Action Denoise is memory-bound. FLASH exploits this by reusing KV cache and parallelizing verification instead of repeatedly running sequential denoising.
Robots Digest 🤖14,066 views • 25 days ago

Boston Dynamics Atlas Robot Powered by AI "Large Behavior Models" Boston Dynamics and Toyota just showed Atlas doing something wild, packing boxes using a Large Behavior Model. One AI brain controls walking, crouching, lifting, everything. Just learned from human demos.
Robots Digest 🤖25,305 views • 6 months ago
No more content to load