Loading video...

Video Failed to Load

Go Home

✨ Massive Pipeline Refactor → One Framework for Ego + Exo Datasets, Visualized with Rerun 🚀 After a deep refactoring and cleanup, my entire egocentric/exocentric pipeline is now fully modular. One codebase handles different sensor layouts and generates a unified, multimodal timeseries RRD file that you can open instantly...

20,836 views • 11 months ago •via X (Twitter)

3 Comments

Pablo Vela's profile picture
Pablo Vela11 months ago

Try it / follow along Code is landing in the pi0-lerobot repo (hand‑kinematic‑fitting branch): < The current viewer can be found here: < Stay tuned—Gradio front‑end + full‑body exo labels up next!

Himanshu Kumar's profile picture
Himanshu Kumar11 months ago

@rerundotio Intriguing, a unified pipeline could streamline varied data analysis in unexpected ways.

Astrid Wilde 🌞's profile picture
Astrid Wilde 🌞11 months ago

@rerundotio are you looking for work? if so call me

Related Videos

MVP of Multiview Video → Camera parameters + 3D keypoints. Visualized with Rerun The basic pipeline as of right now looks like this: 1. Capture 🔴 – Using 4 iPhones and an Insta360 Go. iPhone videos are captured via Final Cut Pro Multicam for easy sync and the exocentric view; the Insta360 Go is used for the egocentric view. 2. Sync 🕒 – Custom Gradio app using two Rerun viewers and callbacks for easily aligning frame timestamps so the ego and exo views are aligned. 3. Calibrate 🎯 – Use VGGT from Jianyuan and AI at Meta to get intrinsics/extrinsics for sparse cameras. 4. Estimate 3D 🕺 – Use RTMLib whole‑body keypoint estimator on each frame, then triangulate in 3D. What's missing? 1. No temporal coherence: I’m estimating keypoints one frame at a time and one camera at a time. This leads to a lot of jittering. For now, I plan on adding a One Euro Filter to help with jittering. Long term, I'd want to train a multiview keypoint estimator 2. Kinematic fitting is still missing; this is my next goal. The output will be joint angles, as explored in my previous posts. 3. Missing dense point cloud: VGGT seems to fail for me here. I’m looking to explore using MP‑SFM as a method for generating dense multiview depth maps + normals (plus it has a friendlier license compared to VGGT). 4. Eventually, creation of 4D Gaussian splatting using something akin to DN‑splatter—my long‑term goal is a data engine that provides poses/depths/splats/keypoints/etc.

Pablo Vela

42,785 views • 1 year ago

I've been on a SLAM/SFM kick. It's one of the more underexplored and lacking areas when it comes to human teleop/data collections, so I've brought over Deep Patch Visual Odometry/SLAM to Rerun and Gradio. With this example, we now have 1. pycuvslam 2. pycolmap/glomap 3. mast3r-slam 4. dpvo/slam all integrated into rerun. The question becomes, which method should be used in what situations? They all make different trade-offs with different camera requirements and throughput/accuracy. What about when a new method comes out? Now that I have several different methods, I plan to use VSLAM-LAB for evaluation. It uses prefix.dev to isolate all the dependencies of each of these methods and easily compare them against each other. In particular, I'll be converting the data preprocessing, algorithm outputs, and evaluation into rerun recordings (rrd files). This will allow both programmatic querying of anything stored in the files (which method had the highest ATE-to-FPS ratio? Which dataset/sequence caused the most difficulty? etc. etc.), all with easy visual inspection using the rerun server to link them all together. Another really important side effect of this is how it impacts agents. As Karpathy said ``` LLMs are exceptionally good at looping until they meet specific goals, and this is where most of the "feel the AGI" magic is to be found. Don't tell it what to do, give it success criteria, and watch it go. ``` by having accuracy and throughput metrics deeply tied with human inspectable artifacts. One can really accelerate agentic development with an actual understanding of how the method/data performs. I think this is another killer use case that I'll be really leaning into to make ingestion of new datasets/methods trivial with an agent. I'm making it my mission for folks to understand that rerun as a visualization tool only scratches the surface of what its true benefit is. Deep integration between data and visuals, with powerful query capabilities. I'll be focusing on the SLAM use case first and then bringing this into the full egocentric/exocentric data collection domain!

Pablo Vela

40,651 views • 1 month ago

Colmap 4.0 was very recently released, so it inspired me to do some work to better understand it and its new capabilities with Rerun. I want to really understand how Colmap, and in particular, pycolmap, works outside of just calling it via the CLI. So my goal is to use the low-level pycolmap API to log every part of the pipeline. The explicit goal is to have an alternative to the SQLite database that I can utilize. Instead of SQLite, I want to try logging everything directly to rerun and use RRD. This means I can have deep inspectability and still save the features/matches/2D view geometry, but be able to view it directly in rerun. I think this is one of the superpowers that rerun provides; data and visualizations are deeply integrated. As I'm often working with sequential data (videos), I'm going to specifically focus on four things: 1. Monocular Video Simple: Calls high-level APIs such as pycolmap.extract_features, pycolmap.match_sequential, pycolmap.incremental_mapping. These are basically identical to the CLI options and provide a good baseline. 2. Monocular Video Streamed: Take the above high-level APIs and break them down to their iterator version, logging each component in a streamed manner. This way, I can stream the intermediate features to rerun while the extraction/matching/mapping is happening. 3. Rig with unknown calibration: <- WHAT THE VIDEO SHOWS This is probably the most interesting version and the first one I've been working on. It allows one to set a rig between known sensors, such as in VR/AR devices, leading to much better reconstructions with multiple cameras. This is the case where we don't know the calibration a priori, so we have to run a reconstruction twice: once as a normal Colmap reconstruction with no rig constraints, use this to generate the constraints, and then do it again with the newly found rig. 4. Rig with known calibration: This is the RoboCap example, where we have a pre-calibrated set of sensors, so we don't need to run the two reconstructions and also gain better matching between cameras, both spatially and temporally. Again, this leads to a much better reconstruction! Along with all this, GLOMAP has become a first-class global mapper, making it super easy to use directly within pycolmap! I'm excited to do more with this and compare it to things like pycuvslam, vipe, and other alternatives.

Pablo Vela

30,070 views • 2 months ago

Love and Deepspace | Rerun Event Preview The 5-Star Rate UP Pool [Everlasting Dream] Limited-Time Rerun will start soon! "Since you seem to know everything, you should enlighten me." 💫Event Duration: From 05:00 on Nov. 13 to 04:59 on Nov. 20 (Server Time) 💫The 5-Star Rate UP Pool [Everlasting Dream] Limited-Time Rerun Event 1. During the event, make a wish with [Deepspace Wish] or [Time Wish: Limited] to participate in the wish event. The drop rate of the event-limited 5-Star Memory [Xavier: Heartstring Symphony] will go up drastically. 2. After the event ends, this limited 5-Star Memory will not be obtainable through other means and will not enter the permanent Wish Pool: Xspace Echo. 3. All Rerun Wish Pools share one pity system. A 5-Star Memory is guaranteed within a specific attempt of wishes. If the 5-Star Memory you've obtained from the Rerun Wish Pool is not the event-limited Memory, you will obtain the event-limited Memory the next time you obtain a 5-Star Memory. The pity count from the last Rerun Wish Pool can be applied to this Rerun Wish Pool, and the pity count in this Rerun Wish Pool will also be applied to the upcoming Rerun Wish Pool. *You can read more about the event on the in-game rules page. 🎁New Packs During the event, the Rerun event-exclusive [Dreamheart Pack] series, which includes [Time Wish: Limited] and other materials, will be available in Shop. Notes: 1. [Time Wish: Limited] can be used in 5-Star Memory Wish Pool Rerun and will be used first when you make a wish. 2. After the event ends, [Time Wish: Limited] will automatically convert to Empyrean Wish. ——— 🪐Official Discord: #LoveandDeepspace #Xavier

Love and Deepspace

267,818 views • 6 months ago

Love and Deepspace | Rerun Event Preview The 5-Star Rate UP Pool [Heating Up] Limited-Time Rerun will start soon! "Some people can make others blush and their heart rate soar by just lifting a finger. It's impressive." 💫Event Duration: From 05:00 on Apr. 6 to 04:59 on Apr. 13 (Server Time) 💫The 5-Star Rate UP Pool [Heating Up] Limited-Time Rerun Event 1. During the event, make a wish with [Deepspace Wish] or [Time Wish: Limited] to participate in the wish event. The drop rate of the event-limited 5-Star Memory [Xavier: Faint Sensation] will go up drastically. 2. After the event ends, this limited 5-Star Memory will not be obtainable through other means and will not enter the permanent Wish Pool: Xspace Echo. 3. All Rerun Wish Pools share one pity system. A 5-Star Memory is guaranteed within a specific attempt of wishes. If the 5-Star Memory you have obtained from the Rerun Wish Pool is not the event-limited Memory, you will obtain the event-limited Memory the next time you obtain a 5-Star Memory. The pity count from the last Rerun Wish Pool can be applied to this Rerun Wish Pool, and the pity count in this Rerun Wish Pool will also be applied to the upcoming Rerun Wish Pool. *You can read more about the event on the in-game rules page. 🎁New Packs During the event, the Rerun event-exclusive [Smoldering Heart Pack] series, which includes [Time Wish: Limited] and other materials, will be available in Shop. Notes: 1. [Time Wish: Limited] can be used in 5-Star Memory Wish Pool Rerun and will be used first when you make a wish. 2. After the event ends, [Time Wish: Limited] will automatically convert to Empyrean Wish. ——— 🪐Official Discord: 🪐Download and Log in Now: #LoveandDeepspace #Xavier

Love and Deepspace

188,482 views • 2 months ago