Video yükleniyor...
Video Yüklenemedi
✨ CVPR 2025 highlight: A Distractor-Aware Memory for Visual Object Tracking with SAM2 the authors propose a new distractor-aware memory model for SAM2 and an introspection-based update strategy that jointly addresses the segmentation accuracy as well as tracking robustness 🏡 (1/n)🧵👇
32,669 görüntüleme • 11 ay önce •via X (Twitter)
7 Yorum

the authors redesign SAM2’s memory into two complementary parts: Recent-Appearance Memory (RAM) – a small FIFO buffer that stores the most recent frames (time-stamped) to keep segmentation accurate as the target’s appearance changes. Distractor-Resolving Memory (DRM) – a second buffer that keeps anchor frames able to disambiguate the target from hard external or internal distractors; these slots are not time-stamped, so their influence does not decay. (2/n)🧵👇

plugging DAM and the new update rules into the off-the-shelf SAM 2.1 backbone without any retraining, yields large practical gains, setting a new SoTA (3/n)🧵👇

the authors also create a distractor-distilled tracking dataset DiDi, to address the limitation of low distractor presence in current visual object tracking benchmarks 📀 (4/n)🧵👇

Overall, the paper’s novelty lies in recognising that “one size fits all” memory is insufficient for distractor-heavy tracking and providing a simple, training-free remedy that lifts SAM-based tracking to state-of-the-art levels (5/5)🧵🏁

Good stuff brother

very cool

I feel like this example is very edge-casey
