Loading video...

Video Failed to Load

Go Home

Tracking Anything with Decoupled Video Segmentation paper page: Training data for video segmentation are expensive to annotate. This impedes extensions of end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary settings. To 'track anything' without training on video data for every individual task, we develop a decoupled video...

305,560 views • 2 years ago •via X (Twitter)

10 Comments

Enrique Moreno's profile picture
Enrique Moreno2 years ago

Next phase is to track the traffic in India. If you can do that, you have perfected the technology.

kache's profile picture
kache2 years ago

project page

B0tak 👺 Zaddy's profile picture
B0tak 👺 Zaddy2 years ago

A lot of word salad to me. Should of listened more at school.

Christopher Moonlight Productions's profile picture
Christopher Moonlight Productions2 years ago

Can it be an extension of Automatic 1111? This is rad.

Alessandro Lamberti's profile picture
Alessandro Lamberti2 years ago

Is the code available? Seems amazing!

Egido Val's profile picture
Egido Val2 years ago

wow.

T's profile picture
T2 years ago

poor beings

WHNBH's profile picture
WHNBH2 years ago

@SaveToNotion #tweet #ai

Max Ivy's profile picture
Max Ivy2 years ago

We should consider the computational overhead of using bi-directional propagation in real-time applications. How should it scale with longer videos or higher resolutions?

Not Financial Advice's profile picture
Not Financial Advice2 years ago

What do the numbers represent,,,, .71,,, .57, etc?

Related Videos

🚀 The Segment Anything Model (SAM) has been upgraded to SAM2, featuring an efficient image encoder for segmenting images and videos. But does SAM2 outperform SAM1 in medical image and video segmentation? We're thrilled to present our paper "Segment Anything in Medical Images and Videos: Benchmark and Deployment"! We comprehensively benchmark SAM2 across 11 medical image modalities and videos. 📄 Paper: 💻 Code: **Highlights:** 1. SAM2 doesn’t always outperform SAM1 in 2D medical images, but excels in video segmentation, making it more accurate and efficient for 3D images, such as CT and MR scans. 2. MedSAM still outperforms SAM2 on most 2D modalities, but SAM2 surpasses MedSAM for 3D image segmentation in a slice-by-slice approach. 3. Segmentation performance varies with model size; sometimes the smallest model outperforms larger ones. 4. Fine-tuning SAM2 significantly boosts its performance for medical image segmentation. While SAM2 may struggle with challenging objects that have unclear boundaries or low contrast, it excels in generating good initial segmentation masks for common medical images and videos. However, the official interface doesn’t support medical data formats and has limitations on video length. To address this, we've developed a 3D Slicer Plugin and Gradio API for efficient 3D medical image and video segmentation. We invite you to try them out and provide feedback! 🔧 Deployment: - 3D Slicer Plugin: - Gradio API: (Note: Due to GPU limitations, the online API is available for only 12 hours and may be slow. We highly recommend deploying the Gradio API with your own computing resources: A big shoutout to Jun Ma (JunMa) who recently joined our UHN AI hub (UHN AI Hub) as Machine Learning Lead, and kudos to all co-authors: Sumin Kim, Feifei Li, Mohammed Baharoon (Mohammed Baharoon), Reza Asakereh, and Hongwei Lyu! This is true teamwork! Looking forward to collaborating with the community to advance 3D medical image and video segmentation foundation models! University Health Network U of T Department of Computer Science Department of Laboratory Medicine & Pathobiology Temerty Centre for AI in Medicine (T-CAIREM) Vector Institute #MedTech #AIinHealthcare #DeepLearning #MedicalImaging #SAM2 #MedSAM #AIResearch

Bo Wang

178,455 views • 1 year ago