正在加载视频...
视频加载失败
Nvidia just dropped Describe Anything on Hugging Face Detailed Localized Image and Video Captioning
6 条评论

AK1 年前
discuss with author:

AK1 年前
app:

AI Pro Workflow1 年前
NVIDIA’s “Describe Anything” just dropped on Hugging Face — and it’s a vision-language beast. 🧠 Localized captions 🎯 Region-specific precision 📹 Works on images AND video Built with a novel focal prompt + localized backbone. If you're building multimodal apps, this changes the game.

Ofir Ozeri1 年前
Also localization? Meaning extracting objects xyz attributes?

Elomaquiabelo1 年前
@grok how can I use this on mi pc? Do I need code? Can I just download it... How it works

VRLA Tech1 年前
Your next W? A fresh gaming PC. Don’t lose out. 🎮🔥
