正在加载视频...

视频加载失败

Nvidia just dropped Describe Anything on Hugging Face Detailed Localized Image and Video Captioning

105,344 次观看 • 1 年前 •via X (Twitter)

6 条评论

AK 的头像
AK1 年前

discuss with author:

AK 的头像
AK1 年前

app:

AI Pro Workflow 的头像
AI Pro Workflow1 年前

NVIDIA’s “Describe Anything” just dropped on Hugging Face — and it’s a vision-language beast. 🧠 Localized captions 🎯 Region-specific precision 📹 Works on images AND video Built with a novel focal prompt + localized backbone. If you're building multimodal apps, this changes the game.

Ofir Ozeri 的头像
Ofir Ozeri1 年前

Also localization? Meaning extracting objects xyz attributes?

Elomaquiabelo 的头像
Elomaquiabelo1 年前

@grok how can I use this on mi pc? Do I need code? Can I just download it... How it works

VRLA Tech 的头像
VRLA Tech1 年前

Your next W? A fresh gaming PC. Don’t lose out. 🎮🔥

相关视频