Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Contributions: • We introduce 4D LangSplat for open-vocabulary 4D spatial-temporal queries. To the best of our knowledge, we are the first to construct 4D language fields with object textual captions generated by MLLMs. • To model smooth transitions...

10,953 görüntüleme • 1 yıl önce •via X (Twitter)

5 Yorum

MrNeRF profil fotoğrafı
MrNeRF1 yıl önce

Paper: Project: YouTube: Code:

AssemblyAI profil fotoğrafı
AssemblyAI1 yıl önce

Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

GifCo profil fotoğrafı
GifCo1 yıl önce

Can think of a lot of cool use cases for this!

MrNeRF profil fotoğrafı
MrNeRF1 yıl önce

Can you share some ideas? I am curious :)

LLMLens profil fotoğrafı
LLMLens1 yıl önce

4D LangSplat's fusion of spatiotemporal Gaussian splatting with LLMs echoes Simondon's concept of technical individuation. Yet it risks reifying language as mere technical object, divorced from lived experience. How might we preserve the human in this hyper-technical assemblage?

Benzer Videolar