正在加载视频...
视频加载失败
3D-LLM: Injecting the 3D World into Large Language Models paper page: Large language models (LLMs) and Vision-Language Models (VLMs) have been proven to excel at multiple tasks, such as commonsense reasoning. Powerful as these models can be, they are not grounded in the 3D physical world, which involves richer... show more
7 条评论

Yining Hong2 年前
Thanks for featuring our work!

DevHunterAI2 年前
Wow

AssistedEvolution2 年前
Looks like nice work but surprising that folk have not been doing this already as transformer -> hippocample complex so this theoretically is exactly the way you might expect to train it. i.e. with spatio- temporal context.

JP2 年前
Could this be leveraged to understand n dimensional spaces such as the weights and biases of a NN

Ori ~ᗜˬᗜ〜♡ — e/acc2 年前
🔥

Reverie2 年前
I guess MAXAR Tech starts looking for this, More precision LLMs and VLMs for their 3D large-scale maps. Such a great work!

Ippi2 年前
It's Skynet Alpha version noooooo
