Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

SceneScript treats 3D reconstruction as a language problem rather than a geometry one. The model watches a video of a room and just learns to write a script for it. It autoregressively spits out text commands like make_wall(...) or make_bbox(...) that define the scene. Stanford's new "Scene Language" paper...

107,011 görüntüleme • 11 ay önce •via X (Twitter)

11 Yorum

Bilawal Sidhu profil fotoğrafı
Bilawal Sidhu11 ay önce

Semantic 3d scene understanding is absolutely crucial for robotics and spatial computing devices like AR and VR headsets.

Bilawal Sidhu profil fotoğrafı
Bilawal Sidhu11 ay önce

Paper/project here. Need to fill out a form to get access to model weights:

Bilawal Sidhu profil fotoğrafı
Bilawal Sidhu11 ay önce

Enjoyed this post? You might also enjoy my monthly newsletter:

Nev (unsupervised) profil fotoğrafı
Nev (unsupervised)11 ay önce

I was working in construction when the iPhone 12 Pro came out and I used the LiDAR scanner for EVERYTHING, my boss thought it was sort of gimmicky at first but I could tell he liked it after a couple days of me finding apps that created detailed depth maps and showed inconsistencies in the dug paths where slate was to be laid down, this is almost exactly what I imagined the next evolution would be

LazyFit profil fotoğrafı
LazyFit1 yıl önce

No jumping, No Running. Workouts at home at any time.🕒🏠 BEST 15 min Beginner Home Workout for Weight Loss 🧘‍♀️🔥

Andreas Klinger 🦾 profil fotoğrafı
Andreas Klinger 🦾11 ay önce

this is really cool and obvious thing that they dont mention is how this could be used to also create simpler vocabulary for a scene you could define an object in the room give it a boundary box and a name like objectX and then say "task: carry objectX to table3" or event: table3 moved to cordination xy

Dan Brickley profil fotoğrafı
Dan Brickley11 ay önce

Can it work from a Gaussian Splat scene?

Max Vox (fka Duke Zero) profil fotoğrafı
Max Vox (fka Duke Zero)11 ay önce

fung shui module wen?

David Branca profil fotoğrafı
David Branca11 ay önce

Very cool!

Andres Franco profil fotoğrafı
Andres Franco11 ay önce

Really cool stuff. Can’t wait to see where this ends up going.

Gordon Olson profil fotoğrafı
Gordon Olson11 ay önce

Special understanding goes so far. This is what will truly open up the full potential of AI. The possibilities will transgress new frontiers. This is an exciting part of that. Very cool!

Benzer Videolar

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields paper page: Editing a local region or a specific object in a 3D scene represented by a NeRF is challenging, mainly due to the implicit nature of the scene representation. Consistently blending a new realistic object into the scene adds an additional level of difficulty. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts or image patches, along with a 3D ROI box. Our method leverages a pretrained language-image model to steer the synthesis towards a user-provided text prompt or image patch, along with a 3D MLP model initialized on an existing NeRF scene to generate the object and blend it into a specified region in the original scene. We allow local editing by localizing a 3D ROI box in the input scene, and seamlessly blend the content synthesized inside the ROI with the existing scene using a novel volumetric blending technique. To obtain natural looking and view-consistent results, we leverage existing and new geometric priors and 3D augmentations for improving the visual fidelity of the final result. We test our framework both qualitatively and quantitatively on a variety of real 3D scenes and text prompts, demonstrating realistic multi-view consistent results with much flexibility and diversity compared to the baselines. Finally, we show the applicability of our framework for several 3D editing applications, including adding new objects to a scene, removing/replacing/altering existing objects, and texture conversion.

AK

62,768 görüntüleme • 3 yıl önce