SceneScript treats 3D reconstruction as a language problem rather than a geometry one. The model watches a video of a room and just learns to write a script for it. It autoregressively spits out text commands like make_wall(...) or make_bbox(...) that define the scene. Stanford's new "Scene Language" paper...
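To make the "scene as a script" idea concrete, here is a minimal sketch of how such text commands could be parsed into structured data. It assumes a call-style syntax of the form name(key=value, ...). Only the command names make_wall and make_bbox come from the post; every parameter name (ax, ay, bx, by, height, cx, sx, ...) and every helper name is hypothetical and should not be read as SceneScript's actual output format.

```python
import re
from dataclasses import dataclass

# Hypothetical syntax: name(key=value, ...). The post only shows
# make_wall(...) and make_bbox(...); all parameter names are invented here.
COMMAND_RE = re.compile(r"^\s*(\w+)\((.*)\)\s*$")


@dataclass
class Command:
    name: str
    params: dict


def parse_command(line: str) -> Command:
    """Turn one 'name(key=value, ...)' line into a Command."""
    match = COMMAND_RE.match(line)
    if match is None:
        raise ValueError(f"not a command: {line!r}")
    name, arg_text = match.groups()
    params = {}
    # Split on commas, skip empty pieces, and read each key=value pair.
    for pair in filter(None, (p.strip() for p in arg_text.split(","))):
        key, _, value = pair.partition("=")
        params[key.strip()] = float(value)
    return Command(name, params)


def parse_script(text: str) -> list:
    """Parse a multi-line script, ignoring blank lines."""
    return [parse_command(line) for line in text.splitlines() if line.strip()]


# Illustrative script: two walls and one object box (made-up values).
script = """
make_wall(ax=0.0, ay=0.0, bx=4.0, by=0.0, height=2.5)
make_wall(ax=4.0, ay=0.0, bx=4.0, by=3.0, height=2.5)
make_bbox(cx=2.0, cy=1.5, cz=0.4, sx=1.2, sy=0.8, sz=0.8)
"""
scene = parse_script(script)
walls = [c for c in scene if c.name == "make_wall"]
```

The point of the sketch is that the scene ends up as a short, human-readable list of commands that ordinary text tooling can read, rather than as a mesh or point cloud.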

107,011 views • 11 months ago • via X (Twitter)

11 comments

Bilawal Sidhu • 11 months ago

Semantic 3D scene understanding is absolutely crucial for robotics and spatial computing devices like AR and VR headsets.

Bilawal Sidhu • 11 months ago

Paper/project here. Need to fill out a form to get access to model weights:

Bilawal Sidhu • 11 months ago

Enjoyed this post? You might also enjoy my monthly newsletter:

Nev (unsupervised) • 11 months ago

I was working in construction when the iPhone 12 Pro came out, and I used the LiDAR scanner for EVERYTHING. My boss thought it was sort of gimmicky at first, but I could tell he liked it after a couple of days of me finding apps that created detailed depth maps and showed inconsistencies in the dug paths where slate was to be laid down. This is almost exactly what I imagined the next evolution would be.

LazyFit • 11 months ago

No jumping, No Running. Workouts at home at any time.🕒🏠 BEST 15 min Beginner Home Workout for Weight Loss 🧘‍♀️🔥

Andreas Klinger 🦾 • 11 months ago

This is really cool. An obvious thing they don't mention is how this could also be used to create a simpler vocabulary for a scene: you could define an object in the room, give it a bounding box and a name like objectX, and then say "task: carry objectX to table3" or "event: table3 moved to coordinates xy".
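As a rough illustration of the idea in this comment, the sketch below keeps a small registry of named boxes and resolves a task string against it. Everything here is hypothetical: the NamedBox type, the resolve_task helper, the coordinates, and the exact "task: carry <object> to <target>" grammar are assumptions made for the example, not part of SceneScript.

```python
from dataclasses import dataclass


@dataclass
class NamedBox:
    name: str
    center: tuple  # (x, y, z); made-up values, assumed to be metres
    size: tuple    # (width, depth, height)


def resolve_task(task: str, scene: dict) -> dict:
    """Parse 'task: carry <object> to <target>' and look up both names."""
    prefix, _, body = task.partition(":")
    if prefix.strip() != "task":
        raise ValueError(f"not a task: {task!r}")
    words = body.split()
    if len(words) != 4 or words[0] != "carry" or words[2] != "to":
        raise ValueError(f"unsupported task: {task!r}")
    obj, target = scene[words[1]], scene[words[3]]
    return {"action": "carry", "object": obj.name,
            "start": obj.center, "goal": target.center}


# Hypothetical scene vocabulary: names mapped to bounding boxes.
scene = {
    "objectX": NamedBox("objectX", (0.5, 0.5, 0.1), (0.2, 0.2, 0.2)),
    "table3": NamedBox("table3", (2.0, 1.5, 0.4), (1.2, 0.8, 0.8)),
}
plan = resolve_task("task: carry objectX to table3", scene)
```

Once objects have stable names, a task or event only needs to mention those names, and the geometry is looked up from the scene description.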

Dan Brickley • 11 months ago

Can it work from a Gaussian Splat scene?

Max Vox (fka Duke Zero) • 11 months ago

fung shui module wen?

David Branca • 11 months ago

Very cool!

Andres Franco • 11 months ago

Really cool stuff. Can’t wait to see where this ends up going.

Gordon Olson • 11 months ago

Spatial understanding goes so far. This is what will truly open up the full potential of AI. The possibilities will transgress new frontiers. This is an exciting part of that. Very cool!

Related videos

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields paper page: Editing a local region or a specific object in a 3D scene represented by a NeRF is challenging, mainly due to the implicit nature of the scene representation. Consistently blending a new realistic object into the scene adds an additional level of difficulty. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts or image patches, along with a 3D ROI box. Our method leverages a pretrained language-image model to steer the synthesis towards a user-provided text prompt or image patch, along with a 3D MLP model initialized on an existing NeRF scene to generate the object and blend it into a specified region in the original scene. We allow local editing by localizing a 3D ROI box in the input scene, and seamlessly blend the content synthesized inside the ROI with the existing scene using a novel volumetric blending technique. To obtain natural looking and view-consistent results, we leverage existing and new geometric priors and 3D augmentations for improving the visual fidelity of the final result. We test our framework both qualitatively and quantitatively on a variety of real 3D scenes and text prompts, demonstrating realistic multi-view consistent results with much flexibility and diversity compared to the baselines. Finally, we show the applicability of our framework for several 3D editing applications, including adding new objects to a scene, removing/replacing/altering existing objects, and texture conversion.

AK

62,768 views • 3 years ago