
Stan Szymanowicz @ CVPR26
@StanSzymanowicz • 1,091 subscribers
Making AI understand the physical 3D world @Oxford_VGG , intern @MetaAI | Previously intern @Google @microsoft | @Cambridge_Uni
Videos

We made an interactive client-server viewer for LagerNVS with Jonathon Luiten! You can now interactively explore scenes from just a photo capture - no optimization, no 3D Gaussians, just load your images, run the model on a cloud GPU and stream the renders to your local browser. Check out the video below for some spaces I recently captured in Oxford, London and beyond!
Stan Szymanowicz @ CVPR2616,637 görüntüleme • 25 gün önce

🍺 LagerNVS (CVPR 2026) 🍺 LagerNVS is a generalizable, feed-forward, real-time Novel View Synthesis network which - performs rendering in real time, - generalizes to in-the-wild data, - works with and without known source cameras, - sets a new state-of-the-art among deterministic methods, - can be paired with a diffusion decoder for generative extrapolation. LagerNVS shows that 3D biases are useful for Novel View Synthesis but explicit 3D representations are not required to achieve them. We use 3D biases in (1) architecture design and (2) pre-training: (1) In NVS with explicit 3D representations (3DGS, NeRF) reconstruction is typically difficult and slow, but rendering is much faster and simpler. We mimic this process in the network design: we use a large (1B params) encoder and a small, lightweight decoder (ViT-B). This allows increasing the network capacity while still achieving real-time rendering. (2) The encoder, initialized from VGGT, was pre-trained with 3D reconstruction objectives, making the initial features 3D aware. Both substantially improve performance. Project page: Code: Paper: Models: Work done with Jianyuan@CVPR Minghao Chen Christian Rupprecht and Andrea Vedaldi
Stan Szymanowicz @ CVPR2631,354 görüntüleme • 2 ay önce

⚡️ Introducing Bolt3D ⚡️ Bolt3D generates interactive 3D scenes in less than 7 seconds on a single GPU from one or more images. It features a latent diffusion model that *directly* generates 3D Gaussians of seen and unseen regions, without any test time optimization. 🧵👇 (1/9)
Stan Szymanowicz @ CVPR26125,801 görüntüleme • 1 yıl önce
Daha fazla içerik yok.