Hanwen Jiang's banner

Hanwen Jiang

@hanwenjiang1 • 1,989 subscribers

Research Scientist @ Adobe Research | PhD @ UT Austin | I work on Visual Intelligence, previously worked on 3D

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

(1/N) Will this be the BERT/GPT moment for 3D vision？ Finally, unsupervised pre-training for 3D works. Led by Qitao Zhao , we present E-RayZer — a fully self-supervised 3D reconstruction model that: 🔥Matches or surpasses supervised methods like VGGT 👀Learns transferable 3D representations, outperforming CroCo, VideoMAE, and DINO 📈Scales with more unlabeled data A new recipe for scalable 3D foundation models.

(1/N) Will this be the BERT/GPT moment for 3D vision？ Finally, unsupervised pre-training for 3D works. Led by Qitao Zhao , we present E-RayZer — a fully self-supervised 3D reconstruction model that: 🔥Matches or surpasses supervised methods like VGGT 👀Learns transferable 3D representations, outperforming CroCo, VideoMAE, and DINO 📈Scales with more unlabeled data A new recipe for scalable 3D foundation models.

58,093 次观看 • 7 个月前

Supervised learning has held 3D Vision back for too long. Meet RayZer — a self-supervised 3D model trained with zero 3D labels: ❌ No supervision of camera & geometry ✅ Just RGB images And the wild part? RayZer outperforms supervised methods (as 3D labels from COLMAP is noisy) 🌐 Project: (1/4)

Supervised learning has held 3D Vision back for too long. Meet RayZer — a self-supervised 3D model trained with zero 3D labels: ❌ No supervision of camera & geometry ✅ Just RGB images And the wild part? RayZer outperforms supervised methods (as 3D labels from COLMAP is noisy) 🌐 Project: (1/4)

69,607 次观看 • 1 年前

💥 Think more real data is needed for scene reconstruction? Think again! Meet MegaSynth: scaling up feed-forward 3D scene reconstruction with synthesized scenes. In 3 days, it generates 700K scenes for training—70x larger than real data! ✨ The secret? Reconstruction is mostly non-semantic! No need to rely heavily on real or highly realistic synthetic data. 🌐 Project: (1/4)

💥 Think more real data is needed for scene reconstruction? Think again! Meet MegaSynth: scaling up feed-forward 3D scene reconstruction with synthesized scenes. In 3 days, it generates 700K scenes for training—70x larger than real data! ✨ The secret? Reconstruction is mostly non-semantic! No need to rely heavily on real or highly realistic synthetic data. 🌐 Project: (1/4)

26,969 次观看 • 1 年前

Dense observations with perfect camera poses are hard to collect. What is a more practical setting for large-scale general object reconstruction? Presenting FORGE - jointly recovering object shape and camera poses from few-view observations, and it generalizes to novel categories in zero-shot. Project page & code: Joint work with Zhenyu Jiang, Kristen Grauman, Yuke Zhu

Dense observations with perfect camera poses are hard to collect. What is a more practical setting for large-scale general object reconstruction? Presenting FORGE - jointly recovering object shape and camera poses from few-view observations, and it generalizes to novel categories in zero-shot. Project page & code: Joint work with Zhenyu Jiang, Kristen Grauman, Yuke Zhu

18,116 次观看 • 3 年前

没有更多内容可加载