
Haian Jin@CVPR
@Haian_Jin • 1,175 subscribers
CS Ph.D. Student at @Cornell University | Interested in Vision and ML
Videos

Spatial reconstruction is a long-context problem: real scenes come with hundreds of images. But O(N²) transformer-based models don’t scale efficiently. Introducing: 🤐ZipMap (CVPR ’26): Linear-Time, Stateful 3D Reconstruction via Test-Time Training (TTT). ZipMap “zips” a large image collection into an implicit TTT scene state in a single linear-time operation. The state will then be decoded into spatial outputs, and can be queried efficiently for novel-view geometry and appearance (~100 FPS) ZipMap is not only much faster (>20× faster than VGGT), but also matches or surpasses the accuracy of all SOTA models.
Haian Jin@CVPR77,386 просмотров • 3 месяцев назад
Больше нет контента для загрузки