
Shubham Tulsiani
@shubhtuls • 3,429 subscribers
Assistant Professor in the Robotics Institute, Carnegie Mellon University I want to build perception systems that can understand the physical world
Shorts
Videos

[1/N] We present a plug-and-play mechanism to controllably steer inference of any diffusion/flow model towards a sharper or flatter sampling distribution, resulting in improvements across domains e.g. text-to-image (10% FID reduction), protein generation (improved designability).
Shubham Tulsiani60,777 просмотров • 8 месяцев назад

[1/N] Current visual geometry prediction models primarily rely on labeled 3D data. Our CVPR26 paper, Flow3r, allows additionally leveraging unlabeled videos (using flow supervision) for scalable visual geometry learning, enabling accurate multi-view 3D reconstruction in-the-wild.
Shubham Tulsiani15,974 просмотров • 3 месяцев назад
Больше нет контента для загрузки