
Shubham Tulsiani
@shubhtuls • 3,429 subscribers
Assistant Professor in the Robotics Institute, Carnegie Mellon University I want to build perception systems that can understand the physical world
Shorts
Videos

[1/N] We present a plug-and-play mechanism to controllably steer inference of any diffusion/flow model towards a sharper or flatter sampling distribution, resulting in improvements across domains e.g. text-to-image (10% FID reduction), protein generation (improved designability).
Shubham Tulsiani60,777 Aufrufe • vor 8 Monaten

[1/N] Current visual geometry prediction models primarily rely on labeled 3D data. Our CVPR26 paper, Flow3r, allows additionally leveraging unlabeled videos (using flow supervision) for scalable visual geometry learning, enabling accurate multi-view 3D reconstruction in-the-wild.
Shubham Tulsiani15,974 Aufrufe • vor 3 Monaten
Keine weiteren Inhalte verfügbar