
Jia-Bin Huang
@jbhuang0604 • 77,845 subscribers
I am the one who wears a jacket.
Shorts
Videos

Introducing Generative Omnimatte: A method for decomposing a video into complete layers, including objects and their associated effects (e.g., shadows, reflections). It enables many cool applications, such as video stylization, compositions, moment retiming, and object removal.
Jia-Bin Huang81,713 Aufrufe • vor 1 Jahr

Exploration is key for robots to generalize, especially in open-ended environments with vague goals and sparse rewards. BUT, how do we go beyond random poking? Wouldn't it be great to have a robot that explores an environment just like a kid? Introducing Imagine, Verify, Execute (IVE)! IVE leverages Vision-Language models to • extract semantic scene graphs, • imagine novel scenes, • predict their physical plausibility, and • generate executable sequences. IVE is a memory-guided agentic exploration framework that operates fully automatically, enabling more diverse and meaningful exploration.
Jia-Bin Huang45,285 Aufrufe • vor 1 Jahr

3D illusions are fascinating! 🤩 But it takes exceptional artistic skills to make one. We present Illusion3D - a simple method for creating 3D multiview illusions, where the interpretations change depending on your perspectives. Let's play Where's Waldo, shall we? 😆
Jia-Bin Huang34,483 Aufrufe • vor 1 Jahr

How do we go beyond reconstructing colors and recovering the intrinsic scene properties? 🤔 👁️ IRIS: Inverse Rendering of Indoor Scenes IRIS estimates accurate material, lighting, and camera response functions given a set of LDR images, enabling photorealistic and view-consistent relighting and object insertion.
Jia-Bin Huang15,917 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar