
Jia-Bin Huang
@jbhuang0604 • 77,845 subscribers
I am the one who wears a jacket.
Shorts
Videos

Introducing Generative Omnimatte: A method for decomposing a video into complete layers, including objects and their associated effects (e.g., shadows, reflections). It enables many cool applications, such as video stylization, compositions, moment retiming, and object removal.
Jia-Bin Huang81,713 次观看 • 1 年前

Exploration is key for robots to generalize, especially in open-ended environments with vague goals and sparse rewards. BUT, how do we go beyond random poking? Wouldn't it be great to have a robot that explores an environment just like a kid? Introducing Imagine, Verify, Execute (IVE)! IVE leverages Vision-Language models to • extract semantic scene graphs, • imagine novel scenes, • predict their physical plausibility, and • generate executable sequences. IVE is a memory-guided agentic exploration framework that operates fully automatically, enabling more diverse and meaningful exploration.
Jia-Bin Huang45,285 次观看 • 1 年前

How do we go beyond reconstructing colors and recovering the intrinsic scene properties? 🤔 👁️ IRIS: Inverse Rendering of Indoor Scenes IRIS estimates accurate material, lighting, and camera response functions given a set of LDR images, enabling photorealistic and view-consistent relighting and object insertion.
Jia-Bin Huang15,917 次观看 • 1 年前
没有更多内容可加载