Loading video...

Video Failed to Load

Go Home

Estimating a full-body pose from only the headset and controllers is ambiguous. Environment information can help resolve this. In our #SIGGRAPH2023 paper the avatar is generated from only the 3 shown coordinate frames (no cameras) + height of the environment (green dots). (1/4)👇

200,255 views • 3 years ago •via X (Twitter)

10 Comments

Alexander Winkler's profile picture
Alexander Winkler3 years ago

The avatar is trained with Reinforcement Learning to imitate hours of typical motions (e.g. getting up from the floor, sitting on chairs) knowing the terrain height. Then during inference it is able to generate the appropriate torques to track similar but unseen motions. (2/4)

Alexander Winkler's profile picture
Alexander Winkler3 years ago

Constraints of the physics simulation achieve these natural lower-body poses (e.g. center-of-mass stability, no object or floor penetration). What's cool is how the avatar also learned to manipulate objects (e.g. tilt the simulated chair) to better follow the input signal. (3/4)

Alexander Winkler's profile picture
Alexander Winkler3 years ago

This work was led by Sunmin Lee who has a more detailed twitter thread here: Authors: @sunnyCodes_ , @blacksquirrel__ , Yuting Ye, Jungdam Won, myself Project page: (4/4)

Khen's profile picture
Khen3 years ago

The legs estimation leaving me completely shocked

Jivanshu Sharma's profile picture
Jivanshu Sharma3 years ago

@VarunMayya @ArpanLokhande @amitkvermaxd @Ridhi_sam_11

DAMSEL's profile picture
DAMSEL3 years ago

@I3Llamas

morgan's profile picture
morgan3 years ago

A fantastic and practical result!

ChroniCover's profile picture
ChroniCover3 years ago

@vr_rames @EricdeBrocart

Chris Edwards's profile picture
Chris Edwards3 years ago

Great work! I notice that shoulder movement, which should be easy to model, is usually ignored and the error breaks immersion in most VR userbody models. How accurate do you think you can make it, if you use the Quest2 cameras to detect shoulders and knees?

@Atirut@toot.community🐧's profile picture
@[email protected]🐧3 years ago

@cgonfire This is it: mocap for VR headset users

Related Videos

Physics-based Motion Retargeting from Sparse Inputs paper page: Avatars are important to create interactive and immersive experiences in virtual worlds. One challenge in animating these characters to mimic a user's motion is that commercial AR/VR products consist only of a headset and controllers, providing very limited sensor data of the user's pose. Another challenge is that an avatar might have a different skeleton structure than a human and the mapping between them is unclear. In this work we address both of these challenges. We introduce a method to retarget motions in real-time from sparse human sensor data to characters of various morphologies. Our method uses reinforcement learning to train a policy to control characters in a physics simulator. We only require human motion capture data for training, without relying on artist-generated animations for each avatar. This allows us to use large motion capture datasets to train general policies that can track unseen users from real and sparse data in real-time. We demonstrate the feasibility of our approach on three characters with different skeleton structure: a dinosaur, a mouse-like creature and a human. We show that the avatar poses often match the user surprisingly well, despite having no sensor information of the lower body available. We discuss and ablate the important components in our framework, specifically the kinematic retargeting step, the imitation, contact and action reward as well as our asymmetric actor-critic observations. We further explore the robustness of our method in a variety of settings including unbalancing, dancing and sports motions.

AK

106,519 views • 3 years ago