TransPose: Real-time 3D Human Translation and Pose Estimation with Six Inertial Sensors

by Xinyu Yi, et al.

Inertial sensing opens new possibilities for motion capture because, unlike vision-based solutions, it does not suffer from occlusion or from limits on the capture range. However, because the recorded signals are sparse and quite noisy, online performance and global translation estimation remain two key difficulties. In this paper, we present TransPose, a DNN-based approach that performs full motion capture (both global translations and body poses) from only 6 Inertial Measurement Units (IMUs) at over 90 fps. For body pose estimation, we propose a multi-stage network that estimates leaf-to-full joint positions as intermediate results. This design makes pose estimation much easier, and thus achieves both better accuracy and lower computation cost. For global translation estimation, we propose a supporting-foot-based method and an RNN-based method, and combine them with a confidence-based fusion technique to robustly solve for the global translations. Quantitative and qualitative comparisons show that our method outperforms state-of-the-art learning- and optimization-based methods by a large margin in both accuracy and efficiency. As a purely inertial sensor-based approach, our method is not limited by environmental settings (e.g., fixed cameras), making the capture free from common difficulties such as wide-range motion space and strong occlusion.
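To make the idea of confidence-based fusion concrete, the following is a minimal sketch of how two per-frame root velocity estimates (one from a supporting-foot method, one from an RNN) might be blended by a foot-contact confidence. It is an illustration under stated assumptions, not the authors' implementation: the function names, the linear blending rule, and the thresholds `lo` and `hi` are all hypothetical and are not given in the abstract.

```python
import numpy as np


def fuse_velocity(v_foot, v_rnn, contact_prob, lo=0.5, hi=0.9):
    """Blend two root-velocity estimates by foot-contact confidence.

    Assumption: `contact_prob` is a scalar in [0, 1]; `lo` and `hi` are
    hypothetical thresholds chosen only for illustration.
    """
    v_foot = np.asarray(v_foot, dtype=float)
    v_rnn = np.asarray(v_rnn, dtype=float)
    # Weight of the foot-based estimate: 0 below `lo`, 1 above `hi`,
    # and linearly interpolated in between.
    w = np.clip((contact_prob - lo) / (hi - lo), 0.0, 1.0)
    return w * v_foot + (1.0 - w) * v_rnn


def integrate(velocities, dt):
    """Accumulate per-frame velocities into a translation trajectory."""
    return np.cumsum(np.asarray(velocities, dtype=float) * dt, axis=0)
```

With this rule, a frame with confident foot contact relies entirely on the foot-based estimate, a frame with low contact confidence (for example, during a jump) falls back to the RNN estimate, and frames in between are blended smoothly so the trajectory does not jump when the source switches.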




EgoPoser: Robust Real-Time Ego-Body Pose Estimation in Large Scenes

Full-body ego-pose estimation from head and hand poses alone has become ...

Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture

Either RGB images or inertial signals have been used for the task of mot...

Estimating Egocentric 3D Human Pose in Global Space

Egocentric 3D human pose estimation using a single fisheye camera has be...

Sparse Inertial Poser: Automatic 3D Human Pose Estimation from Sparse IMUs

We address the problem of making human motion capture in the wild more p...

LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors

We propose a multi-sensor fusion method for capturing challenging 3D hum...

Castle in the Sky: Dynamic Sky Replacement and Harmonization in Videos

This paper proposes a vision-based method for video sky replacement and ...

Transformer Inertial Poser: Attention-based Real-time Human Motion Reconstruction from Sparse IMUs

Real-time human motion reconstruction from a sparse set of wearable IMUs...
