Cascaded 3D Full-body Pose Regression from Single Depth Image at 100 FPS
There are increasingly real-time live applications in virtual reality, where it plays an important role to capture and retarget 3D human pose. This paper presents a novel cascaded 3D full-body pose regression method to estimate accurate pose from a single depth image at 100 fps. The key idea is to train cascaded regressors based on Gradient Boosting algorithm from pre-recorded human motion capture database. By incorporating hierarchical kinematics model of human pose into the learning procedure, we can directly estimate 3D joint angles instead of joint positions. The biggest advantage of this model is that the bone length can be preserved during the whole 3D pose estimation procedure, which leads to more effective features and higher pose estimation accuracy. Our method can be used as an initialization procedure when combining with tracking methods. We demonstrate the power of our method on a wide range of synthesized human motion data from CMU mocap database, Human3.6M dataset and real human movements data captured in real time. In our comparison against alternative pose estimation methods and commercial system such as Kinect 2017, we achieve state-of-the-art accuracy.
READ FULL TEXT