CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

08/01/2022
by   Zhihao Li, et al.
9

Top-down methods dominate the field of 3D human pose and shape estimation, because they are decoupled from human detection and allow researchers to focus on the core problem. However, cropping, their first step, discards the location information from the very beginning, which makes themselves unable to accurately predict the global rotation in the original camera coordinate system. To address this problem, we propose to Carry Location Information in Full Frames (CLIFF) into this task. Specifically, we feed more holistic features to CLIFF by concatenating the cropped-image feature with its bounding box information. We calculate the 2D reprojection loss with a broader view of the full frame, taking a projection process similar to that of the person projected in the image. Fed and supervised by global-location-aware information, CLIFF directly predicts the global rotation along with more accurate articulated poses. Besides, we propose a pseudo-ground-truth annotator based on CLIFF, which provides high-quality 3D annotations for in-the-wild 2D datasets and offers crucial full supervision for regression-based methods. Extensive experiments on popular benchmarks show that CLIFF outperforms prior arts by a significant margin, and reaches the first place on the AGORA leaderboard (the SMPL-Algorithms track). The code and data are available at https://github.com/huawei-noah/noah-research/tree/master/CLIFF.

READ FULL TEXT

page 6

page 9

page 12

page 13

page 20

page 24

research
12/01/2021

Camera Motion Agnostic 3D Human Pose Estimation

Although the performance of 3D human pose and shape estimation methods h...
research
10/01/2021

SPEC: Seeing People in the Wild with an Estimated Camera

Due to the lack of camera parameter information for in-the-wild images, ...
research
04/29/2023

TAPE: Temporal Attention-based Probabilistic human pose and shape Estimation

Reconstructing 3D human pose and shape from monocular videos is a well-s...
research
09/21/2022

Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms

3D human pose and shape estimation (a.k.a. "human mesh recovery") has ac...
research
04/29/2022

A Simple Method to Boost Human Pose Estimation Accuracy by Correcting the Joint Regressor for the Human3.6m Dataset

Many human pose estimation methods estimate Skinned Multi-Person Linear ...
research
08/31/2023

EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild

We present EMDB, the Electromagnetic Database of Global 3D Human Pose an...
research
08/30/2023

Reconstructing Groups of People with Hypergraph Relational Reasoning

Due to the mutual occlusion, severe scale variation, and complex spatial...

Please sign up or login with your details

Forgot password? Click here to reset