Heatmap Distribution Matching for Human Pose Estimation

by Haoxuan Qu, et al.
Nanyang Technological University
Singapore University of Technology and Design

Most recent methods for 2D human pose estimation treat the task as a heatmap estimation problem: they optimize the predicted heatmap against a Gaussian-smoothed target heatmap using a pixel-wise loss such as MSE. In this paper, we show that when the heatmap prediction is optimized in this way, body joint localization performance, which is the intrinsic objective of the task, may not improve consistently during the optimization of the heatmap prediction. To address this problem, we take a novel perspective and formulate the optimization of the heatmap prediction as a distribution matching problem directly between the predicted heatmap and the dot annotation of the body joint. As a result, our method does not need to construct a Gaussian-smoothed heatmap, and it achieves more consistent improvement in model performance during the optimization of the heatmap prediction. We demonstrate the effectiveness of our method through extensive experiments on the COCO and MPII datasets.
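To make the contrast concrete, the following is a minimal toy sketch, not the authors' implementation. It builds a Gaussian-smoothed target on a 5x5 grid and compares two hypothetical predictions under pixel-wise MSE and under a simple matching cost against the dot annotation. The abstract does not say which matching cost the method uses. The expected distance to the annotated pixel is an assumption made here for illustration; it equals the transport cost between a normalized heatmap and a single point mass. All function names, the grid size, and the prediction values are likewise illustrative.

```python
import math

def gaussian_heatmap(height, width, cx, cy, sigma=1.0):
    """Gaussian-smoothed target heatmap centred on the annotated joint (cx, cy)."""
    return [[math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
             for x in range(width)] for y in range(height)]

def mse(pred, target):
    """Pixel-wise mean squared error between two heatmaps of equal size."""
    n = len(pred) * len(pred[0])
    return sum((p - t) ** 2
               for pred_row, target_row in zip(pred, target)
               for p, t in zip(pred_row, target_row)) / n

def argmax_2d(heatmap):
    """Joint location read off the heatmap as the (x, y) of its largest value."""
    best_value, best_xy = float("-inf"), None
    for y, row in enumerate(heatmap):
        for x, value in enumerate(row):
            if value > best_value:
                best_value, best_xy = value, (x, y)
    return best_xy

def expected_distance_to_dot(heatmap, cx, cy):
    """Assumed matching cost: mean distance from the normalized heatmap's
    mass to the dot annotation (cx, cy)."""
    total = sum(sum(row) for row in heatmap)
    return sum(value / total * math.hypot(x - cx, y - cy)
               for y, row in enumerate(heatmap)
               for x, value in enumerate(row))

joint = (2, 2)  # dot annotation (x, y) on a 5x5 grid
target = gaussian_heatmap(5, 5, *joint)

# Prediction A: correct peak location, but low amplitude.
pred_a = [[0.3 * v for v in row] for row in target]

# Prediction B: matches the target everywhere except one spurious spike
# at a corner pixel that is slightly higher than the true peak.
pred_b = [row[:] for row in target]
pred_b[0][0] = 1.1

for name, pred in (("A", pred_a), ("B", pred_b)):
    print(name,
          "mse =", round(mse(pred, target), 4),
          "argmax =", argmax_2d(pred),
          "matching cost =", round(expected_distance_to_dot(pred, *joint), 4))
```

In this toy case, prediction B has the lower MSE even though its argmax lands on the wrong pixel, while the assumed matching cost ranks prediction A, which localizes the joint correctly, as the better of the two.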




