Learning Quality-aware Representation for Multi-person Pose Regression

01/04/2022
by   Yabo Xiao, et al.
3

Off-the-shelf single-stage multi-person pose regression methods generally leverage the instance score (i.e., confidence of the instance localization) to indicate the pose quality for selecting the pose candidates. We consider that there are two gaps involved in existing paradigm: 1) The instance score is not well interrelated with the pose regression quality. 2) The instance feature representation, which is used for predicting the instance score, does not explicitly encode the structural pose information to predict the reasonable score that represents pose regression quality. To address the aforementioned issues, we propose to learn the pose regression quality-aware representation. Concretely, for the first gap, instead of using the previous instance confidence label (e.g., discrete 1,0 or Gaussian representation) to denote the position and confidence for person instance, we firstly introduce the Consistent Instance Representation (CIR) that unifies the pose regression quality score of instance and the confidence of background into a pixel-wise score map to calibrates the inconsistency between instance score and pose regression quality. To fill the second gap, we further present the Query Encoding Module (QEM) including the Keypoint Query Encoding (KQE) to encode the positional and semantic information for each keypoint and the Pose Query Encoding (PQE) which explicitly encodes the predicted structural pose information to better fit the Consistent Instance Representation (CIR). By using the proposed components, we significantly alleviate the above gaps. Our method outperforms previous single-stage regression-based even bottom-up methods and achieves the state-of-the-art result of 71.7 AP on MS COCO test-dev set.

READ FULL TEXT

page 1

page 3

page 8

research
12/15/2022

QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query

We propose a sparse end-to-end multi-person pose regression framework, t...
research
06/28/2020

Multi-Person Pose Regression via Pose Filtering and Scoring

Multi-person pose estimation is one of the mainstream tasks of computer ...
research
10/08/2022

AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression

Multi-person pose estimation generally follows top-down and bottom-up pa...
research
01/06/2017

Towards Accurate Multi-person Pose Estimation in the Wild

We propose a method for multi-person detection and 2-D pose estimation t...
research
07/19/2021

InsPose: Instance-Aware Networks for Single-Stage Multi-Person Pose Estimation

Multi-person pose estimation is an attractive and challenging task. Exis...
research
05/02/2018

Bi-directional Graph Structure Information Model for Multi-Person Pose Estimation

In this paper, we propose a novel multi-stage network architecture with ...
research
09/20/2023

Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

Recently, Transformer-based text detection techniques have sought to pre...

Please sign up or login with your details

Forgot password? Click here to reset