CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation

03/24/2022
by   Mohammed Hassanin, et al.
3D human pose estimation can be handled by encoding the geometric dependencies between the body parts and enforcing the kinematic constraints. Recently, Transformers have been adopted to encode the long-range dependencies between the joints in the spatial and temporal domains. While they have shown excellence in modeling long-range dependencies, studies have noted the need to improve the locality of vision Transformers. In this direction, we propose a novel pose estimation Transformer featuring rich representations of body joints, which are critical for capturing subtle changes across frames (i.e., inter-feature representation). Specifically, through two novel interaction modules, Cross-Joint Interaction and Cross-Frame Interaction, the model explicitly encodes the local and global dependencies between the body joints. The proposed architecture achieves state-of-the-art performance on two popular 3D human pose estimation datasets, Human3.6M and MPI-INF-3DHP. In particular, our proposed CrossFormer method boosts performance by 0.9 counterpart, PoseFormer, using the detected 2D poses and ground-truth settings respectively.
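To make the idea of attending over joints in the spatial and temporal domains more concrete, here is a minimal NumPy sketch. It is not the CrossFormer architecture and does not implement the Cross-Joint Interaction or Cross-Frame Interaction modules, whose details are not given in the abstract. It only illustrates generic single-head self-attention applied first across joints within each frame and then across frames for each joint. The function names, the identity query/key/value projections, and the tensor sizes (9 frames, 17 joints, 32 features) are all assumptions chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity query/key/value projections, to keep the sketch short.
    tokens has shape (..., n_tokens, dim); attention runs over n_tokens.
    """
    dim = tokens.shape[-1]
    scores = tokens @ np.swapaxes(tokens, -1, -2) / np.sqrt(dim)
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ tokens


def spatio_temporal_attention(x):
    """Hypothetical helper: x has shape (frames, joints, dim)."""
    # Spatial step: joints attend to each other within every frame.
    spatial = self_attention(x)
    # Temporal step: move joints to the batch axis so frames become tokens.
    per_joint = np.swapaxes(spatial, 0, 1)   # (joints, frames, dim)
    temporal = self_attention(per_joint)
    return np.swapaxes(temporal, 0, 1)       # back to (frames, joints, dim)


rng = np.random.default_rng(0)
poses = rng.normal(size=(9, 17, 32))  # assumed sizes, for illustration only
out = spatio_temporal_attention(poses)
```

The output keeps the input shape, so a step like this could be stacked or followed by a regression head. A real model would add learned projections, multiple heads, positional embeddings, and normalization.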


research
03/29/2021

3D Human Pose Estimation with Spatial and Temporal Transformers

Transformer architectures have become the model of choice in natural lan...
research
03/26/2021

Lifting Transformer for 3D Human Pose Estimation in Video

Despite great progress in video-based 3D human pose estimation, it is st...
research
01/19/2022

Swin-Pose: Swin Transformer Based Human Pose Estimation

Convolutional neural networks (CNNs) have been widely utilized in many c...
research
03/10/2023

Human Pose Estimation from Ambiguous Pressure Recordings with Spatio-temporal Masked Transformers

Despite the impressive performance of vision-based pose estimators, they...
research
10/08/2022

(Fusionformer):Exploiting the Joint Motion Synergy with Fusion Network Based On Transformer for 3D Human Pose Estimation

For the current 3D human pose estimation task, in order to improve the e...
research
04/26/2016

A Framework for Human Pose Estimation in Videos

In this paper, we present a method to estimate a sequence of human poses...
