HAN: An Efficient Hierarchical Self-Attention Network for Skeleton-Based Gesture Recognition

06/25/2021
by   Jianbo Liu, et al.
0

Previous methods for skeleton-based gesture recognition mostly arrange the skeleton sequence into a pseudo picture or spatial-temporal graph and apply deep Convolutional Neural Network (CNN) or Graph Convolutional Network (GCN) for feature extraction. Although achieving superior results, these methods have inherent limitations in dynamically capturing local features of interactive hand parts, and the computing efficiency still remains a serious issue. In this work, the self-attention mechanism is introduced to alleviate this problem. Considering the hierarchical structure of hand joints, we propose an efficient hierarchical self-attention network (HAN) for skeleton-based gesture recognition, which is based on pure self-attention without any CNN, RNN or GCN operators. Specifically, the joint self-attention module is used to capture spatial features of fingers, the finger self-attention module is designed to aggregate features of the whole hand. In terms of temporal features, the temporal self-attention module is utilized to capture the temporal dynamics of the fingers and the entire hand. Finally, these features are fused by the fusion self-attention module for gesture classification. Experiments show that our method achieves competitive results on three gesture recognition datasets with much lower computational complexity.

READ FULL TEXT

page 1

page 10

research
01/22/2021

A Two-stream Neural Network for Pose-based Hand Gesture Recognition

Pose based hand gesture recognition has been widely studied in the recen...
research
07/20/2019

Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention

We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) met...
research
01/26/2022

Self-Attention Neural Bag-of-Features

In this work, we propose several attention formulations for multivariate...
research
12/17/2021

Self-attention based anchor proposal for skeleton-based action recognition

Skeleton sequences are widely used for action recognition task due to it...
research
02/27/2019

Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions

We introduce a new method to tag Multiword Expressions (MWEs) using a li...
research
07/15/2022

A Non-Anatomical Graph Structure for isolated hand gesture separation in continuous gesture sequences

Continuous Hand Gesture Recognition (CHGR) has been extensively studied ...
research
04/18/2023

GlobalMind: Global Multi-head Interactive Self-attention Network for Hyperspectral Change Detection

High spectral resolution imagery of the Earth's surface enables users to...

Please sign up or login with your details

Forgot password? Click here to reset