Human Action Recognition: Pose-based Attention draws focus to Hands

12/20/2017
by   Fabien Baradel, et al.
0

We propose a new spatio-temporal attention based mechanism for human action recognition able to automatically attend to the hands most involved into the studied action and detect the most discriminative moments in an action. Attention is handled in a recurrent manner employing Recurrent Neural Network (RNN) and is fully-differentiable. In contrast to standard soft-attention based mechanisms, our approach does not use the hidden RNN state as input to the attention model. Instead, attention distributions are extracted using external information: human articulated pose. We performed an extensive ablation study to show the strengths of this approach and we particularly studied the conditioning aspect of the attention mechanism. We evaluate the method on the largest currently available human action recognition dataset, NTU-RGB+D, and report state-of-the-art results. Other advantages of our model are certain aspects of explanability, as the spatial and temporal attention distributions at test time allow to study and verify on which parts of the input data the method focuses.

READ FULL TEXT

page 5

page 6

research
10/01/2018

Where and When to Look? Spatio-temporal Attention for Action Recognition in Videos

Inspired by the observation that humans are able to process videos effic...
research
03/29/2017

Pose-conditioned Spatio-Temporal Attention for Human Action Recognition

We address human action recognition from multi-modal video data involvin...
research
11/16/2016

Joint Network based Attention for Action Recognition

By extracting spatial and temporal characteristics in one network, the t...
research
11/12/2015

Action Recognition using Visual Attention

We propose a soft attention based model for the task of action recogniti...
research
07/01/2021

Action Transformer: A Self-Attention Model for Short-Time Human Action Recognition

Deep neural networks based purely on attention have been successful acro...
research
11/22/2016

Recurrent Attention Models for Depth-Based Person Identification

We present an attention-based model that reasons on human body shape and...
research
02/22/2018

Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points

We propose a method for human activity recognition from RGB data which d...

Please sign up or login with your details

Forgot password? Click here to reset