Actor-Centric Relation Network

07/28/2018
by   Chen Sun, et al.
6

Current state-of-the-art approaches for spatio-temporal action localization rely on detections at the frame level and model temporal context with 3D ConvNets. Here, we go one step further and model spatio-temporal relations to capture the interactions between human actors, relevant objects and scene elements essential to differentiate similar human actions. Our approach is weakly supervised and mines the relevant elements automatically with an actor-centric relational network (ACRN). ACRN computes and accumulates pair-wise relation information from actor and global scene features, and generates relation features for action classification. It is implemented as neural networks and can be trained jointly with an existing action detection system. We show that ACRN outperforms alternative approaches which capture relation information, and that the proposed framework improves upon the state-of-the-art performance on JHMDB and AVA. A visualization of the learned relation features confirms that our approach is able to attend to the relevant relations for each action.

READ FULL TEXT

page 2

page 13

page 14

research
04/24/2023

MRSN: Multi-Relation Support Network for Video Action Detection

Action detection is a challenging video understanding task, requiring mo...
research
06/29/2021

Spatio-Temporal Context for Action Detection

Research in action detection has grown in the recentyears, as it plays a...
research
12/30/2018

Actor Conditioned Attention Maps for Video Action Detection

Interactions with surrounding objects and people contain important infor...
research
10/24/2022

Abductive Action Inference

Abductive reasoning aims to make the most likely inference for a given s...
research
09/06/2022

Spatio-Temporal Action Detection Under Large Motion

Current methods for spatiotemporal action tube detection often extend a ...
research
08/02/2016

Spatio-temporal Co-Occurrence Characterizations for Human Action Classification

The human action classification task is a widely researched topic and is...
research
06/14/2020

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization

Localizing persons and recognizing their actions from videos is a challe...

Please sign up or login with your details

Forgot password? Click here to reset