Human Action Recognition from Various Data Modalities: A Review

12/22/2020
by   Zehua Sun, et al.
25

Human Action Recognition (HAR), aiming to understand human behaviors and then assign category labels, has a wide range of applications, and thus has been attracting increasing attention in the field of computer vision. Generally, human actions can be represented using various data modalities, such as RGB, skeleton, depth, infrared sequence, point cloud, event stream, audio, acceleration, radar, and WiFi, etc., which encode different sources of useful yet distinct information and have various advantages and application scenarios. Consequently, lots of existing works have attempted to investigate different types of approaches for HAR using various modalities. In this paper, we give a comprehensive survey for HAR from the perspective of the input data modalities. Specifically, we review both the hand-crafted feature-based and deep learning-based methods for single data modalities, and also review the methods based on multiple modalities, including the fusion-based frameworks and the co-learning-based approaches. The current benchmark datasets for HAR are also introduced. Finally, we discuss some potentially important research directions in this area.

READ FULL TEXT
research
07/02/2019

An Analysis of Deep Neural Networks with Attention for Action Recognition from a Neurophysiological Perspective

We review three recent deep learning based methods for action recognitio...
research
08/27/2023

AIGC for Various Data Modalities: A Survey

AI-generated content (AIGC) methods aim to produce text, images, videos,...
research
02/14/2020

A Survey on 3D Skeleton-Based Action Recognition Using Learning Method

3D skeleton-based action recognition, owing to the latent advantages of ...
research
04/07/2022

A Comprehensive Review of Sign Language Recognition: Different Types, Modalities, and Datasets

A machine can understand human activities, and the meaning of signs can ...
research
11/25/2020

Recent Progress in Appearance-based Action Recognition

Action recognition, which is formulated as a task to identify various hu...
research
01/05/2016

Space-Time Representation of People Based on 3D Skeletal Data: A Review

Spatiotemporal human representation based on 3D visual perception data i...
research
06/14/2016

Multiple Human Tracking in RGB-D Data: A Survey

Multiple human tracking (MHT) is a fundamental task in many computer vis...

Please sign up or login with your details

Forgot password? Click here to reset