Deep Neural Networks in Video Human Action Recognition: A Review

by Zihan Wang, et al.

Video behavior recognition is currently one of the most foundational tasks in computer vision. The 2D neural networks of deep learning are built to recognize pixel-level information, such as images in RGB, RGB-D, or optical flow formats. With the increasingly wide use of surveillance video, more tasks relate to human action recognition, and a growing number of them require temporal information to analyze dependencies between frames. Researchers have therefore widely studied video-based recognition, rather than image-based (pixel-based) recognition alone, to extract more informative elements from geometry tasks. This review covers multiple recently proposed research works and compares the advantages and disadvantages of the deep learning frameworks derived from them, rather than machine learning frameworks. The comparison covers existing frameworks and datasets that use video-format data only. Given the specific properties of human actions and the increasingly wide use of deep neural networks, we collected research works from the last three years, 2020 to 2022. Among the works reviewed, deep neural networks surpassed most other techniques in feature learning and extraction tasks, especially video action recognition.




