Multi-label Class-imbalanced Action Recognition in Hockey Videos via 3D Convolutional Neural Networks

09/05/2017
by   Konstantin Sozykin, et al.
0

Automatic analysis of the video is one of most complex problems in the fields of computer vision and machine learning. A significant part of this research deals with (human) activity recognition (HAR) since humans, and the activities that they perform, generate most of the video semantics. Video-based HAR has applications in various domains, but one of the most important and challenging is HAR in sports videos. Some of the major issues include high inter- and intra-class variations, large class imbalance, the presence of both group actions and single player actions, and recognizing simultaneous actions, i.e., the multi-label learning problem. Keeping in mind these challenges and the recent success of CNNs in solving various computer vision problems, in this work, we implement a 3D CNN based multi-label deep HAR system for multi-label class-imbalanced action recognition in hockey videos. We test our system for two different scenarios: an ensemble of k binary networks vs. a single k-output network, on a publicly available dataset. We also compare our results with the system that was originally designed for the chosen dataset. Experimental results show that the proposed approach performs better than the existing solution.

READ FULL TEXT

page 5

page 8

page 9

research
09/15/2017

Multi-Label Zero-Shot Human Action Recognition via Joint Latent Embedding

Human action recognition refers to automatic recognizing human actions f...
research
11/01/2019

Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding

An event happening in the world is often made of different activities an...
research
07/20/2023

MSQNet: Actor-agnostic Action Recognition with Multi-modal Query

Existing action recognition methods are typically actor-specific due to ...
research
07/10/2018

Deep Imbalanced Attribute Classification using Visual Attention Aggregation

For many computer vision applications such as image description and huma...
research
04/25/2019

Holistic Large Scale Video Understanding

Action recognition has been advanced in recent years by benchmarks with ...
research
07/21/2019

Attention Filtering for Multi-person Spatiotemporal Action Detection on Deep Two-Stream CNN Architectures

Action detection and recognition tasks have been the target of much focu...
research
11/06/2022

Predicting User-specific Future Activities using LSTM-based Multi-label Classification

User-specific future activity prediction in the healthcare domain based ...

Please sign up or login with your details

Forgot password? Click here to reset