Unsupervised Learning and Segmentation of Complex Activities from Video

03/26/2018
by Fadime Sener, et al.

This paper presents a new method for unsupervised segmentation of complex activities from video into multiple steps, or sub-activities, without any textual input. We propose an iterative discriminative-generative approach that alternates between discriminatively learning the appearance of sub-activities, as a mapping from the videos' visual features to sub-activity labels, and generatively modelling the temporal structure of sub-activities with a Generalized Mallows Model. In addition, we introduce a background model to account for frames unrelated to the actual activities. Our approach is validated on the challenging Breakfast Actions and Inria Instructional Videos datasets and outperforms both the unsupervised and the weakly supervised state of the art.
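
To make the generative half of the abstract more concrete, the following is a minimal sketch of how an ordering of sub-activities can be drawn from a Generalized Mallows Model in its standard inversion-count form. It is not the authors' implementation: the function names, the dispersion parameters `rho`, and the use of integer ids 0..K-1 for the sub-activities in canonical order are all assumptions made for illustration. Each inversion count v_j is drawn with probability proportional to exp(-rho_j * v_j), and the counts are then decoded into an ordering.

```python
import math
import random


def sample_inversion_counts(rho, rng):
    """Draw one inversion count per item except the last (hypothetical helper).

    rho has K-1 entries. For item j (0-indexed), the count v_j lies in
    {0, ..., K-1-j} and is drawn with P(v_j = r) proportional to
    exp(-rho[j] * r), so a larger rho[j] keeps item j closer to its
    canonical position.
    """
    k = len(rho) + 1
    counts = []
    for j, rho_j in enumerate(rho):
        max_inv = k - 1 - j
        weights = [math.exp(-rho_j * r) for r in range(max_inv + 1)]
        counts.append(rng.choices(range(max_inv + 1), weights=weights)[0])
    return counts


def inversions_to_ordering(counts):
    """Decode inversion counts into an ordering of the items 0..K-1.

    counts[j] is the number of items larger than j that precede j.
    Items are inserted from largest to smallest, so inserting item j at
    index counts[j] places exactly counts[j] larger items before it.
    """
    k = len(counts) + 1
    full = list(counts) + [0]  # the largest item never has an inversion
    ordering = []
    for item in range(k - 1, -1, -1):
        ordering.insert(full[item], item)
    return ordering


if __name__ == "__main__":
    rng = random.Random(0)
    rho = [1.0, 1.0, 1.0]  # assumed dispersion values for K = 4 sub-activities
    v = sample_inversion_counts(rho, rng)
    print(v, inversions_to_ordering(v))
```

With all counts equal to zero the decoder returns the canonical ordering, and very large values of `rho` make that outcome almost certain. Small values of `rho` let sub-activities appear in a different order from one video to the next.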

research
01/29/2020

Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences

Understanding the structure of complex activities in videos is one of th...
research
09/07/2015

An end-to-end generative framework for video segmentation and recognition

We describe an end-to-end generative approach for the segmentation and r...
research
08/21/2012

A Unified Approach for Modeling and Recognition of Individual Actions and Group Activities

Recognizing group activities is challenging due to the difficulties in i...
research
02/09/2018

Video Event Recognition and Anomaly Detection by Combining Gaussian Process and Hierarchical Dirichlet Process Models

In this paper, we present an unsupervised learning framework for analyzi...
research
10/11/2018

Globally Continuous and Non-Markovian Activity Analysis from Videos

Automatically recognizing activities in video is a classic problem in vi...
research
04/30/2021

Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities

Action recognition and detection in the context of long untrimmed video ...
research
07/13/2020

Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization

Temporally localizing activities within untrimmed videos has been extens...
