Unsupervised Discovery of Actions in Instructional Videos

06/28/2021
by   AJ Piergiovanni, et al.
10

In this paper we address the problem of automatically discovering atomic actions in unsupervised manner from instructional videos. Instructional videos contain complex activities and are a rich source of information for intelligent agents, such as, autonomous robots or virtual assistants, which can, for example, automatically `read' the steps from an instructional video and execute them. However, videos are rarely annotated with atomic activities, their boundaries or duration. We present an unsupervised approach to learn atomic actions of structured human tasks from a variety of instructional videos. We propose a sequential stochastic autoregressive model for temporal segmentation of videos, which learns to represent and discover the sequential relationship between different atomic actions of the task, and which provides automatic and unsupervised self-labeling for videos. Our approach outperforms the state-of-the-art unsupervised methods with large margins. We will open source the code.

READ FULL TEXT

page 1

page 8

research
06/07/2021

Unsupervised Action Segmentation for Instructional Videos

In this paper we address the problem of automatically discovering atomic...
research
06/19/2018

VirtualHome: Simulating Household Activities via Programs

In this paper, we are interested in modeling complex activities that occ...
research
02/01/2019

Learning Differentiable Grammars for Continuous Data

This paper proposes a novel algorithm which learns a formal regular gram...
research
04/28/2020

Inferring Temporal Compositions of Actions Using Probabilistic Automata

This paper presents a framework to recognize temporal compositions of at...
research
01/27/2020

rCRF: Recursive Belief Estimation over CRFs in RGB-D Activity Videos

For assistive robots, anticipating the future actions of humans is an es...
research
01/26/2022

Learning To Recognize Procedural Activities with Distant Supervision

In this paper we consider the problem of classifying fine-grained, multi...
research
02/09/2018

Video Event Recognition and Anomaly Detection by Combining Gaussian Process and Hierarchical Dirichlet Process Models

In this paper, we present an unsupervised learning framework for analyzi...

Please sign up or login with your details

Forgot password? Click here to reset