Unsupervised Semantic Parsing of Video Collections

06/28/2015
by   Ozan Sener, et al.
0

Human communication typically has an underlying structure. This is reflected in the fact that in many user generated videos, a starting point, ending, and certain objective steps between these two can be identified. In this paper, we propose a method for parsing a video into such semantic steps in an unsupervised way. The proposed method is capable of providing a semantic "storyline" of the video composed of its objective steps. We accomplish this using both visual and language cues in a joint generative model. The proposed method can also provide a textual description for each of the identified semantic steps and video segments. We evaluate this method on a large number of complex YouTube videos and show results of unprecedented quality for this intricate and impactful problem.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
05/11/2016

Unsupervised Semantic Action Discovery from Video Collections

Human communication takes many forms, including speech, text and instruc...
research
10/16/2022

Motion-Based Weak Supervision for Video Parsing with Application to Colonoscopy

We propose a two-stage unsupervised approach for parsing videos into pha...
research
06/30/2015

Unsupervised Learning from Narrated Instruction Videos

We address the problem of automatically learning the main steps to compl...
research
03/06/2019

Visual Discourse Parsing

Text-level discourse parsing aims to unmask how two segments (or sentenc...
research
06/17/2020

Semantic Visual Navigation by Watching YouTube Videos

Semantic cues and statistical regularities in real-world environment lay...
research
03/07/2017

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos

We propose an unsupervised method for reference resolution in instructio...
research
10/05/2016

Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks

This paper presents a novel approach for temporal and semantic segmentat...

Please sign up or login with your details

Forgot password? Click here to reset