Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

01/09/2016
by   Zheng Shou, et al.
0

We address temporal action localization in untrimmed long videos. This is important because videos in real applications are usually unconstrained and contain multiple action instances plus video content of background scenes or other activities. To address this challenging issue, we exploit the effectiveness of deep networks in temporal action localization via three segment-based 3D ConvNets: (1) a proposal network identifies candidate segments in a long video that may contain actions; (2) a classification network learns one-vs-all action classification model to serve as initialization for the localization network; and (3) a localization network fine-tunes on the learned classification network to localize each action instance. We propose a novel loss function for the localization network to explicitly consider temporal overlap and therefore achieve high temporal localization accuracy. Only the proposal network and the localization network are used during prediction. On two large-scale benchmarks, our approach achieves significantly superior performances compared with other state-of-the-art systems: mAP increases from 1.7 when the overlap threshold for evaluation is set to 0.5.

READ FULL TEXT

page 2

page 8

research
07/21/2017

Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017

In this notebook paper, we describe our approach in the submission to th...
research
08/02/2019

Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos

Temporal action localization is a recently-emerging task, aiming to loca...
research
11/17/2022

ReLER@ZJU Submission to the Ego4D Moment Queries Challenge 2022

In this report, we present the ReLER@ZJU1 submission to the Ego4D Moment...
research
11/04/2019

Temporal Action Localization using Long Short-Term Dependency

Temporal action localization in untrimmed videos is an important but dif...
research
12/17/2020

Multi-shot Temporal Event Localization: a Benchmark

Current developments in temporal event or action localization usually ta...
research
08/13/2020

Localizing the Common Action Among a Few Videos

This paper strives to localize the temporal extent of an action in a lon...
research
10/17/2018

Embarrassingly Simple Model for Early Action Proposal

Early action proposal consists in generating high quality candidate temp...

Please sign up or login with your details

Forgot password? Click here to reset