End-to-end Learning of Action Detection from Frame Glimpses in Videos

11/22/2015
by   Serena Yeung, et al.
0

In this work we introduce a fully end-to-end approach for action detection in videos that learns to directly predict the temporal bounds of actions. Our intuition is that the process of detecting actions is naturally one of observation and refinement: observing moments in video, and refining hypotheses about when an action is occurring. Based on this insight, we formulate our model as a recurrent neural network-based agent that interacts with a video over time. The agent observes video frames and decides both where to look next and when to emit a prediction. Since backpropagation is not adequate in this non-differentiable setting, we use REINFORCE to learn the agent's decision policy. Our model achieves state-of-the-art results on the THUMOS'14 and ActivityNet datasets while observing only a fraction (2 frames.

READ FULL TEXT

page 1

page 6

page 7

page 8

research
11/18/2018

Temporal Recurrent Networks for Online Action Detection

Most work on temporal action detection is formulated in an offline manne...
research
03/31/2020

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however,...
research
08/18/2023

Progression-Guided Temporal Action Detection in Videos

We present a novel framework, Action Progression Network (APN), for temp...
research
11/30/2017

Budget-Aware Activity Detection with A Recurrent Policy Network

In this paper, we address the challenging problem of effi- cient tempora...
research
12/25/2018

On-Demand Video Dispatch Networks: A Scalable End-to-End Learning Approach

We design a dispatch system to improve the peak service quality of video...
research
06/22/2017

A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning

Existing action detection algorithms usually generate action proposals t...
research
10/26/2021

CTRN: Class-Temporal Relational Network for Action Detection

Action detection is an essential and challenging task, especially for de...

Please sign up or login with your details

Forgot password? Click here to reset