Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

03/09/2020
by   Jialin Gao, et al.

Accurate temporal action proposals play an important role in detecting actions in untrimmed videos. Existing approaches have difficulty capturing global contextual information while simultaneously localizing actions of different durations. To this end, we propose a Relation-aware Pyramid Network (RapNet) to generate highly accurate temporal action proposals. In RapNet, a novel relation-aware module exploits bi-directional long-range relations between local features to distill context. This embedded module strengthens RapNet's ability to generate temporal proposals at multiple granularities from predefined anchor boxes. We further introduce a two-stage adjustment scheme that refines the proposal boundaries and uses snippet-level actionness to measure each proposal's confidence of containing an action. Extensive experiments on the challenging ActivityNet and THUMOS14 benchmarks demonstrate that RapNet generates more accurate proposals than existing state-of-the-art methods.
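To make the idea of multi-granularity proposals from predefined anchor boxes more concrete, the following is a minimal sketch and not the RapNet implementation. It assumes a hypothetical temporal pyramid in which each level halves the temporal resolution and places one anchor per cell, and it uses temporal IoU (tIoU) to match anchors to a ground-truth segment. The function names, the `base_scale` parameter, and the anchor layout are illustrative assumptions that the abstract does not specify.

```python
def temporal_iou(a, b):
    """Temporal IoU between two segments given as (start, end)."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def pyramid_anchors(num_snippets, num_levels, base_scale=2.0):
    """Build anchors for a hypothetical temporal pyramid (an assumption).

    Level k has stride 2**k and num_snippets // 2**k cells. Each cell gets
    one anchor centered on the cell with length base_scale * stride,
    clipped to the range [0, num_snippets].
    """
    anchors = []
    for level in range(num_levels):
        stride = 2 ** level
        length = base_scale * stride
        for i in range(num_snippets // stride):
            center = (i + 0.5) * stride
            start = max(0.0, center - length / 2)
            end = min(float(num_snippets), center + length / 2)
            anchors.append((start, end))
    return anchors


def best_anchor(anchors, gt):
    """Return the anchor with the highest tIoU against a ground-truth segment."""
    return max(anchors, key=lambda a: temporal_iou(a, gt))


if __name__ == "__main__":
    anchors = pyramid_anchors(num_snippets=16, num_levels=3)
    # A long action is matched by a coarse-level anchor, a short one by a fine-level anchor.
    print(best_anchor(anchors, (2.0, 10.0)))
    print(best_anchor(anchors, (3.0, 5.0)))
```

In this sketch, coarser pyramid levels cover long actions and finer levels cover short ones, which is the general motivation for pyramid-style anchors. A boundary refinement stage such as the one the abstract describes would then adjust the matched anchor rather than use it directly.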


research
08/09/2019

Relation-Aware Pyramid Network (RapNet) for temporal action proposal

In this technical report, we describe our solution to temporal action pr...
research
06/21/2022

Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation

It has been found that temporal action proposal generation, which aims t...
research
11/26/2019

SRG: Snippet Relatedness-based Temporal Action Proposal Generator

Recent temporal action proposal generation approaches have suggested int...
research
03/02/2022

DisARM: Displacement Aware Relation Module for 3D Detection

We introduce Displacement Aware Relation Module (DisARM), a novel neural...
research
09/07/2019

Graph Convolutional Networks for Temporal Action Localization

Most state-of-the-art action localization systems process each action pr...
research
07/20/2022

HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers

Temporal action localization (TAL) is a task of identifying a set of act...
research
12/01/2021

Graph Convolutional Module for Temporal Action Localization in Videos

Temporal action localization has long been researched in computer vision...
