RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

04/30/2022
by Burhan A. Mudassar, et al.

Interactive autonomous applications require a perception engine that is robust to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel method to rank videos by their degree of global camera motion, and we show that action detection accuracy decreases for the videos ranked highest in camera motion. We propose an action detection pipeline that is robust to camera motion and verify it empirically. Specifically, we align actor features across frames and couple global scene features with local actor-specific features. Feature alignment uses a novel formulation of the Spatio-temporal Sampling Network (STSN) with multi-scale offset prediction and refinement using a pyramid structure. We also propose a novel input-dependent weighted averaging strategy for fusing local and global features. We show the applicability of our network on our dataset of moving camera videos with high camera motion (MOVE dataset) with a 4.1 frame mAP and 17
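
As a rough illustration of what an input-dependent weighted average of local and global features can look like, the sketch below computes two fusion weights per actor from the features themselves and uses them to blend the two vectors. This is not the RADNet implementation: the abstract does not specify the gating form, so the linear gate followed by a softmax, the function name `fuse_local_global`, the parameters `gate_w` and `gate_b`, and all shapes are assumptions chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def fuse_local_global(local_feat, global_feat, gate_w, gate_b):
    """Blend actor-specific and scene-level features with per-input weights.

    local_feat, global_feat: arrays of shape (N, D), one row per actor.
    gate_w: hypothetical gate weights of shape (2 * D, 2).
    gate_b: hypothetical gate bias of shape (2,).
    Returns the fused features (N, D) and the fusion weights (N, 2).
    """
    # The weights depend on the input: they are predicted from both features.
    joint = np.concatenate([local_feat, global_feat], axis=1)  # (N, 2D)
    weights = softmax(joint @ gate_w + gate_b, axis=1)         # (N, 2)
    # Weighted average: the two weights in each row sum to one.
    fused = weights[:, :1] * local_feat + weights[:, 1:] * global_feat
    return fused, weights


# Toy usage with random features (assumed sizes: 3 actors, 8 channels).
rng = np.random.default_rng(0)
N, D = 3, 8
local_feat = rng.normal(size=(N, D))
global_feat = rng.normal(size=(N, D))
gate_w = rng.normal(scale=0.1, size=(2 * D, 2))
gate_b = np.zeros(2)

fused, weights = fuse_local_global(local_feat, global_feat, gate_w, gate_b)
print(fused.shape, weights.sum(axis=1))
```

With a zero gate the weights fall back to 0.5 each and the fusion reduces to a plain average; a learned gate can instead lean on whichever feature is more reliable for a given input.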


