Objects do not disappear: Video object detection by single-frame object location anticipation

08/09/2023
by Xin Liu, et al.

Objects in videos are typically characterized by continuous smooth motion. We exploit this property in three ways. 1) Improved accuracy: we use object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency: because neighboring video frames are often redundant, we run the expensive feature computation only on a single static keyframe and predict object locations in the subsequent frames. 3) Reduced annotation cost: we annotate only the keyframes and use smooth pseudo-motion between them. We demonstrate computational efficiency, annotation efficiency, and improved mean average precision compared to the state-of-the-art on four datasets: ImageNet VID, EPIC KITCHENS-55, YouTube-BoundingBoxes, and the Waymo Open dataset. Our source code is available at https://github.com/L-KID/Videoobject-detection-by-location-anticipation.
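The abstract does not say how the smooth pseudo-motion between annotated keyframes is generated. As a minimal sketch, assuming plain linear interpolation of box coordinates (an assumption made for illustration, not necessarily the authors' method), the intermediate boxes could be filled in as follows. The function name `interpolate_boxes` and the `(x1, y1, x2, y2)` box layout are hypothetical.

```python
from typing import List, Tuple

# Hypothetical box layout: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


def interpolate_boxes(start: Box, end: Box, num_intermediate: int) -> List[Box]:
    """Linearly interpolate boxes for the frames strictly between two keyframes.

    Assumption: the object moves at constant velocity between the two
    annotated keyframes, so each coordinate changes linearly over time.
    """
    if num_intermediate < 0:
        raise ValueError("num_intermediate must be non-negative")

    steps = num_intermediate + 1  # number of frame-to-frame gaps
    boxes: List[Box] = []
    for i in range(1, steps):
        t = i / steps  # fraction of the way from start to end
        x1, y1, x2, y2 = (a + t * (b - a) for a, b in zip(start, end))
        boxes.append((x1, y1, x2, y2))
    return boxes


if __name__ == "__main__":
    # Two annotated keyframes with three unannotated frames between them.
    for box in interpolate_boxes((0, 0, 10, 10), (4, 8, 14, 18), 3):
        print(box)
```

Only the two keyframes need manual labels here; the frames in between receive pseudo-labels, which is the kind of annotation saving the abstract describes.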

research
07/07/2020

Single Shot Video Object Detector

Single shot detectors that are potentially faster and simpler than two-s...
research
08/27/2020

Learning Representations of Endoscopic Videos to Detect Tool Presence Without Supervision

In this work, we explore whether it is possible to learn representations...
research
03/29/2017

Flow-Guided Feature Aggregation for Video Object Detection

Extending state-of-the-art object detectors from image to video is chall...
research
03/15/2018

Object Detection in Video with Spatiotemporal Sampling Networks

We propose a Spatiotemporal Sampling Network (STSN) that uses deformable...
research
04/26/2023

Video Frame Interpolation with Densely Queried Bilateral Correlation

Video Frame Interpolation (VFI) aims to synthesize non-existent intermed...
research
03/10/2021

PatchNet – Short-range Template Matching for Efficient Video Processing

Object recognition is a fundamental problem in many video processing tas...
research
07/29/2022

GPU-accelerated SIFT-aided source identification of stabilized videos

Video stabilization is an in-camera processing commonly applied by moder...
