Encode the Unseen: Predictive Video Hashing for Scalable Mid-Stream Retrieval

09/30/2020
by   Tong Yu, et al.
0

This paper tackles a new problem in computer vision: mid-stream video-to-video retrieval. This task, which consists in searching a database for content similar to a video right as it is playing, e.g. from a live stream, exhibits challenging characteristics. Only the beginning part of the video is available as query and new frames are constantly added as the video plays out. To perform retrieval in this demanding situation, we propose an approach based on a binary encoder that is both predictive and incremental in order to (1) account for the missing video content at query time and (2) keep up with repeated, continuously evolving queries throughout the streaming. In particular, we present the first hashing framework that infers the unseen future content of a currently playing video. Experiments on FCVID and ActivityNet demonstrate the feasibility of this task. Our approach also yields a significant mAP@20 performance increase compared to a baseline adapted from the literature for this task, for instance 7.4 elapsed runtime on FCVID using bitcodes of size 192 bits.

READ FULL TEXT

page 1

page 7

page 8

page 13

research
03/08/2022

Live Laparoscopic Video Retrieval with Compressed Uncertainty

Searching through large volumes of medical data to retrieve relevant inf...
research
09/01/2020

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval

The query-based moment retrieval is a problem of localising a specific c...
research
02/07/2018

Self-Supervised Video Hashing with Hierarchical Binary Auto-encoder

Existing video hash functions are built on three isolated stages: frame ...
research
01/10/2022

Multi-query Video Retrieval

Retrieving target videos based on text descriptions is a task of great p...
research
06/14/2018

Semantic Image Retrieval by Uniting Deep Neural Networks and Cognitive Architectures

Image and video retrieval by their semantic content has been an importan...
research
11/02/2017

Set-to-Set Hashing with Applications in Visual Recognition

Visual data, such as an image or a sequence of video frames, is often na...

Please sign up or login with your details

Forgot password? Click here to reset