Learning Fine-Grained Features for Pixel-wise Video Correspondences

08/06/2023
by   Rui Li, et al.
0

Video analysis tasks rely heavily on identifying the pixels from different frames that correspond to the same visual target. To tackle this problem, recent studies have advocated feature learning methods that aim to learn distinctive representations to match the pixels, especially in a self-supervised fashion. Unfortunately, these methods have difficulties for tiny or even single-pixel visual targets. Pixel-wise video correspondences were traditionally related to optical flows, which however lead to deterministic correspondences and lack robustness on real-world videos. We address the problem of learning features for establishing pixel-wise correspondences. Motivated by optical flows as well as the self-supervised feature learning, we propose to use not only labeled synthetic videos but also unlabeled real-world videos for learning fine-grained representations in a holistic framework. We adopt an adversarial learning scheme to enhance the generalization ability of the learned features. Moreover, we design a coarse-to-fine framework to pursue high computational efficiency. Our experimental results on a series of correspondence-based tasks demonstrate that the proposed method outperforms state-of-the-art rivals in both accuracy and efficiency.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

research
09/26/2019

Joint-task Self-supervised Learning for Temporal Correspondence

This paper proposes to learn reliable dense correspondence from videos i...
research
07/30/2022

Learning Shadow Correspondence for Video Shadow Detection

Video shadow detection aims to generate consistent shadow predictions am...
research
11/14/2022

PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation

Unsupervised Domain Adaptation (UDA) aims to enhance the generalization ...
research
06/16/2020

Dual-Resolution Correspondence Networks

We tackle the problem of establishing dense pixel-wise correspondences b...
research
07/19/2021

Exploring Set Similarity for Dense Self-supervised Representation Learning

By considering the spatial correspondence, dense self-supervised represe...
research
01/04/2015

Unsupervised Feature Learning for Dense Correspondences across Scenes

We propose a fast, accurate matching method for estimating dense pixel c...
research
07/20/2020

MotionSqueeze: Neural Motion Feature Learning for Video Understanding

Motion plays a crucial role in understanding videos and most state-of-th...

Please sign up or login with your details

Forgot password? Click here to reset