Decoupling Features in Hierarchical Propagation for Video Object Segmentation

10/18/2022
by   Zongxin Yang, et al.
9

This paper focuses on developing a more effective method of hierarchical propagation for semi-supervised Video Object Segmentation (VOS). Based on vision transformers, the recently-developed Associating Objects with Transformers (AOT) approach introduces hierarchical propagation into VOS and has shown promising results. The hierarchical propagation can gradually propagate information from past frames to the current frame and transfer the current frame feature from object-agnostic to object-specific. However, the increase of object-specific information will inevitably lead to the loss of object-agnostic visual information in deep propagation layers. To solve such a problem and further facilitate the learning of visual embeddings, this paper proposes a Decoupling Features in Hierarchical Propagation (DeAOT) approach. Firstly, DeAOT decouples the hierarchical propagation of object-agnostic and object-specific embeddings by handling them in two independent branches. Secondly, to compensate for the additional computation from dual-branch propagation, we propose an efficient module for constructing hierarchical propagation, i.e., Gated Propagation Module, which is carefully designed with single-head attention. Extensive experiments show that DeAOT significantly outperforms AOT in both accuracy and efficiency. On YouTube-VOS, DeAOT can achieve 86.0 we achieve new state-of-the-art performance on four benchmarks, i.e., YouTube-VOS (86.2 (0.622). Project page: https://github.com/z-x-yang/AOT.

READ FULL TEXT
research
03/22/2022

Associating Objects with Scalable Transformers for Video Object Segmentation

This paper investigates how to realize better and more efficient embeddi...
research
07/05/2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation

The Associating Objects with Transformers (AOT) framework has exhibited ...
research
06/04/2021

Associating Objects with Transformers for Video Object Segmentation

This paper investigates how to realize better and more efficient embeddi...
research
07/22/2022

QueryProp: Object Query Propagation for High-Performance Video Object Detection

Video object detection has been an important yet challenging topic in co...
research
12/06/2021

Reliable Propagation-Correction Modulation for Video Object Segmentation

Error propagation is a general but crucial problem in online semi-superv...
research
10/23/2020

Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation

In this paper, we address several inadequacies of current video object s...
research
11/02/2021

Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective

Modern video object segmentation (VOS) algorithms have achieved remarkab...

Please sign up or login with your details

Forgot password? Click here to reset