Towards Unbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation

07/31/2023
by   Tianshu Yu, et al.
0

Video portrait segmentation (VPS), aiming at segmenting prominent foreground portraits from video frames, has received much attention in recent years. However, simplicity of existing VPS datasets leads to a limitation on extensive research of the task. In this work, we propose a new intricate large-scale Multi-scene Video Portrait Segmentation dataset MVPS consisting of 101 video clips in 7 scenario categories, in which 10,843 sampled frames are finely annotated at pixel level. The dataset has diverse scenes and complicated background environments, which is the most complex dataset in VPS to our best knowledge. Through the observation of a large number of videos with portraits during dataset construction, we find that due to the joint structure of human body, motion of portraits is part-associated, which leads that different parts are relatively independent in motion. That is, motion of different parts of the portraits is unbalanced. Towards this unbalance, an intuitive and reasonable idea is that different motion states in portraits can be better exploited by decoupling the portraits into parts. To achieve this, we propose a Part-Decoupling Network (PDNet) for video portrait segmentation. Specifically, an Inter-frame Part-Discriminated Attention (IPDA) module is proposed which unsupervisely segments portrait into parts and utilizes different attentiveness on discriminative features specified to each different part. In this way, appropriate attention can be imposed to portrait parts with unbalanced motion to extract part-discriminated correlations, so that the portraits can be segmented more accurately. Experimental results demonstrate that our method achieves leading performance with the comparison to state-of-the-art methods.

READ FULL TEXT

page 1

page 3

page 5

page 8

page 9

research
03/11/2021

Triple-cooperative Video Shadow Detection

Shadow detection in a single image has received significant research int...
research
04/07/2020

Motion-supervised Co-Part Segmentation

Recent co-part segmentation methods mostly operate in a supervised learn...
research
03/07/2023

MOSO: Decomposing MOtion, Scene and Object for Video Prediction

Motion, scene and object are three primary visual components of a video....
research
11/30/2020

Adaptive Compact Attention For Few-shot Video-to-video Translation

This paper proposes an adaptive compact attention model for few-shot vid...
research
10/19/2021

NeuralDiff: Segmenting 3D objects that move in egocentric videos

Given a raw video sequence taken from a freely-moving camera, we study t...
research
03/10/2019

Shape2Motion: Joint Analysis of Motion Parts and Attributes from 3D Shapes

For the task of mobility analysis of 3D shapes, we propose joint analysi...
research
01/19/2020

See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks

We introduce a novel network, called CO-attention Siamese Network (COSNe...

Please sign up or login with your details

Forgot password? Click here to reset