Motion-state Alignment for Video Semantic Segmentation

04/18/2023
by Jinming Su, et al.

In recent years, video semantic segmentation has made great progress with advanced deep neural networks. However, two main challenges remain: information inconsistency and computation cost. To address both, we propose a novel motion-state alignment framework for video semantic segmentation that maintains both motion and state consistency. In the framework, we first construct a motion alignment branch equipped with an efficient decoupled transformer to capture dynamic semantics, guaranteeing region-level temporal consistency. Then, a state alignment branch composed of a stage transformer is designed to enrich the feature spaces of the current frame, extracting static semantics and achieving pixel-level state consistency. Next, through a semantic assignment mechanism, the region descriptor of each semantic category is obtained from the dynamic semantics and linked with pixel descriptors from the static semantics. Benefiting from the alignment of these two kinds of information, the proposed method captures dynamic and static semantics in a targeted way, so that video semantic regions are segmented consistently and located precisely at low computational complexity. Extensive experiments on the Cityscapes and CamVid datasets show that the proposed approach outperforms state-of-the-art methods and validate the effectiveness of the motion-state alignment framework.
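The abstract does not spell out how the semantic assignment mechanism is computed. As a rough illustration only, the sketch below shows one plausible reading: each category's region descriptor is pooled from dynamic features using soft class probabilities, and each pixel descriptor from the static features is then matched to the most similar region descriptor. The function names (`region_descriptors`, `assign_pixels`), the softmax-weighted pooling, and the dot-product similarity are all assumptions made for this example, not the authors' implementation.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def region_descriptors(dynamic_feats, class_logits):
    """Pool one descriptor per class (hypothetical helper).

    dynamic_feats: one feature vector per pixel (motion branch, assumed).
    class_logits:  one list of per-class scores per pixel (assumed).
    Returns one vector per class: the probability-weighted mean feature.
    """
    num_classes = len(class_logits[0])
    dim = len(dynamic_feats[0])
    probs = [softmax(logits) for logits in class_logits]
    regions = []
    for k in range(num_classes):
        weight = sum(p[k] for p in probs)  # always > 0 after softmax
        regions.append([
            sum(p[k] * f[d] for p, f in zip(probs, dynamic_feats)) / weight
            for d in range(dim)
        ])
    return regions


def assign_pixels(static_feats, regions):
    """Link each pixel descriptor to its most similar region descriptor.

    Similarity is a plain dot product (an assumption for this sketch).
    Returns one class index per pixel.
    """
    labels = []
    for feat in static_feats:
        scores = [sum(a * b for a, b in zip(feat, r)) for r in regions]
        labels.append(max(range(len(scores)), key=scores.__getitem__))
    return labels


# Toy example: four pixels, two classes, two feature channels.
dynamic = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
logits = [[5.0, 0.0], [5.0, 0.0], [0.0, 5.0], [0.0, 5.0]]
regions = region_descriptors(dynamic, logits)

static = [[0.9, 0.1], [0.2, 0.8]]
labels = assign_pixels(static, regions)
```

In this toy setup the region descriptor for class 0 leans toward the first channel and the one for class 1 toward the second, so the two static pixel descriptors are assigned to classes 0 and 1 respectively. A real model would operate on learned feature maps and batched tensors rather than Python lists.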

research
04/28/2022

Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation

Current semi-supervised semantic segmentation methods mainly focus on de...
research
05/29/2023

Alignment-free HDR Deghosting with Semantics Consistent Transformer

High dynamic range (HDR) imaging aims to retrieve information from multi...
research
06/15/2023

Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation

To overcome the domain gap between synthetic and real-world datasets, un...
research
10/11/2021

Domain Adaptive Semantic Segmentation with Regional Contrastive Consistency Regularization

Unsupervised domain adaptation (UDA) aims to bridge the domain shift bet...
research
04/14/2023

A Unified HDR Imaging Method with Pixel and Patch Level

Mapping Low Dynamic Range (LDR) images with different exposures to High ...
research
08/05/2018

Pixel-level Semantics Guided Image Colorization

While many image colorization algorithms have recently shown the capabil...
research
09/21/2023

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Panoramic videos contain richer spatial information and have attracted t...
