Learning Video Instance Segmentation with Recurrent Graph Neural Networks

12/07/2020
by   Joakim Johnander, et al.
8

Most existing approaches to video instance segmentation comprise multiple modules that are heuristically combined to produce the final output. Formulating a purely learning-based method instead, which models both the temporal aspect as well as a generic track management required to solve the video instance segmentation task, is a highly challenging problem. In this work, we propose a novel learning formulation, where the entire video instance segmentation problem is modelled jointly. We fit a flexible model to our formulation that, with the help of a graph neural network, processes all available new information in each frame. Past information is considered and processed via a recurrent connection. We demonstrate the effectiveness of the proposed approach in comprehensive experiments. Our approach, operating at over 25 FPS, outperforms previous video real-time methods. We further conduct detailed ablative experiments that validate the different aspects of our approach.

READ FULL TEXT

page 1

page 3

page 7

page 8

page 13

research
03/07/2022

End-to-end video instance segmentation via spatial-temporal graph neural networks

Video instance segmentation is a challenging task that extends image ins...
research
01/04/2023

Object Segmentation with Audio Context

Visual objects often have acoustic signatures that are naturally synchro...
research
11/30/2021

PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images

Most state-of-the-art instance segmentation methods produce binary segme...
research
03/11/2021

Instance Segmentation GNNs for One-Shot Conformal Tracking at the LHC

3D instance segmentation remains a challenging problem in computer visio...
research
06/06/2018

Instance Segmentation and Tracking with Cosine Embeddings and Recurrent Hourglass Networks

Different to semantic segmentation, instance segmentation assigns unique...
research
07/28/2021

Improving Video Instance Segmentation via Temporal Pyramid Routing

Video Instance Segmentation (VIS) is a new and inherently multi-task pro...
research
09/25/2019

Rescan: Inductive Instance Segmentation for Indoor RGBD Scans

In depth-sensing applications ranging from home robotics to AR/VR, it wi...

Please sign up or login with your details

Forgot password? Click here to reset