RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation

12/11/2019
by   Shaoru Wang, et al.
21

Object detection and instance segmentation are two fundamental computer vision tasks. They are closely correlated but their relations have not yet been fully explored in most previous work. This paper presents RDSNet, a novel deep architecture for reciprocal object detection and instance segmentation. To reciprocate these two tasks, we design a two-stream structure to learn features on both the object level (i.e., bounding boxes) and the pixel level (i.e., instance masks) jointly. Within this structure, information from the two streams is fused alternately, namely information on the object level introduces the awareness of instance and translation variance to the pixel level, and information on the pixel level refines the localization accuracy of objects on the object level in return. Specifically, a correlation module and a cropping module are proposed to yield instance masks, as well as a mask based boundary refinement module for more accurate bounding boxes. Extensive experimental analyses and comparisons on the COCO dataset demonstrate the effectiveness and efficiency of RDSNet. The source code is available at https://github.com/wangsr126/RDSNet.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 8

research
06/15/2021

A Spacecraft Dataset for Detection, Segmentation and Parts Recognition

Virtually all aspects of modern life depend on space technology. Thanks ...
research
02/08/2022

SCR: Smooth Contour Regression with Geometric Priors

While object detection methods traditionally make use of pixel-level mas...
research
09/21/2018

Global Weighted Average Pooling Bridges Pixel-level Localization and Image-level Classification

In this work, we first tackle the problem of simultaneous pixel-level lo...
research
02/14/2020

Layered Embeddings for Amodal Instance Segmentation

The proposed method extends upon the representational output of semantic...
research
01/22/2021

Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

As a natural way for human-computer interaction, fixation provides a pro...
research
01/11/2019

FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction

The basic principles in designing convolutional neural network (CNN) str...
research
03/02/2023

Bayesian Deep Learning for Affordance Segmentation in images

Affordances are a fundamental concept in robotics since they relate avai...

Please sign up or login with your details

Forgot password? Click here to reset