Shatter and Gather: Learning Referring Image Segmentation with Text Supervision

08/29/2023
by   Dongwon Kim, et al.
0

Referring image segmentation, the task of segmenting any arbitrary entities described in free-form texts, opens up a variety of vision applications. However, manual labeling of training data for this task is prohibitively costly, leading to lack of labeled data for training. We address this issue by a weakly supervised learning approach using text descriptions of training images as the only source of supervision. To this end, we first present a new model that discovers semantic entities in input image and then combines such entities relevant to text query to predict the mask of the referent. We also present a new loss function that allows the model to be trained without any further supervision. Our method was evaluated on four public benchmarks for referring image segmentation, where it clearly outperformed the existing method for the same task and recent open-vocabulary segmentation models on all the benchmarks.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

page 14

page 15

page 16

research
02/09/2015

Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation

Deep convolutional neural networks (DCNNs) trained on a large number of ...
research
10/12/2021

Weakly-Supervised Semantic Segmentation by Learning Label Uncertainty

Since the rise of deep learning, many computer vision tasks have seen si...
research
01/12/2023

Guiding Text-to-Image Diffusion Model Towards Grounded Generation

The goal of this paper is to augment a pre-trained text-to-image diffusi...
research
08/28/2023

Referring Image Segmentation Using Text Supervision

Existing Referring Image Segmentation (RIS) methods typically require ex...
research
03/19/2016

Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation

We introduce a new loss function for the weakly-supervised training of s...
research
03/31/2022

ReSTR: Convolution-free Referring Image Segmentation Using Transformers

Referring image segmentation is an advanced semantic segmentation task w...
research
12/18/2021

Prompt-Based Multi-Modal Image Segmentation

Image segmentation is usually addressed by training a model for a fixed ...

Please sign up or login with your details

Forgot password? Click here to reset