NamedMask: Distilling Segmenters from Complementary Foundation Models

09/22/2022
by Gyungin Shin, et al.

The goal of this work is to segment and name regions of images without access to pixel-level labels during training. To tackle this task, we construct segmenters by distilling the complementary strengths of two foundation models. The first, CLIP (Radford et al. 2021), exhibits the ability to assign names to image content but lacks an accessible representation of object structure. The second, DINO (Caron et al. 2021), captures the spatial extent of objects but has no knowledge of object names. Our method, termed NamedMask, begins by using CLIP to construct category-specific archives of images. These images are pseudo-labelled with a category-agnostic salient object detector bootstrapped from DINO, then refined by category-specific segmenters using the CLIP archive labels. Thanks to the high quality of the refined masks, we show that a standard segmentation architecture trained on these archives with appropriate data augmentation achieves impressive semantic segmentation abilities for both single-object and multi-object images. As a result, our proposed NamedMask performs favourably against a range of prior work on five benchmarks including the VOC2012, COCO and large-scale ImageNet-S datasets.
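The first and last bookkeeping steps of the pipeline described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes image and text embeddings have already been computed by a CLIP-style model and are supplied as NumPy arrays, and that a binary saliency mask is already available for each archived image. The function names `build_archives` and `name_mask`, the top-k selection rule, and the label convention (0 for background) are hypothetical choices made for this example. The DINO-based salient object detector, the category-specific refinement, and the final segmenter training are not shown.

```python
import numpy as np


def build_archives(image_embs, text_embs, k):
    """Assumed archive rule: for each category, keep the k images whose
    embedding has the highest cosine similarity to the category's text embedding.

    image_embs: (N, D) array of precomputed image embeddings.
    text_embs:  (C, D) array of precomputed category-name embeddings.
    Returns a dict mapping category index -> list of image indices.
    """
    # L2-normalise so the dot product equals cosine similarity.
    img = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    sim = txt @ img.T  # shape (C, N)
    # Sort each row by descending similarity and keep the first k indices.
    top_k = np.argsort(-sim, axis=1)[:, :k]
    return {c: top_k[c].tolist() for c in range(sim.shape[0])}


def name_mask(saliency_mask, category_id, background_id=0):
    """Turn a category-agnostic binary mask into a semantic pseudo-label
    by writing the archive's category id into the foreground pixels."""
    return np.where(saliency_mask.astype(bool), category_id, background_id)


# Toy data: four images and two categories in a 2-D embedding space.
image_embs = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
text_embs = np.array([[1.0, 0.0], [0.0, 1.0]])
archives = build_archives(image_embs, text_embs, k=2)

# Label a toy 2x2 saliency mask with category id 5 (an arbitrary example id).
pseudo_label = name_mask(np.array([[0, 1], [1, 0]]), category_id=5)
```

In this sketch every archived image inherits a single category name, so a binary mask is enough to produce a semantic pseudo-label for that image.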


