Learning Instance Segmentation by Interaction

06/21/2018
by   Deepak Pathak, et al.
2

We present an approach for building an active agent that learns to segment its visual observations into individual objects by interacting with its environment in a completely self-supervised manner. The agent uses its current segmentation model to infer pixels that constitute objects and refines the segmentation model by interacting with these pixels. The model learned from over 50K interactions generalizes to novel objects and backgrounds. To deal with noisy training signal for segmenting objects obtained by self-supervised interactions, we propose robust set loss. A dataset of robot's interactions along-with a few human labeled examples is provided as a benchmark for future research. We test the utility of the learned segmentation model by providing results on a downstream vision-based control task of rearranging multiple objects into target configurations from visual inputs alone. Videos, code, and robotic interaction dataset are available at https://pathak22.github.io/seg-by-interaction/

READ FULL TEXT

page 3

page 7

page 8

page 9

page 12

page 13

research
05/19/2020

Self-supervised Transfer Learning for Instance Segmentation through Physical Interaction

Instance segmentation of unknown objects from images is regarded as rele...
research
05/10/2023

Self-Supervised Instance Segmentation by Grasping

Instance segmentation is a fundamental skill for many robotic applicatio...
research
02/07/2023

Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction

We introduce a novel robotic system for improving unseen object instance...
research
12/03/2020

Locating the source of interacting signal in complex networks

We investigate the problem of locating the source of a self-interacting ...
research
07/12/2020

Data-Efficient Reinforcement Learning with Momentum Predictive Representations

While deep reinforcement learning excels at solving tasks where large am...
research
08/01/2021

Visual Boundary Knowledge Translation for Foreground Segmentation

When confronted with objects of unknown types in an image, humans can ef...

Please sign up or login with your details

Forgot password? Click here to reset