SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks

03/30/2021
by   Zoe Landgraf, et al.
8

By estimating 3D shape and instances from a single view, we can capture information about an environment quickly, without the need for comprehensive scanning and multi-view fusion. Solving this task for composite scenes (such as object stacks) is challenging: occluded areas are not only ambiguous in shape but also in instance segmentation; multiple decompositions could be valid. We observe that physics constrains decomposition as well as shape in occluded regions and hypothesise that a latent space learned from scenes built under physics simulation can serve as a prior to better predict shape and instances in occluded regions. To this end we propose SIMstack, a depth-conditioned Variational Auto-Encoder (VAE), trained on a dataset of objects stacked under physics simulation. We formulate instance segmentation as a centre voting task which allows for class-agnostic detection and doesn't require setting the maximum number of objects in the scene. At test time, our model can generate 3D shape and instance segmentation from a single depth view, probabilistically sampling proposals for the occluded region from the learned latent space. Our method has practical applications in providing robots some of the ability humans have to make rapid intuitive inferences of partially observed scenes. We demonstrate an application for precise (non-disruptive) object grasping of unknown objects from a single depth view.

READ FULL TEXT

page 1

page 5

page 7

page 8

page 13

page 15

page 16

research
12/10/2020

Amodal Segmentation Based on Visible Region Segmentation and Shape Prior

Almost all existing amodal segmentation methods make the inferences of o...
research
11/27/2020

Descriptor-Free Multi-View Region Matching for Instance-Wise 3D Reconstruction

This paper proposes a multi-view extension of instance segmentation with...
research
01/21/2020

Instance Segmentation of Visible and Occluded Regions for Finding and Picking Target from a Pile of Objects

We present a robotic system for picking a target from a pile of objects ...
research
12/29/2020

FPCC-Net: Fast Point Cloud Clustering for Instance Segmentation

Instance segmentation is an important pre-processing task in numerous re...
research
01/21/2023

Time-Conditioned Generative Modeling of Object-Centric Representations for Video Decomposition and Prediction

When perceiving the world from multiple viewpoints, humans have the abil...
research
04/01/2021

Fusing RGBD Tracking and Segmentation Tree Sampling for Multi-Hypothesis Volumetric Segmentation

Despite rapid progress in scene segmentation in recent years, 3D segment...
research
04/14/2013

Single View Depth Estimation from Examples

We describe a non-parametric, "example-based" method for estimating the ...

Please sign up or login with your details

Forgot password? Click here to reset