Parsing Geometry Using Structure-Aware Shape Templates

Real-life man-made objects often exhibit strong and easily identifiable structure, as a direct result of their design or their intended functionality. Structure typically appears in the form of individual parts and their arrangement. Knowledge of object structure can be an important cue for object recognition and scene understanding, a key goal for various AR and robotics applications. However, the commodity RGB-D sensors used in these scenarios produce only raw, unorganized point clouds, without structural information about the captured scene. Moreover, the generated data is commonly partial and susceptible to artifacts and noise, which makes inferring the structure of scanned objects challenging. In this paper, we organize large shape collections into parameterized shape templates to capture the underlying structure of the objects. The templates allow us to transfer the structural information onto new objects and incomplete scans. We employ a deep neural network that matches the partial scan with one of the shape templates, and we then match and fit it to complete and detailed models from the collection. This allows us to faithfully label the parts of the scanned object and to guide its reconstruction. We showcase the effectiveness of our method by comparing it to other state-of-the-art approaches.


