Hole-robust Wireframe Detection

by   Naejin Kong, et al.

"Wireframe" is a line segment based representation designed to well capture large-scale visual properties of regular, structural shaped man-made scenes surrounding us. Unlike the wireframes, conventional edges or line segments focus on all visible edges and lines without particularly distinguishing which of them are more salient to man-made structural information. Existing wireframe detection models rely on supervising the annotated data but do not explicitly pay attention to understand how to compose the structural shapes of the scene. In addition, we often face that many foreground objects occluding the background scene interfere with proper inference of the full scene structure behind them. To resolve these problems, we first time in the field, propose new conditional data generation and training that help the model understand how to ignore occlusion indicated by holes, such as foreground object regions masked out on the image. In addition, we first time combine GAN in the model to let the model better predict underlying scene structure even beyond large holes. We also introduce pseudo labeling to further enlarge the model capacity to overcome small-scale labeled data. We show qualitatively and quantitatively that our approach significantly outperforms previous works unable to handle holes, as well as improves ordinary detection without holes given.


page 8

page 34

page 35

page 36

page 37

page 39

page 40

page 41


Line as object: datasets and framework for semantic line segment detection

In this work, we propose a learning-based approach to the task of detect...

Salient Object Detection with Purificatory Mechanism and Structural Similarity Loss

By the aid of attention mechanisms to weight the image features adaptive...

Structure-measure: A New Way to Evaluate Foreground Maps

Foreground map evaluation is crucial for gauging the progress of object ...

Semantic Attention Flow Fields for Dynamic Scene Decomposition

We present SAFF: a dynamic neural volume reconstruction of a casual mono...

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

Controllable scene synthesis consists of generating 3D information that ...

Sharp Eyes: A Salient Object Detector Working The Same Way as Human Visual Characteristics

Current methods aggregate multi-level features or introduce edge and ske...

Please sign up or login with your details

Forgot password? Click here to reset