Learning to Parse Wireframes in Images of Man-Made Environments

07/15/2020
by Kun Huang, et al.

In this paper, we propose a learning-based approach to automatically extracting a "wireframe" representation from images of cluttered man-made environments. The wireframe (see Fig. 1) contains all salient straight lines in the scene and their junctions, which together encode large-scale geometry and object shapes efficiently and accurately. To this end, we have built a large new dataset of over 5,000 images with wireframes thoroughly labelled by humans. We propose two convolutional neural networks, one suited to extracting junctions and the other to extracting lines with large spatial support. Trained on our dataset, the networks achieve significantly better performance than state-of-the-art methods for junction detection and line segment detection, respectively. We have conducted extensive experiments to evaluate the wireframes obtained by our method, both quantitatively and qualitatively, and the results show that effectively and efficiently parsing wireframes for images of man-made environments is a feasible goal within reach. Such wireframes could benefit many important visual tasks such as feature correspondence, 3D reconstruction, vision-based mapping, localization, and navigation. The data and source code are available at https://github.com/huangkuns/wireframe.
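As an illustration of the representation described above, a wireframe can be viewed as a small graph whose vertices are junctions and whose edges are straight line segments. The sketch below is a minimal, hypothetical container written for this summary; the class name `Wireframe`, its fields, and its helper methods are assumptions and are not taken from the authors' code or dataset format.

```python
from dataclasses import dataclass, field
from math import hypot
from typing import List, Tuple


@dataclass
class Wireframe:
    """Hypothetical wireframe container (not the authors' format).

    junctions: (x, y) pixel coordinates of each junction.
    lines: pairs of indices into `junctions`, one pair per line segment.
    """
    junctions: List[Tuple[float, float]] = field(default_factory=list)
    lines: List[Tuple[int, int]] = field(default_factory=list)

    def line_length(self, k: int) -> float:
        """Euclidean length in pixels of the k-th line segment."""
        i, j = self.lines[k]
        (x1, y1), (x2, y2) = self.junctions[i], self.junctions[j]
        return hypot(x2 - x1, y2 - y1)

    def degree(self, i: int) -> int:
        """Number of line segments that end at junction i."""
        return sum(i in line for line in self.lines)


# An L-shaped corner: two segments that share junction 1.
wf = Wireframe(
    junctions=[(0.0, 0.0), (3.0, 0.0), (3.0, 4.0)],
    lines=[(0, 1), (1, 2)],
)
```

Storing lines as index pairs rather than raw endpoint coordinates keeps the junction-line incidence explicit, which is the property that distinguishes a wireframe from an unordered set of detected segments.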

research
05/08/2019

End-to-End Wireframe Parsing

We present a conceptually simple yet effective algorithm to detect wiref...
research
11/06/2020

ULSD: Unified Line Segment Detection across Pinhole, Fisheye, and Spherical Cameras

Line segment detection is essential for high-level tasks in computer vis...
research
03/27/2023

3D Video Object Detection with Learnable Object-Centric Global Optimization

We explore long-term temporal visual correspondence-based optimization f...
research
07/10/2023

SAM-IQA: Can Segment Anything Boost Image Quality Assessment?

Image Quality Assessment (IQA) is a challenging task that requires train...
research
10/14/2021

SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation

We present SGoLAM, short for simultaneous goal localization and mapping,...
research
03/30/2023

3D Line Mapping Revisited

In contrast to sparse keypoints, a handful of line segments can concisel...
research
04/10/2023

Exploring Effective Factors for Improving Visual In-Context Learning

The In-Context Learning (ICL) is to understand a new task via a few demo...
