Enforcing geometric constraints of virtual normal for depth prediction

07/29/2019
by   Yin Wei, et al.
6

Monocular depth prediction plays a crucial role in understanding 3D scene geometry. Although recent methods have achieved impressive progress in evaluation metrics such as the pixel-wise relative error, most methods neglect the geometric constraints in the 3D space. In this work, we show the importance of the high-order 3D geometric constraints for depth prediction. By designing a loss term that enforces one simple type of geometric constraints, namely, virtual normal directions determined by randomly sampled three points in the reconstructed 3D space, we can considerably improve the depth prediction accuracy. Significantly, the byproduct of this predicted depth being sufficiently accurate is that we are now able to recover good 3D structures of the scene such as the point cloud and surface normal directly from the depth, eliminating the necessity of training new sub-models as was previously done. Experiments on two benchmarks: NYU Depth-V2 and KITTI demonstrate the effectiveness of our method and state-of-the-art performance.

READ FULL TEXT

page 1

page 6

page 7

page 8

page 10

page 11

page 12

page 13

research
12/13/2020

GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation

In this paper, we propose a geometric neural network with edge-aware ref...
research
12/17/2020

Learning to Recover 3D Scene Shape from a Single Image

Despite significant progress in monocular depth estimation in the wild, ...
research
02/13/2023

VA-DepthNet: A Variational Approach to Single Image Depth Prediction

We introduce VA-DepthNet, a simple, effective, and accurate deep neural ...
research
11/29/2021

Instance-wise Occlusion and Depth Orders in Natural Scenes

In this paper, we introduce a new dataset, named InstaOrder, that can be...
research
12/20/2018

SfMLearner++: Learning Monocular Depth & Ego-Motion using Meaningful Geometric Constraints

Most geometric approaches to monocular Visual Odometry (VO) provide robu...
research
04/24/2023

D2NT: A High-Performing Depth-to-Normal Translator

Surface normal holds significant importance in visual environmental perc...
research
05/13/2022

Monocular Human Digitization via Implicit Re-projection Networks

We present an approach to generating 3D human models from images. The ke...

Please sign up or login with your details

Forgot password? Click here to reset