Geometry Uncertainty Projection Network for Monocular 3D Object Detection

07/29/2021
by   Yan Lu, et al.

Geometry projection is a powerful depth estimation method in monocular 3D object detection. It estimates depth from object heights, which introduces mathematical priors into the deep model. However, the projection process also introduces an error amplification problem, in which an error in the estimated height is amplified and strongly reflected in the output depth. This property leads to uncontrollable depth inferences and also damages training efficiency. In this paper, we propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both the inference and training stages. Specifically, a GUP module is proposed to obtain the geometry-guided uncertainty of the inferred depth, which not only provides a highly reliable confidence for each depth but also benefits depth learning. Furthermore, at the training stage, we propose a Hierarchical Task Learning strategy to reduce the instability caused by error amplification. This learning algorithm monitors the learning situation of each task through a proposed indicator and adaptively assigns proper loss weights to different tasks according to the situation of their pre-tasks. As a result, each task starts learning only when its pre-tasks are learned well, which can significantly improve the stability and efficiency of the training process. Extensive experiments demonstrate the effectiveness of the proposed method. The overall model can infer more reliable object depth than existing methods and outperforms the state-of-the-art image-based monocular 3D detectors by 3.74% and 4.7% AP40 on the car and pedestrian categories of the KITTI benchmark.
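To make the error amplification concrete, here is a minimal sketch of height-based depth projection under the standard pinhole camera model, where depth is d = f * H_3d / h_2d. It is an illustration of the general geometry only, not the paper's implementation; the function names, the focal length, and the object sizes are all hypothetical values chosen for the example.

```python
def project_depth(focal_px: float, height_3d_m: float, height_2d_px: float) -> float:
    """Pinhole-model depth from heights: d = f * H_3d / h_2d."""
    if height_2d_px <= 0:
        raise ValueError("2D height must be positive")
    return focal_px * height_3d_m / height_2d_px


def depth_error_from_height_error(
    focal_px: float, height_2d_px: float, height_err_m: float
) -> float:
    """Depth is linear in H_3d, so a 3D-height error is scaled by f / h_2d."""
    if height_2d_px <= 0:
        raise ValueError("2D height must be positive")
    return focal_px / height_2d_px * height_err_m


# Hypothetical numbers (assumptions, not taken from the paper).
FOCAL_PX = 720.0      # assumed focal length in pixels
HEIGHT_3D_M = 1.5     # assumed physical object height in metres
HEIGHT_2D_PX = 36.0   # assumed projected height in the image, in pixels

depth_m = project_depth(FOCAL_PX, HEIGHT_3D_M, HEIGHT_2D_PX)
depth_err_m = depth_error_from_height_error(FOCAL_PX, HEIGHT_2D_PX, 0.1)

print(f"depth: {depth_m:.1f} m")
print(f"depth error for a 0.1 m height error: {depth_err_m:.1f} m")
```

With these assumed values, the amplification factor f / h_2d is 20, so a 0.1 m error in the estimated 3D height becomes a 2 m error in depth. The factor grows as the object's projected height shrinks, which is why distant objects are affected most.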


research
07/29/2021

Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection

As a crucial task of autonomous driving, 3D object detection has made gr...
research
05/23/2019

Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints

We propose Shift R-CNN, a hybrid model for monocular 3D object detection...
research
05/19/2022

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

As an inherently ill-posed problem, depth estimation from single images ...
research
07/28/2021

Aug3D-RPN: Improving Monocular 3D Object Detection by Synthetic Images with Virtual Depth

Current geometry-based monocular 3D object detection models can efficien...
research
07/20/2022

Densely Constrained Depth Estimator for Monocular 3D Object Detection

Estimating accurate 3D locations of objects from monocular images is a c...
research
11/30/2020

Monocular 3D Object Detection with Sequential Feature Association and Depth Hint Augmentation

Monocular 3D object detection is a promising research topic for the inte...
research
06/08/2022

Learning Ego 3D Representation as Ray Tracing

A self-driving perception model aims to extract 3D semantic representati...
