Single-Stage Monocular 3D Object Detection with Virtual Cameras

12/17/2019
by Andrea Simonelli, et al.

While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object detection methods, monocular RGB-only approaches still lag significantly behind. Our work advances the state of the art by introducing MoVi-3D, a novel, single-stage deep architecture for monocular 3D object detection. At its core, MoVi-3D leverages geometrical information to generate synthetic views from virtual cameras at both training and test time, resulting in object appearance that is normalized with respect to distance. These synthetically generated views facilitate the detection task because they reduce the variability in visual appearance associated with objects placed at different distances from the camera. As a consequence, the deep model is relieved from learning depth-specific representations and its complexity can be significantly reduced. In particular, we show that our proposed concept of exploiting virtual cameras enables us to set new state-of-the-art results on the popular KITTI3D benchmark using just a lightweight, single-stage architecture.
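
As a rough illustration of why distance-normalized views reduce appearance variability, the sketch below uses only the standard pinhole relation (pixel height = focal length × metric height / depth). It is not MoVi-3D's actual virtual-camera procedure, which the abstract does not detail; the function names, the focal length, the object height, and the reference depth are all hypothetical values chosen for the example.

```python
def apparent_height_px(focal_px: float, object_height_m: float, depth_m: float) -> float:
    """Pinhole projection: pixel height of an object of a given metric height and depth."""
    if depth_m <= 0:
        raise ValueError("depth_m must be positive")
    return focal_px * object_height_m / depth_m


def rescale_factor(depth_m: float, reference_depth_m: float) -> float:
    """Image scale factor that makes an object at depth_m appear as if at reference_depth_m."""
    if depth_m <= 0 or reference_depth_m <= 0:
        raise ValueError("depths must be positive")
    return depth_m / reference_depth_m


def normalized_heights(focal_px, object_height_m, depths_m, reference_depth_m):
    """Apparent heights after rescaling each view to the shared reference depth."""
    return [
        apparent_height_px(focal_px, object_height_m, d) * rescale_factor(d, reference_depth_m)
        for d in depths_m
    ]


# Hypothetical numbers: a 1.5 m tall object seen at three different depths.
FOCAL_PX = 700.0
HEIGHT_M = 1.5
DEPTHS_M = [10.0, 20.0, 40.0]
REFERENCE_DEPTH_M = 20.0

raw = [apparent_height_px(FOCAL_PX, HEIGHT_M, d) for d in DEPTHS_M]
normalized = normalized_heights(FOCAL_PX, HEIGHT_M, DEPTHS_M, REFERENCE_DEPTH_M)
```

In this toy setting the raw pixel heights vary by a factor of four across the three depths, while the rescaled heights coincide, which is the kind of variability a detector would otherwise have to learn to handle.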


research
04/19/2022

Shape-Aware Monocular 3D Object Detection

The detection of 3D objects through a single perspective camera is a cha...
research
03/08/2020

Monocular 3D Object Detection in Cylindrical Images from Fisheye Cameras

Detecting objects in 3D from a monocular camera has been successfully de...
research
02/06/2021

Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues

Today's state-of-the-art methods for 3D object detection are based on li...
research
11/02/2021

Absolute distance prediction based on deep learning object detection and monocular depth estimation models

Determining the distance between the objects in a scene and the camera s...
research
08/22/2022

STS: Surround-view Temporal Stereo for Multi-view 3D Detection

Learning accurate depth is essential to multi-view 3D object detection. ...
research
07/28/2017

The WILDTRACK Multi-Camera Person Dataset

People detection methods are highly sensitive to the perpetual occlusion...
research
05/29/2023

Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites

Accident of struck-by machines is one of the leading causes of casualtie...
