DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

04/06/2022
by   Yilun Chen, et al.
0

Camera-based 3D object detectors are welcome due to their wider deployment and lower price than LiDAR sensors. We revisit the prior stereo modeling DSGN about the stereo volume constructions for representing both 3D geometry and semantics. We polish the stereo modeling and propose our approach, DSGN++, aiming for improving information flow throughout the 2D-to-3D pipeline in the following three main aspects. First, to effectively lift the 2D information to stereo volume, we propose depth-wise plane sweeping (DPS) that allows denser connections and extracts depth-guided features. Second, for better grasping differently spaced features, we present a novel stereo volume – Dual-view Stereo Volume (DSV) that integrates front-view and top-view features and reconstructs sub-voxel depth in the camera frustum. Third, as the foreground region becomes less dominant in 3D space, we firstly propose a multi-modal data editing strategy – Stereo-LiDAR Copy-Paste, which ensures cross-modal alignment and improves data efficiency. Without bells and whistles, extensive experiments in various modality setups on the popular KITTI benchmark show that our method consistently outperforms other camera-based 3D detectors for all categories. Code will be released at https://github.com/chenyilun95/DSGN2.

READ FULL TEXT

page 4

page 5

page 6

page 11

research
12/28/2022

TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning

To achieve accurate and low-cost 3D object detection, existing methods p...
research
01/10/2020

DSGN: Deep Stereo Geometry Network for 3D Object Detection

Most state-of-the-art 3D object detectors heavily rely on LiDAR sensors ...
research
08/18/2021

LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector

Stereo-based 3D detection aims at detecting 3D object bounding boxes fro...
research
04/05/2019

3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization

The complementary characteristics of active and passive depth sensing te...
research
05/08/2022

Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo

Recent cost volume pyramid based deep neural networks have unlocked the ...
research
04/15/2022

MVSTER: Epipolar Transformer for Efficient Multi-View Stereo

Learning-based Multi-View Stereo (MVS) methods warp source images into t...
research
09/19/2022

SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity

The development and application of modern technology is an essential bas...

Please sign up or login with your details

Forgot password? Click here to reset