PV-SSD: A Projection and Voxel-based Double Branch Single-Stage 3D Object Detector

08/13/2023
by   Yongxin Shao, et al.
0

LIDAR-based 3D object detection and classification is crucial for autonomous driving. However, inference in real-time from extremely sparse 3D data poses a formidable challenge. To address this issue, a common approach is to project point clouds onto a bird's-eye or perspective view, effectively converting them into an image-like data format. However, this excessive compression of point cloud data often leads to the loss of information. This paper proposes a 3D object detector based on voxel and projection double branch feature extraction (PV-SSD) to address the problem of information loss. We add voxel features input containing rich local semantic information, which is fully fused with the projected features in the feature extraction stage to reduce the local information loss caused by projection. A good performance is achieved compared to the previous work. In addition, this paper makes the following contributions: 1) a voxel feature extraction method with variable receptive fields is proposed; 2) a feature point sampling method by weight sampling is used to filter out the feature points that are more conducive to the detection task; 3) the MSSFA module is proposed based on the SSFA module. To verify the effectiveness of our method, we designed comparison experiments.

READ FULL TEXT

page 2

page 3

page 7

page 13

page 14

research
06/09/2020

Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection

3D object detection has become an emerging task in autonomous driving sc...
research
03/10/2022

Point Density-Aware Voxels for LiDAR 3D Object Detection

LiDAR has become one of the primary 3D object detection sensors in auton...
research
11/17/2017

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Accurate detection of objects in 3D point clouds is a central problem in...
research
02/29/2020

HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection

We present Hybrid Voxel Network (HVNet), a novel one-stage unified netwo...
research
09/01/2018

VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes

Voxel is an important format to represent geometric data, which has been...
research
08/31/2023

MS23D: A 3D Object Detection Method Using Multi-Scale Semantic Feature Points to Construct 3D Feature Layers

Lidar point clouds, as a type of data with accurate distance perception,...
research
08/13/2021

CNN-based Two-Stage Parking Slot Detection Using Region-Specific Multi-Scale Feature Extraction

Autonomous parking systems start with the detection of available parking...

Please sign up or login with your details

Forgot password? Click here to reset