The Impact of Different Backbone Architecture on Autonomous Vehicle Dataset

by   Ning Ding, et al.

Object detection is a crucial component of autonomous driving, and many detection applications have been developed to address this task. These applications often rely on backbone architectures, which extract representation features from inputs to perform the object detection task. The quality of the features extracted by the backbone architecture can have a significant impact on the overall detection performance. Many researchers have focused on developing new and improved backbone architectures to enhance the efficiency and accuracy of object detection applications. While these backbone architectures have shown state-of-the-art performance on generic object detection datasets like MS-COCO and PASCAL-VOC, evaluating their performance under an autonomous driving environment has not been previously explored. To address this, our study evaluates three well-known autonomous vehicle datasets, namely KITTI, NuScenes, and BDD, to compare the performance of different backbone architectures on object detection tasks.


page 1

page 2

page 3

page 4


A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation

3D object detection using LiDAR data is an indispensable component for a...

An Empirical Study of Adder Neural Networks for Object Detection

Adder neural networks (AdderNets) have shown impressive performance on i...

Transformation-Equivariant 3D Object Detection for Autonomous Driving

3D object detection received increasing attention in autonomous driving ...

DuEqNet: Dual-Equivariance Network in Outdoor 3D Object Detection for Autonomous Driving

Outdoor 3D object detection has played an essential role in the environm...

Q-YOLOP: Quantization-aware You Only Look Once for Panoptic Driving Perception

In this work, we present an efficient and quantization-aware panoptic dr...

It's All Around You: Range-Guided Cylindrical Network for 3D Object Detection

Modern perception systems in the field of autonomous driving rely on 3D ...

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

Convolutional neural networks typically encode an input image into a ser...

Please sign up or login with your details

Forgot password? Click here to reset