BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection

03/31/2022
by Junjie Huang, et al.

Single-frame data contains limited information, which constrains the performance of existing vision-based multi-camera 3D object detection paradigms. To fundamentally push the performance boundary in this area, BEVDet4D is proposed to lift the scalable BEVDet paradigm from the spatial-only 3D space to the spatial-temporal 4D space. We upgrade the framework with a few modifications that fuse the feature from the previous frame with the corresponding one in the current frame. In this way, with a negligible extra computing budget, we enable the algorithm to access temporal cues by querying and comparing the two candidate features. Beyond this, we also simplify the velocity learning task by removing the factors of ego-motion and time, which equips BEVDet4D with robust generalization performance and reduces the velocity error by 52.8%. This makes vision-based methods, for the first time, comparable with those that rely on LiDAR or radar in this aspect. On the challenging nuScenes benchmark, we report a new record of 51.5% NDS with the high-performance configuration dubbed BEVDet4D-Base, which surpasses the previous leading method BEVDet by +4.3% NDS.
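The idea of removing ego-motion and time from the velocity target can be illustrated with a small sketch. This is one reading of the abstract, not the authors' implementation: it assumes 2D poses of the form (x, y, yaw), and all function names (global_to_ego, naive_offset, translation_target, velocity_from_translation) are hypothetical. The sketch contrasts an object offset measured across two unaligned ego frames, which mixes in the ego vehicle's own motion, with an offset measured in a single (current) ego frame, which depends only on the object's motion. Dividing that offset by the frame interval then recovers velocity.

```python
import math

def global_to_ego(point, ego_pose):
    """Express a global-frame 2D point in an ego frame given as (x, y, yaw)."""
    ex, ey, yaw = ego_pose
    dx, dy = point[0] - ex, point[1] - ey
    c, s = math.cos(yaw), math.sin(yaw)
    # Rotate by -yaw to go from global axes to ego axes.
    return (c * dx + s * dy, -s * dx + c * dy)

def naive_offset(obj_prev, obj_curr, ego_prev, ego_curr):
    """Offset when each position is expressed in its own ego frame.

    This quantity still contains the ego vehicle's motion.
    """
    px, py = global_to_ego(obj_prev, ego_prev)
    cx, cy = global_to_ego(obj_curr, ego_curr)
    return (cx - px, cy - py)

def translation_target(obj_prev, obj_curr, ego_curr):
    """Offset with both positions expressed in the current ego frame.

    Ego-motion cancels out, so a static object always yields (0, 0).
    """
    px, py = global_to_ego(obj_prev, ego_curr)
    cx, cy = global_to_ego(obj_curr, ego_curr)
    return (cx - px, cy - py)

def velocity_from_translation(translation, dt):
    """Recover velocity from a per-frame translation and the frame interval dt (seconds)."""
    return (translation[0] / dt, translation[1] / dt)

if __name__ == "__main__":
    ego_prev = (0.0, 0.0, 0.0)
    ego_curr = (2.0, 0.0, 0.0)   # ego drove 2 m forward between frames
    static_obj = (10.0, 0.0)     # parked object, same global position in both frames

    print("naive offset:  ", naive_offset(static_obj, static_obj, ego_prev, ego_curr))
    print("aligned offset:", translation_target(static_obj, static_obj, ego_curr))

    # An object that moved 1 m along global x during a 0.5 s interval.
    t = translation_target((10.0, 0.0), (11.0, 0.0), ego_curr)
    print("velocity:", velocity_from_translation(t, dt=0.5))
```

Under these assumptions, a learning target defined as the aligned translation does not change with how fast the ego vehicle drives or with the sampling interval between frames, which is consistent with the generalization claim in the abstract.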


