Active Data Acquisition in Autonomous Driving Simulation

06/24/2023
by   Jianyu Lai, et al.
0

Autonomous driving algorithms rely heavily on learning-based models, which require large datasets for training. However, there is often a large amount of redundant information in these datasets, while collecting and processing these datasets can be time-consuming and expensive. To address this issue, this paper proposes the concept of an active data-collecting strategy. For high-quality data, increasing the collection density can improve the overall quality of the dataset, ultimately achieving similar or even better results than the original dataset with lower labeling costs and smaller dataset sizes. In this paper, we design experiments to verify the quality of the collected dataset and to demonstrate this strategy can significantly reduce labeling costs and dataset size while improving the overall quality of the dataset, leading to better performance of autonomous driving systems. The source code implementing the proposed approach is publicly available on https://github.com/Th1nkMore/carla_dataset_tools.

READ FULL TEXT
research
04/01/2021

Perspective, Survey and Trends: Public Driving Datasets and Toolsets for Autonomous Driving Virtual Test

Owing to the merits of early safety and reliability guarantee, autonomou...
research
04/24/2023

Synthetic Datasets for Autonomous Driving: A Survey

Autonomous driving techniques have been flourishing in recent years whil...
research
07/04/2022

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Given a small training data set and a learning algorithm, how much more ...
research
09/06/2022

SIND: A Drone Dataset at Signalized Intersection in China

Intersection is one of the most challenging scenarios for autonomous dri...
research
02/15/2022

Sim-to-Real Domain Adaptation for Lane Detection and Classification in Autonomous Driving

While supervised detection and classification frameworks in autonomous d...
research
05/29/2023

Synfeal: A Data-Driven Simulator for End-to-End Camera Localization

Collecting real-world data is often considered the bottleneck of Artific...
research
04/27/2020

The Gutenberg Dialogue Dataset

Large datasets are essential for many NLP tasks. Current publicly availa...

Please sign up or login with your details

Forgot password? Click here to reset