Deflating Dataset Bias Using Synthetic Data Augmentation

by Nikita Jaipuria, et al.

Deep learning has seen an unprecedented rise in vision applications since the publication of large-scale object recognition datasets and the introduction of scalable compute hardware. State-of-the-art methods for most vision tasks for Autonomous Vehicles (AVs) rely on supervised learning and often fail to generalize to domain shifts and/or outliers. Dataset diversity is thus key to successful real-world deployment. No matter how large the dataset, capturing the long tails of the distribution pertaining to task-specific environmental factors is impractical. The goal of this paper is to investigate the use of targeted synthetic data augmentation - combining the benefits of gaming engine simulations and sim2real style transfer techniques - for filling gaps in real datasets for vision tasks. Empirical studies on three different computer vision tasks of practical use to AVs - parking slot detection, lane detection and monocular depth estimation - consistently show that, for the same training set size, having synthetic data in the training mix provides a significant boost in cross-dataset generalization performance compared to training on real data only.
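The comparison described above holds the training set size fixed while changing its composition. The following is a minimal sketch of how such a mix could be assembled. It is not the authors' pipeline: the function name `build_training_mix`, the `synthetic_fraction` parameter and the placeholder samples are all hypothetical, introduced only for illustration.

```python
import random
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def build_training_mix(
    real: Sequence[T],
    synthetic: Sequence[T],
    total_size: int,
    synthetic_fraction: float,
    seed: int = 0,
) -> List[T]:
    """Return a shuffled training set of exactly `total_size` samples.

    `synthetic_fraction` of the samples are drawn from `synthetic`, the
    rest from `real`. Both the name and the parameter are hypothetical;
    the abstract does not specify a mixing ratio.
    """
    if not 0.0 <= synthetic_fraction <= 1.0:
        raise ValueError("synthetic_fraction must lie in [0, 1]")

    n_synthetic = round(total_size * synthetic_fraction)
    n_real = total_size - n_synthetic
    if n_real > len(real) or n_synthetic > len(synthetic):
        raise ValueError("not enough samples to build the requested mix")

    rng = random.Random(seed)
    # Sample without replacement from each source, then shuffle so that
    # real and synthetic samples are interleaved during training.
    mix = rng.sample(list(real), n_real) + rng.sample(list(synthetic), n_synthetic)
    rng.shuffle(mix)
    return mix


# Placeholder samples standing in for (image, label) pairs.
real_samples = [("real", i) for i in range(100)]
synthetic_samples = [("synthetic", i) for i in range(50)]

# Real-only baseline and a mixed set of the same size.
baseline = build_training_mix(real_samples, synthetic_samples, 100, 0.0)
mixed = build_training_mix(real_samples, synthetic_samples, 100, 0.3)
```

Because both sets contain the same number of samples, any difference in cross-dataset performance between models trained on `baseline` and `mixed` would reflect the composition of the data rather than its quantity.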


