Augmentation Backdoors

09/29/2022
by Joseph Rance, et al.

Data augmentation is used extensively to improve model generalisation. However, reliance on external libraries to implement augmentation methods introduces a vulnerability into the machine learning pipeline. It is well known that backdoors can be inserted into machine learning models by serving them a modified dataset to train on. Augmentation therefore gives an attacker an opportunity to make this modification without requiring an initially backdoored dataset. In this paper we present three backdoor attacks that can be covertly inserted into data augmentation. Each attack inserts a backdoor using a different type of computer vision augmentation transform: simple image transforms, GAN-based augmentation, and composition-based augmentation. Inserting the backdoor through these augmentation transforms makes it difficult to detect while still supporting arbitrary backdoor functionality. We evaluate our attacks on a range of computer vision benchmarks and demonstrate that an attacker can introduce backdoors through a malicious augmentation routine alone.
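To make the threat model concrete, the following toy sketch shows how an augmentation routine that appears to perform only a random horizontal flip could also poison a small fraction of samples. It is not the method from the paper: a classic patch-style trigger is swapped in purely for illustration, and it assumes the routine can see and rewrite labels. The names `malicious_augment`, `stamp_trigger`, `TARGET_LABEL`, `POISON_RATE`, and `PATCH_SIZE` are hypothetical, and images are plain nested lists so the example needs only the standard library.

```python
import random

# All constants below are hypothetical values chosen for illustration.
TARGET_LABEL = 0   # class the backdoor should map triggered inputs to
POISON_RATE = 0.1  # fraction of augmented samples that get poisoned
PATCH_SIZE = 3     # side length of the white square used as a trigger


def horizontal_flip(image):
    """Benign augmentation: mirror each row of a 2D grayscale image."""
    return [row[::-1] for row in image]


def stamp_trigger(image):
    """Return a copy of the image with a white square in the bottom-right corner."""
    stamped = [row[:] for row in image]  # copy rows so the input is not mutated
    for row in stamped[-PATCH_SIZE:]:
        row[-PATCH_SIZE:] = [1.0] * PATCH_SIZE
    return stamped


def malicious_augment(image, label, rng):
    """Behaves like a random flip, but occasionally stamps a trigger and relabels."""
    if rng.random() < 0.5:
        image = horizontal_flip(image)
    if rng.random() < POISON_RATE:
        return stamp_trigger(image), TARGET_LABEL
    return image, label


# Usage: augment 1000 copies of a blank 8x8 image that carries label 7.
rng = random.Random(0)
clean = [[0.0] * 8 for _ in range(8)]
augmented = [malicious_augment(clean, 7, rng) for _ in range(1000)]
poisoned = sum(1 for _, lbl in augmented if lbl == TARGET_LABEL)
```

In this sketch the dataset on disk is never touched, which is the point the abstract makes: the modification happens only inside the augmentation call, so inspecting the stored data would not reveal it. A patch trigger with relabelling like this one is easy to spot once you look at augmented samples; the abstract's claim is that the paper's attacks are harder to detect.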


