Good Artists Copy, Great Artists Steal: Model Extraction Attacks Against Image Translation Generative Adversarial Networks

04/26/2021
by Sebastian Szyller, et al.

Machine learning models are typically made available to potential client users via inference APIs. Model extraction attacks occur when a malicious client uses information gleaned from queries to the inference API of a victim model F_V to build a surrogate model F_A that has comparable functionality. Recent research has shown successful model extraction attacks against image classification and NLP models. In this paper, we show the first model extraction attack against real-world generative adversarial network (GAN) image translation models. We present a framework for conducting model extraction attacks against image translation models, and show that the adversary can successfully extract functional surrogate models. The adversary is not required to know F_V's architecture or any other information about it beyond its intended image translation task. The adversary queries F_V's inference interface using data drawn from the same domain as the training data for F_V. We evaluate the effectiveness of our attacks using three different instances of two popular categories of image translation: (1) Selfie-to-Anime and (2) Monet-to-Photo (image style transfer), and (3) Super-Resolution (super resolution). Using standard performance metrics for GANs, we show that our attacks are effective in each of the three cases – the differences between F_V and F_A, compared to the target, are in the following ranges: Selfie-to-Anime: FID 13.36-68.66, Monet-to-Photo: FID 3.57-4.40, and Super-Resolution: SSIM 0.06-0.08 and PSNR 1.43-4.46. Furthermore, we conducted a large-scale (125 participants) user study on Selfie-to-Anime and Monet-to-Photo to show that human perception of the images produced by the victim and surrogate models can be considered equivalent, within an equivalence bound of Cohen's d=0.3.
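The query-then-train loop described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the victim here is a hypothetical hidden linear map over flattened 16-value "images" rather than a GAN, the surrogate is fit by least squares rather than adversarial training, and the names `victim_api`, `extract_surrogate`, and `psnr` are invented for illustration. The only parts taken from the abstract are the black-box query access, the use of same-domain query data, and PSNR as one of the comparison metrics.

```python
import numpy as np


def psnr(reference, estimate, max_val=1.0):
    """Peak signal-to-noise ratio in dB: 10 * log10(max_val^2 / MSE)."""
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(max_val ** 2 / mse)


def victim_api(x):
    """Hypothetical black-box F_V (assumption): a fixed, hidden linear map.

    The attacker only ever sees inputs and outputs, never `w`.
    """
    rng = np.random.default_rng(0)  # fixed seed, so the hidden weights never change
    d = x.shape[1]
    w = rng.normal(scale=0.1, size=(d, d))
    return x @ w + 0.5


def extract_surrogate(queries):
    """Build F_A from (query, response) pairs collected through the API."""
    responses = victim_api(queries)  # black-box queries to F_V
    design = np.hstack([queries, np.ones((len(queries), 1))])  # add a bias column
    params, *_ = np.linalg.lstsq(design, responses, rcond=None)
    return lambda x: np.hstack([x, np.ones((len(x), 1))]) @ params


# Attacker data drawn from the same domain as the victim's inputs (here: uniform in [0, 1]).
rng = np.random.default_rng(1)
queries = rng.uniform(size=(200, 16))
held_out = rng.uniform(size=(50, 16))

surrogate = extract_surrogate(queries)
agreement_db = psnr(victim_api(held_out), surrogate(held_out))
print(f"PSNR between F_V and F_A outputs on held-out inputs: {agreement_db:.1f} dB")
```

Because the toy victim is exactly linear, the surrogate matches it almost perfectly; a real image translation GAN would need a learned generator as the surrogate and would not be recovered this closely.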

research
06/18/2022

Multi-Modality Image Super-Resolution using Generative Adversarial Networks

Over the past few years deep learning-based techniques such as Generativ...
research
03/05/2020

Generating Embroidery Patterns Using Image-to-Image Translation

In many scenarios in computer vision, machine learning, and computer gra...
research
01/06/2021

Model Extraction and Defenses on Generative Adversarial Networks

Model extraction attacks aim to duplicate a machine learning model throu...
research
08/01/2019

Content and Colour Distillation for Learning Image Translations with the Spatial Profile Loss

Generative adversarial networks has emerged as a defacto standard for im...
research
10/06/2021

SDA-GAN: Unsupervised Image Translation Using Spectral Domain Attention-Guided Generative Adversarial Network

This work introduced a novel GAN architecture for unsupervised image tra...
research
05/19/2018

Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation

Recently, Image-to-Image Translation (IIT) has made great progress in en...
research
08/09/2023

Data-Free Model Extraction Attacks in the Context of Object Detection

A significant number of machine learning models are vulnerable to model ...
