Approximate Data Deletion from Machine Learning Models: Algorithms and Evaluations

02/24/2020
by   Zachary Izzo, et al.
0

Deleting data from a trained machine learning (ML) model is a critical task in many applications. For example, we may want to remove the influence of training points that might be out of date or outliers. Regulations such as EU's General Data Protection Regulation also stipulate that individuals can request to have their data deleted. The naive approach to data deletion is to retrain the ML model on the remaining data, but this is too time consuming. Moreover there is no known efficient algorithm that exactly deletes data from most ML models. In this work, we evaluate several approaches for approximate data deletion from trained models. For the case of linear regression, we propose a new method with linear dependence on the feature dimension d, a significant gain over all existing methods which all have superlinear time dependence on the dimension. We also provide a new test for evaluating data deletion from linear models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2019

Making AI Forget You: Data Deletion in Machine Learning

Intense recent discussions have focused on how to provide individuals wi...
research
06/29/2022

Approximate Data Deletion in Generative Models

Users have the right to have their data deleted by third-party learned s...
research
06/08/2021

Adaptive Machine Unlearning

Data deletion algorithms aim to remove the influence of deleted data poi...
research
08/30/2022

On the Trade-Off between Actionable Explanations and the Right to be Forgotten

As machine learning (ML) models are increasingly being deployed in high-...
research
09/25/2022

Algorithms that Approximate Data Removal: New Results and Limitations

We study the problem of deleting user data from machine learning models ...
research
03/04/2021

Remember What You Want to Forget: Algorithms for Machine Unlearning

We study the problem of forgetting datapoints from a learnt model. In th...
research
05/21/2023

Random Relabeling for Efficient Machine Unlearning

Learning algorithms and data are the driving forces for machine learning...

Please sign up or login with your details

Forgot password? Click here to reset