Privacy Adhering Machine Un-learning in NLP

Regulations such as the General Data Protection Regulation (GDPR) in the EU and the California Consumer Privacy Act (CCPA) in the US include provisions on the right to be forgotten, which mandate that industry applications remove data related to an individual from their systems. In many real-world industry applications that use Machine Learning to build models on user data, such mandates require significant effort in both data cleansing and model retraining, while ensuring that the models do not deteriorate in prediction quality because of the removed data. As a result, continuous data removal and model retraining do not scale if these applications receive such requests at a very high frequency. Recently, a few researchers proposed the idea of Machine Unlearning to tackle this challenge. Despite the importance of this task, Machine Unlearning remains under-explored in Natural Language Processing (NLP). In this paper, we explore the Unlearning framework on various GLUE tasks <cit.>, such as QQP, SST, and MNLI. We propose computationally efficient approaches (SISA-FC and SISA-A) to perform guaranteed Unlearning that provide significant reductions in memory (90-95%), time (100x), and space consumption (99%) compared to the baselines while keeping model performance constant.

