Task-aware Privacy Preservation for Multi-dimensional Data

10/05/2021
by   Jiangnan Cheng, et al.
0

Local differential privacy (LDP), a state-of-the-art technique for privacy preservation, has been successfully deployed in a few real-world applications. In the future, LDP can be adopted to anonymize richer user data attributes that will be input to more sophisticated machine learning (ML) tasks. However, today's LDP approaches are largely task-agnostic and often lead to sub-optimal performance – they will simply inject noise to all data attributes according to a given privacy budget, regardless of what features are most relevant for an ultimate task. In this paper, we address how to significantly improve the ultimate task performance for multi-dimensional user data by considering a task-aware privacy preservation problem. The key idea is to use an encoder-decoder framework to learn (and anonymize) a task-relevant latent representation of user data, which gives an analytical near-optimal solution for a linear setting with mean-squared error (MSE) task loss. We also provide an approximate solution through a learning algorithm for general nonlinear cases. Extensive experiments demonstrate that our task-aware approach significantly improves ultimate task accuracy compared to a standard benchmark LDP approach while guaranteeing the same level of privacy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2023

(Local) Differential Privacy has NO Disparate Impact on Fairness

In recent years, Local Differential Privacy (LDP), a robust privacy-pres...
research
04/15/2021

Privacy-Adaptive BERT for Natural Language Understanding

When trying to apply the recent advance of Natural Language Understandin...
research
10/23/2020

Learning to Noise: Application-Agnostic Data Sharing with Local Differential Privacy

In recent years, the collection and sharing of individuals' private data...
research
06/05/2019

Locally Differentially Private Data Collection and Analysis

Local differential privacy (LDP) can provide each user with strong priva...
research
12/26/2022

Packing Privacy Budget Efficiently

Machine learning (ML) models can leak information about users, and diffe...
research
10/31/2022

kt-Safety: Graph Release via k-Anonymity and t-Closeness (Technical Report)

In a wide spectrum of real-world applications, it is very important to a...
research
12/03/2022

Castell: Scalable Joint Probability Estimation of Multi-dimensional Data Randomized with Local Differential Privacy

Performing randomized response (RR) over multi-dimensional data is subje...

Please sign up or login with your details

Forgot password? Click here to reset