Collecting and Analyzing Multidimensional Data with Local Differential Privacy

06/28/2019
by   Ning Wang, et al.
0

Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 8

page 9

page 12

research
11/20/2018

Locally Private Gaussian Estimation

We study a basic private estimation problem: each of n users draws a sin...
research
09/04/2022

On the Risks of Collecting Multidimensional Data Under Local Differential Privacy

The private collection of multiple statistics from a population is a fun...
research
10/11/2020

A Comprehensive Survey on Local Differential Privacy Toward Data Statistics and Analysis in Crowdsensing

Collecting and analyzing massive data generated from smart devices have ...
research
04/25/2023

(Local) Differential Privacy has NO Disparate Impact on Fairness

In recent years, Local Differential Privacy (LDP), a robust privacy-pres...
research
11/08/2021

Improving the Utility of Locally Differentially Private Protocols for Longitudinal and Multidimensional Frequency Estimates

This paper investigates the problem of collecting multidimensional data ...
research
09/15/2021

Random Sampling Plus Fake Data: Multidimensional Frequency Estimates With Local Differential Privacy

With local differential privacy (LDP), users can privatize their data an...
research
10/15/2018

Assessing and Remedying Coverage for a Given Dataset

Data analysis impacts virtually every aspect of our society today. Often...

Please sign up or login with your details

Forgot password? Click here to reset