Don't Look at the Data! How Differential Privacy Reconfigures the Practices of Data Science

02/23/2023
by   Jayshree Sarathy, et al.
0

Across academia, government, and industry, data stewards are facing increasing pressure to make datasets more openly accessible for researchers while also protecting the privacy of data subjects. Differential privacy (DP) is one promising way to offer privacy along with open access, but further inquiry is needed into the tensions between DP and data science. In this study, we conduct interviews with 19 data practitioners who are non-experts in DP as they use a DP data analysis prototype to release privacy-preserving statistics about sensitive data, in order to understand perceptions, challenges, and opportunities around using DP. We find that while DP is promising for providing wider access to sensitive datasets, it also introduces challenges into every stage of the data science workflow. We identify ethics and governance questions that arise when socializing data scientists around new privacy constraints and offer suggestions to better integrate DP and data science.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2020

The Limits of Differential Privacy (and its Misuse in Data Release and Machine Learning)

Differential privacy (DP) is a neat privacy definition that can co-exist...
research
12/03/2021

Differential Privacy in Privacy-Preserving Big Data and Learning: Challenge and Opportunity

Differential privacy (DP) has become the de facto standard of privacy pr...
research
09/15/2021

A Systematic Literature Review on Wearable Health Data Publishing under Differential Privacy

Wearable devices generate different types of physiological data about th...
research
09/09/2022

Impacts of Census Differential Privacy for Small-Area Disease Mapping to Monitor Health Inequities

US Census Bureau (USCB) has implemented a new privacy-preserving disclos...
research
02/17/2021

Differential Privacy for Government Agencies – Are We There Yet?

Government agencies always need to carefully consider potential risks of...
research
10/11/2021

Privacy preserving local analysis of digital trace data: A proof-of-concept

We present PORT, a software platform for local data extraction and analy...
research
07/07/2023

Random Number Generators and Seeding for Differential Privacy

Differential Privacy (DP) relies on random numbers to preserve privacy, ...

Please sign up or login with your details

Forgot password? Click here to reset