Optimal Data Acquisition with Privacy-Aware Agents

by   Rachel Cummings, et al.

We study the problem faced by a data analyst or platform that wishes to collect private data from privacy-aware agents. To incentivize participation, in exchange for this data, the platform provides a service to the agents in the form of a statistic computed using all agents' submitted data. The agents decide whether to join the platform (and truthfully reveal their data) or not participate by considering both the privacy costs of joining and the benefit they get from obtaining the statistic. The platform must ensure the statistic is computed differentially privately and chooses a central level of noise to add to the computation, but can also induce personalized privacy levels (or costs) by giving different weights to different agents in the computation as a function of their heterogeneous privacy preferences (which are known to the platform). We assume the platform aims to optimize the accuracy of the statistic, and must pick the privacy level of each agent to trade-off between i) incentivizing more participation and ii) adding less noise to the estimate. We provide a semi-closed form characterization of the optimal choice of agent weights for the platform in two variants of our model. In both of these models, we identify a common nontrivial structure in the platform's optimal solution: an instance-specific number of agents with the least stringent privacy requirements are pooled together and given the same weight, while the weights of the remaining agents decrease as a function of the strength of their privacy requirement. We also provide algorithmic results on how to find the optimal value of the noise parameter used by the platform and of the weights given to the agents.


page 1

page 2

page 3

page 4


Differentially Private LQ Control

As multi-agent systems proliferate and share more and more user data, ne...

Optimal and Differentially Private Data Acquisition: Central and Local Mechanisms

We consider a platform's problem of collecting data from privacy sensiti...

Data Curation from Privacy-Aware Agents

A data curator would like to collect data from privacy-aware agents. The...

The Fair Value of Data Under Heterogeneous Privacy Constraints

Modern data aggregation often takes the form of a platform collecting da...

Optimal Data Acquisition for Statistical Estimation

We consider a data analyst's problem of purchasing data from strategic a...

ASCII: ASsisted Classification with Ignorance Interchange

The rapid development in data collecting devices and computation platfor...

The Fast and the Private: Task-based Dataset Search

Modern dataset search platforms employ ML task-based utility metrics ins...

Please sign up or login with your details

Forgot password? Click here to reset