An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation

02/04/2023
by   Pinghui Wang, et al.
0

Counting the number of distinct elements distributed over multiple data holders is a fundamental problem with many real-world applications ranging from crowd counting to network monitoring. Although a number of space and computational efficient sketch methods (e.g., the Flajolet-Martin sketch and the HyperLogLog sketch) for cardinality estimation have been proposed to solve the above problem, these sketch methods are insecure when considering privacy concerns related to the use of each data holder's personal dataset. Despite a recently proposed protocol that successfully implements the well-known Flajolet-Martin (FM) sketch on a secret-sharing based multiparty computation (MPC) framework for solving the problem of private distributed cardinality estimation (PDCE), we observe that this MPC-FM protocol is not differentially private. In addition, the MPC-FM protocol is computationally expensive, which limits its applications to data holders with limited computation resources. To address the above issues, in this paper we propose a novel protocol DP-DICE, which is computationally efficient and differentially private for solving the problem of PDCE. Experimental results show that our DP-DICE achieves orders of magnitude speedup and reduces the estimation error by several times in comparison with state-of-the-arts under the same security requirements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2021

Differentially Private Frequency Moments Estimation with Polylogarithmic Space

We prove that 𝔽_p sketch, a well-celebrated streaming algorithm for freq...
research
02/04/2023

Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting

Data sketching is a critical tool for distinct counting, enabling multis...
research
01/31/2020

Efficient Differentially Private F_0 Linear Sketching

A powerful feature of linear sketches is that from sketches of two data ...
research
02/10/2023

Fast Gumbel-Max Sketch and its Applications

The well-known Gumbel-Max Trick for sampling elements from a categorical...
research
10/11/2021

Differentially Private Approximate Quantiles

In this work we study the problem of differentially private (DP) quantil...
research
07/26/2021

Selective MPC: Distributed Computation of Differentially Private Key Value Statistics

An increasingly popular method for computing aggregate statistics while ...
research
06/11/2022

Sampling-based Estimation of the Number of Distinct Values in Distributed Environment

In data mining, estimating the number of distinct values (NDV) is a fund...

Please sign up or login with your details

Forgot password? Click here to reset