Overlook: Differentially Private Exploratory Visualization for Big Data

06/22/2020
by   Pratiksha Thaker, et al.
0

Data exploration systems that provide differential privacy must manage a privacy budget that measures the amount of privacy lost across multiple queries. One effective strategy to manage the privacy budget is to compute a one-time private synopsis of the data, to which users can make an unlimited number of queries. However, existing systems using synopses are built for offline use cases, where a set of queries is known ahead of time and the system carefully optimizes a synopsis for it. The synopses that these systems build are costly to compute and may also be costly to store. We introduce Overlook, a system that enables private data exploration at interactive latencies for both data analysts and data curators. The key idea in Overlook is a virtual synopsis that can be evaluated incrementally, without extra space storage or expensive precomputation. Overlook simply executes queries using an existing engine, such as a SQL DBMS, and adds noise to their results. Because Overlook's synopses do not require costly precomputation or storage, data curators can also use Overlook to explore the impact of privacy parameters interactively. Overlook offers a rich visual query interface based on the open source Hillview system. Overlook achieves accuracy comparable to existing synopsis-based systems, while offering better performance and removing the need for extra storage.

READ FULL TEXT

page 10

page 11

research
10/03/2018

Shrinkwrap: Differentially-Private Query Processing in Private Data Federations

A private data federation is a set of autonomous databases that share a ...
research
12/19/2022

Multi-Analyst Differential Privacy for Online Query Answering

Most differentially private mechanisms are designed for the use of a sin...
research
06/23/2021

Mr. Plotter: Unifying Data Reduction Techniques in Storage and Visualization Systems

As the rate of data collection continues to grow rapidly, developing vis...
research
09/19/2023

DProvDB: Differentially Private Query Processing with Multi-Analyst Provenance

Recent years have witnessed the adoption of differential privacy (DP) in...
research
09/04/2019

Differentially Private SQL with Bounded User Contribution

Differential privacy (DP) provides formal guarantees that the output of ...
research
11/25/2022

M^2M: A general method to perform various data analysis tasks from a differentially private sketch

Differential privacy is the standard privacy definition for performing a...
research
11/30/2022

Differentially Private Enhanced Permissioned Blockchain for Private Data Sharing in Industrial IoT

The integration of permissioned blockchain such as Hyperledger fabric (H...

Please sign up or login with your details

Forgot password? Click here to reset