Between Pure and Approximate Differential Privacy
January 24, 2015 ยท Declared Dead ยท ๐ Journal of Privacy and Confidentiality
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Thomas Steinke, Jonathan Ullman
arXiv ID
1501.06095
Category
cs.DS: Data Structures & Algorithms
Cross-listed
cs.CR,
cs.LG
Citations
171
Venue
Journal of Privacy and Confidentiality
Last Checked
1 month ago
Abstract
We show a new lower bound on the sample complexity of $(\varepsilon, ฮด)$-differentially private algorithms that accurately answer statistical queries on high-dimensional databases. The novelty of our bound is that it depends optimally on the parameter $ฮด$, which loosely corresponds to the probability that the algorithm fails to be private, and is the first to smoothly interpolate between approximate differential privacy ($ฮด> 0$) and pure differential privacy ($ฮด= 0$). Specifically, we consider a database $D \in \{\pm1\}^{n \times d}$ and its \emph{one-way marginals}, which are the $d$ queries of the form "What fraction of individual records have the $i$-th bit set to $+1$?" We show that in order to answer all of these queries to within error $\pm ฮฑ$ (on average) while satisfying $(\varepsilon, ฮด)$-differential privacy, it is necessary that $$ n \geq ฮฉ\left( \frac{\sqrt{d \log(1/ฮด)}}{ฮฑ\varepsilon} \right), $$ which is optimal up to constant factors. To prove our lower bound, we build on the connection between \emph{fingerprinting codes} and lower bounds in differential privacy (Bun, Ullman, and Vadhan, STOC'14). In addition to our lower bound, we give new purely and approximately differentially private algorithms for answering arbitrary statistical queries that improve on the sample complexity of the standard Laplace and Gaussian mechanisms for achieving worst-case accuracy guarantees by a logarithmic factor.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Data Structures & Algorithms
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
Relief-Based Feature Selection: Introduction and Review
R.I.P.
๐ป
Ghosted
Route Planning in Transportation Networks
R.I.P.
๐ป
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
๐ป
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
๐ป
Ghosted
Graph Isomorphism in Quasipolynomial Time
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Language Models are Few-Shot Learners
R.I.P.
๐ป
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted