English

Differentially Private Quantiles

Machine Learning 2021-09-21 v3 Cryptography and Security

Abstract

Quantiles are often used for summarizing and understanding data. If that data is sensitive, it may be necessary to compute quantiles in a way that is differentially private, providing theoretical guarantees that the result does not reveal private information. However, when multiple quantiles are needed, existing differentially private algorithms fare poorly: they either compute quantiles individually, splitting the privacy budget, or summarize the entire distribution, wasting effort. In either case the result is reduced accuracy. In this work we propose an instance of the exponential mechanism that simultaneously estimates exactly mm quantiles from nn data points while guaranteeing differential privacy. The utility function is carefully structured to allow for an efficient implementation that returns estimates of all mm quantiles in time O(mnlog(n)+m2n)O(mn\log(n) + m^2n). Experiments show that our method significantly outperforms the current state of the art on both real and synthetic data while remaining efficient enough to be practical.

Keywords

Cite

@article{arxiv.2102.08244,
  title  = {Differentially Private Quantiles},
  author = {Jennifer Gillenwater and Matthew Joseph and Alex Kulesza},
  journal= {arXiv preprint arXiv:2102.08244},
  year   = {2021}
}

Comments

This version corrects pseudocode typos and adds a note about perturbing data