English

Locally Differentially Private Sparse Vector Aggregation

Cryptography and Security 2022-03-01 v4

Abstract

Vector mean estimation is a central primitive in federated analytics. In vector mean estimation, each user i[n]i \in [n] holds a real-valued vector vi[1,1]dv_i\in [-1, 1]^d, and a server wants to estimate the mean of all nn vectors. Not only so, we would like to protect each individual user's privacy. In this paper, we consider the kk-sparse version of the vector mean estimation problem, that is, suppose that each user's vector has at most kk non-zero coordinates in its dd-dimensional vector, and moreover, kdk \ll d. In practice, since the universe size dd can be very large (e.g., the space of all possible URLs), we would like the per-user communication to be succinct, i.e., independent of or (poly-)logarithmic in the universe size. In this paper, we are the first to show matching upper- and lower-bounds for the kk-sparse vector mean estimation problem under local differential privacy. Specifically, we construct new mechanisms that achieve asymptotically optimal error as well as succinct communication, either under user-level-LDP or event-level-LDP. We implement our algorithms and evaluate them on synthetic as well as real-world datasets. Our experiments show that we can often achieve one or two orders of magnitude reduction in error in comparison with prior works under typical choices of parameters, while incurring insignificant communication cost.

Keywords

Cite

@article{arxiv.2112.03449,
  title  = {Locally Differentially Private Sparse Vector Aggregation},
  author = {Mingxun Zhou and Tianhao Wang and T-H. Hubert Chan and Giulia Fanti and Elaine Shi},
  journal= {arXiv preprint arXiv:2112.03449},
  year   = {2022}
}
R2 v1 2026-06-24T08:06:57.533Z