Efficient Unbiased Sparsification

Leighton Barnes; Stephen Cameron; Timothy Chow; Emma Cohen; Keith Frankston; Benjamin Howard; Fred Kochman; Daniel Scheinerman; Jeffrey VanderKam

Efficient Unbiased Sparsification

Information Theory 2024-07-25 v2 Machine Learning math.IT Statistics Theory Statistics Theory

Authors: Leighton Barnes , Stephen Cameron , Timothy Chow , Emma Cohen , Keith Frankston , Benjamin Howard , Fred Kochman , Daniel Scheinerman , Jeffrey VanderKam

View on arXiv ↗ PDF ↗

Abstract

An unbiased $m$ -sparsification of a vector $p\in \mathbb{R}^n$ is a random vector $Q\in \mathbb{R}^n$ with mean $p$ that has at most $m<n$ nonzero coordinates. Unbiased sparsification compresses the original vector without introducing bias; it arises in various contexts, such as in federated learning and sampling sparse probability distributions. Ideally, unbiased sparsification should also minimize the expected value of a divergence function $\mathsf{Div}(Q,p)$ that measures how far away $Q$ is from the original $p$ . If $Q$ is optimal in this sense, then we call it efficient. Our main results describe efficient unbiased sparsifications for divergences that are either permutation-invariant or additively separable. Surprisingly, the characterization for permutation-invariant divergences is robust to the choice of divergence function, in the sense that our class of optimal $Q$ for squared Euclidean distance coincides with our class of optimal $Q$ for Kullback-Leibler divergence, or indeed any of a wide variety of divergences.

Keywords

gaussian estimation sparse optimization randomized algorithm

Cite

@article{arxiv.2402.14925,
  title  = {Efficient Unbiased Sparsification},
  author = {Leighton Barnes and Stephen Cameron and Timothy Chow and Emma Cohen and Keith Frankston and Benjamin Howard and Fred Kochman and Daniel Scheinerman and Jeffrey VanderKam},
  journal= {arXiv preprint arXiv:2402.14925},
  year   = {2024}
}

Efficient Unbiased Sparsification

Abstract

Keywords

Cite

Related papers