English

Streaming Algorithms from Precision Sampling

Data Structures and Algorithms 2011-04-26 v2 Computational Geometry

Abstract

A technique introduced by Indyk and Woodruff [STOC 2005] has inspired several recent advances in data-stream algorithms. We show that a number of these results follow easily from the application of a single probabilistic method called Precision Sampling. Using this method, we obtain simple data-stream algorithms that maintain a randomized sketch of an input vector x=(x1,...xn)x=(x_1,...x_n), which is useful for the following applications. 1) Estimating the FkF_k-moment of xx, for k>2k>2. 2) Estimating the p\ell_p-norm of xx, for p[1,2]p\in[1,2], with small update time. 3) Estimating cascaded norms p(q)\ell_p(\ell_q) for all p,q>0p,q>0. 4) 1\ell_1 sampling, where the goal is to produce an element ii with probability (approximately) xi/x1|x_i|/\|x\|_1. It extends to similarly defined p\ell_p-sampling, for p[1,2]p\in [1,2]. For all these applications the algorithm is essentially the same: scale the vector x entry-wise by a well-chosen random vector, and run a heavy-hitter estimation algorithm on the resulting vector. Our sketch is a linear function of x, thereby allowing general updates to the vector x. Precision Sampling itself addresses the problem of estimating a sum i=1nai\sum_{i=1}^n a_i from weak estimates of each real ai[0,1]a_i\in[0,1]. More precisely, the estimator first chooses a desired precision ui(0,1]u_i\in(0,1] for each i[n]i\in[n], and then it receives an estimate of every aia_i within additive uiu_i. Its goal is to provide a good approximation to ai\sum a_i while keeping a tab on the "approximation cost" i(1/ui)\sum_i (1/u_i). Here we refine previous work [Andoni, Krauthgamer, and Onak, FOCS 2010] which shows that as long as ai=Ω(1)\sum a_i=\Omega(1), a good multiplicative approximation can be achieved using total precision of only O(nlogn)O(n\log n).

Keywords

Cite

@article{arxiv.1011.1263,
  title  = {Streaming Algorithms from Precision Sampling},
  author = {Alexandr Andoni and Robert Krauthgamer and Krzysztof Onak},
  journal= {arXiv preprint arXiv:1011.1263},
  year   = {2011}
}
R2 v1 2026-06-21T16:39:16.206Z