English
Related papers

Related papers: Streaming Algorithms from Precision Sampling

200 papers

The problem of estimating the pth moment F_p (p nonnegative and real) in data streams is as follows. There is a vector x which starts at 0, and many updates of the form x_i <-- x_i + v come sequentially in a stream. The algorithm also…

Data Structures and Algorithms · Computer Science 2009-04-09 Daniel M. Kane , Jelani Nelson , David P. Woodruff

This paper addresses a fundamental problem in random variate generation: given access to a random source that emits a stream of independent fair bits, what is the most accurate and entropy-efficient algorithm for sampling from a discrete…

Data Structures and Algorithms · Computer Science 2020-03-10 Feras A. Saad , Cameron E. Freer , Martin C. Rinard , Vikash K. Mansinghka

We study $\ell_p$ sampling and frequency moment estimation in a single-pass insertion-only data stream. For $p \in (0,2)$, we present a nearly space-optimal approximate $\ell_p$ sampler that uses $\widetilde{O}(\log n \log(1/\delta))$ bits…

Data Structures and Algorithms · Computer Science 2026-04-07 Honghao Lin , Hoai-An Nguyen , William Swartworth , David P. Woodruff

Low-rank approximation in data streams is a fundamental and significant task in computing science, machine learning and statistics. Multiple streaming algorithms have emerged over years and most of them are inspired by randomized…

Data Structures and Algorithms · Computer Science 2022-09-30 Cuiyu Liu , Chuanfu Xiao , Mingshuo Ding , Chao Yang

We study the classical problem of moment estimation of an underlying vector whose $n$ coordinates are implicitly defined through a series of updates in a data stream. We show that if the updates to the vector arrive in the random-order…

Data Structures and Algorithms · Computer Science 2022-07-08 David P. Woodruff , Samson Zhou

Most known algorithms in the streaming model of computation aim to approximate a single function such as an $\ell_p$-norm. In 2009, Nelson [\url{https://sublinear.info}, Open Problem 30] asked if it possible to design \emph{universal…

Data Structures and Algorithms · Computer Science 2020-04-07 Vladimir Braverman , Robert Krauthgamer , Lin F. Yang

We introduce a new computational model for data streams: asymptotically exact streaming algorithms. These algorithms have an approximation ratio that tends to one as the length of the stream goes to infinity while the memory used by the…

Data Structures and Algorithms · Computer Science 2014-08-11 Marc Heinrich , Alexander Munteanu , Christian Sohler

Given data stream $D = \{p_1,p_2,...,p_m\}$ of size $m$ of numbers from $\{1,..., n\}$, the frequency of $i$ is defined as $f_i = |\{j: p_j = i\}|$. The $k$-th \emph{frequency moment} of $D$ is defined as $F_k = \sum_{i=1}^n f_i^k$. We…

Data Structures and Algorithms · Computer Science 2012-12-05 Vladimir Braverman , Rafail Ostrovsky

In insertion-only streaming, one sees a sequence of indices $a_1, a_2, \ldots, a_m\in [n]$. The stream defines a sequence of $m$ frequency vectors $x^{(1)},\ldots,x^{(m)}\in\mathbb{R}^n$ with $(x^{(t)})_i = |\{j : j\in[t], a_j = i\}|$. That…

Data Structures and Algorithms · Computer Science 2017-11-10 Jarosław Błasiok , Jian Ding , Jelani Nelson

Histograms, i.e., piece-wise constant approximations, are a popular tool used to represent data distributions. Traditionally, the difference between the histogram and the underlying distribution (i.e., the approximation error) is measured…

Data Structures and Algorithms · Computer Science 2022-07-19 Justin Y. Chen , Piotr Indyk , Tal Wagner

The probabilistic-stream model was introduced by Jayram et al. \cite{JKV07}. It is a generalization of the data stream model that is suited to handling ``probabilistic'' data where each item of the stream represents a probability…

Data Structures and Algorithms · Computer Science 2007-05-23 Andrew McGregor , S. Muthukrishnan

The manuscript introduces a method to select a random sample from a stream by deciding on each sampling unit immediately after observing it. The process could be applied to unequal as well as equal probability sampling. The implementation…

Data Structures and Algorithms · Computer Science 2021-11-19 Bardia Panahbehagh , Raphaël Jauslin , Yves Tillé

For each $p \in (0,2]$, we present a randomized algorithm that returns an $\epsilon$-approximation of the $p$th frequency moment of a data stream $F_p = \sum_{i = 1}^n \abs{f_i}^p$. The algorithm requires space $O(\epsilon^{-2} \log…

Data Structures and Algorithms · Computer Science 2010-06-21 Sumit Ganguly

Approximate computing aims for efficient execution of workflows where an approximate output is sufficient instead of the exact output. The idea behind approximate computing is to compute over a representative sample instead of the entire…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-09-12 Do Le Quoc , Ruichuan Chen , Pramod Bhatotia , Christof Fetze , Volker Hilt , Thorsten Strufe

We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning. Contrary to state-of-the-art streaming frequency estimation algorithms, which heavily rely on random…

Data Structures and Algorithms · Computer Science 2022-07-19 Dimitris Bertsimas , Vassilis Digalakis

In this paper we study how to perform distinct sampling in the streaming model where data contain near-duplicates. The goal of distinct sampling is to return a distinct element uniformly at random from the universe of elements, given that…

Data Structures and Algorithms · Computer Science 2018-10-31 Jiecao Chen , Qin Zhang

From a high volume stream of weighted items, we want to maintain a generic sample of a certain limited size $k$ that we can later use to estimate the total weight of arbitrary subsets. This is the classic context of on-line reservoir…

Data Structures and Algorithms · Computer Science 2010-11-16 Edith Cohen , Nick Duffield , Haim Kaplan , Carsten Lund , Mikkel Thorup

A streaming algorithm to compute the spectral proper orthogonal decomposition (SPOD) of stationary random processes is presented. As new data becomes available, an incremental update of the truncated eigenbasis of the estimated…

Fluid Dynamics · Physics 2019-01-14 Oliver T. Schmidt , Aaron Towne

The \emph{$\ell_2$ tracking problem} is the task of obtaining a streaming algorithm that, given access to a stream of items $a_1,a_2,a_3,\ldots$ from a universe $[n]$, outputs at each time $t$ an estimate to the $\ell_2$ norm of the…

Data Structures and Algorithms · Computer Science 2019-09-02 Chi-Ning Chou , Zhixian Lei , Preetum Nakkiran

In this paper, we study streaming algorithms that minimize the number of changes made to their internal state (i.e., memory contents). While the design of streaming algorithms typically focuses on minimizing space and update time, these…

Data Structures and Algorithms · Computer Science 2024-06-12 Rajesh Jayaram , David P. Woodruff , Samson Zhou
‹ Prev 1 2 3 10 Next ›