Related papers: Better Bounds for Frequency Moments in Random-Orde…

200 papers

We revisit one of the classic problems in the data stream literature, namely, that of estimating the frequency moments $F_p$ for $0 < p < 2$ of an underlying $n$-dimensional vector presented as a sequence of additive updates in a stream. It…

Data Structures and Algorithms · Computer Science 2018-03-07 Vladimir Braverman, Emanuele Viola, David Woodruff, Lin F. Yang

Frequency estimation in data streams is one of the classical problems in streaming algorithms. Following much research, there are now almost matching upper and lower bounds for the trade-off needed between the number of samples and the…

Computational Complexity · Computer Science 2023-01-16 Shachar Lovett, Jiapeng Zhang

In this paper we consider the problem of approximating frequency moments in the streaming model. Given a stream $D = \{p_1,p_2,\dots,p_m\}$ of numbers from $\{1,\dots, n\}$, a frequency of $i$ is defined as $f_i = |\{j: p_j = i\}|$. The…

Data Structures and Algorithms · Computer Science 2014-01-28 Vladimir Braverman, Jonathan Katzman, Charles Seidell, Gregory Vorsanger

We consider the \textsf{Unit Interval Selection} problem in the one-pass random order streaming model. Here, an algorithm is presented with a sequence of $n$ unit-length intervals on the line that arrive in uniform random order, and the…

Data Structures and Algorithms · Computer Science 2026-03-11 Cezar-Mihail Alexandru, Adithya Diddapur, Magnús M. Halldórsson, Christian Konrad, Kheeran K. Naidu

We show an improved lower bound for the $F_p$ estimation problem in a data stream setting for $p > 2$. A data stream is a sequence of items from the domain $[n]$ with possible repetitions. The frequency vector $x$ is an $n$-dimensional non-negative…

Data Structures and Algorithms · Computer Science 2015-03-19 Sumit Ganguly

We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning. Contrary to state-of-the-art streaming frequency estimation algorithms, which heavily rely on random…

Data Structures and Algorithms · Computer Science 2022-07-19 Dimitris Bertsimas, Vassilis Digalakis

Mining frequent itemsets from static databases has been extensively studied and applied, and is considered a highly challenging task. For this reason, it is interesting to extend it to the data stream setting. In the streaming case, the…

Databases · Computer Science 2012-06-06 Manel Zarrouk, Med Salah Gouider

Machine learning from data streams is an active and growing research area. Research on learning from streaming data typically makes strict assumptions linked to computational resource constraints, including requirements for stream mining…

Machine Learning · Computer Science 2023-11-01 Indre Zliobaite, Jesse Read

Estimating the first moment of a data stream defined as $F_1 = \sum_{i \in \{1, 2, \ldots, n\}} |f_i|$ to within $1 \pm \epsilon$-relative error with high probability is a basic and influential problem in data stream processing. A tight…

Data Structures and Algorithms · Computer Science 2015-03-17 Sumit Ganguly, Purushottam Kar

We study the classical problem of moment estimation of an underlying vector whose $n$ coordinates are implicitly defined through a series of updates in a data stream. We show that if the updates to the vector arrive in the random-order…

Data Structures and Algorithms · Computer Science 2022-07-08 David P. Woodruff, Samson Zhou

Estimating the second frequency moment of a stream up to $(1\pm\varepsilon)$ multiplicative error requires at most $O(\log n / \varepsilon^2)$ bits of space, due to a seminal result of Alon, Matias, and Szegedy. It is also known that at…

Data Structures and Algorithms · Computer Science 2025-08-06 Mark Braverman, Or Zamir

We study which property testing and sublinear time algorithms can be transformed into graph streaming algorithms for random order streams. Our main result is that for bounded degree graphs, any property that is constant-query testable in…

Data Structures and Algorithms · Computer Science 2017-07-25 Morteza Monemizadeh, S. Muthukrishnan, Pan Peng, Christian Sohler

A central problem in data streams is to characterize which functions of an underlying frequency vector can be approximated efficiently. Recently there has been considerable effort in extending this problem to that of estimating functions of…

Data Structures and Algorithms · Computer Science 2018-10-25 Vladimir Braverman, Stephen R. Chestnut, Robert Krauthgamer, Yi Li, David P. Woodruff, Lin F. Yang

We obtain the best possible upper bounds for the moments of a single order statistic from independent, non-negative random variables, in terms of the population mean. The main result covers the independent identically distributed case.…

Statistics Theory · Mathematics 2018-06-14 Nickos Papadatos

One of the oldest problems in the data stream model is to approximate the $p$-th moment $\|\mathcal{X}\|_p^p = \sum_{i=1}^n |\mathcal{X}_i|^p$ of an underlying vector $\mathcal{X} \in \mathbb{R}^n$, which is presented as a sequence of…

Data Structures and Algorithms · Computer Science 2019-07-15 Rajesh Jayaram, David P. Woodruff

The bounds for absolute moments of order statistics are established. Let $X_1,\dots ,X_n$ be independent identically distributed real-valued random variables and let $X_{1:n}\le \dots \le X_{n:n}$ be the corresponding order statistics. The…

Probability · Mathematics 2016-08-01 Nadezhda V. Gribkova

We introduce a new notion of information complexity for multi-pass streaming problems and use it to resolve several important questions in data streams. In the coin problem, one sees a stream of $n$ i.i.d. uniform bits and one would like to…

Computational Complexity · Computer Science 2024-04-01 Mark Braverman, Sumegha Garg, Qian Li, Shuo Wang, David P. Woodruff, Jiapeng Zhang

Estimating the second frequency moment $F_2$ of a data stream up to a $(1 \pm \varepsilon)$ factor is a central problem in the streaming literature. For errors $\varepsilon > \Omega(1/\sqrt{n})$, the tight bound…

Data Structures and Algorithms · Computer Science 2025-09-10 Naomi Green-Maimon, Or Zamir

We consider the problem of learning over non-stationary ranking streams. The rankings can be interpreted as the preferences of a population and the non-stationarity means that the distribution of preferences changes over time. Our goal is…

Machine Learning · Statistics 2020-10-28 Ekhine Irurozki, Jesus Lobo, Aritz Perez, Javier Del Ser

We use fluid limits to explore the (in)stability properties of wireless networks with queue-based random-access algorithms. Queue-based random-access schemes are simple and inherently distributed in nature, yet provide the capability to…

Networking and Internet Architecture · Computer Science 2013-02-26 Javad Ghaderi, Sem Borst, Phil Whiting