Related papers: Optimal Quantile Approximation in Streams

200 papers

Estimating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of $n$ items from a data universe equipped with a total order, the task is to compute a sketch (data…

Data Structures and Algorithms · Computer Science 2023-08-25 Graham Cormode, Zohar Karnin, Edo Liberty, Justin Thaler, Pavel Veselý

Approximating quantiles and distributions over streaming data has been studied for roughly two decades now. Recently, Karnin, Lang, and Liberty proposed the first asymptotically optimal algorithm for doing so. This manuscript complements…

Data Structures and Algorithms · Computer Science 2019-07-02 Nikita Ivkin, Edo Liberty, Kevin Lang, Zohar Karnin, Vladimir Braverman

Computing the approximate quantiles or ranks of a stream is a fundamental task in data monitoring. Given a stream of elements $x_1, x_2, \dots, x_n$ and a query $x$, a relative-error quantile estimation algorithm can estimate the rank of…

Data Structures and Algorithms · Computer Science 2024-11-05 Elena Gribelyuk, Pachara Sawettamalya, Hongxun Wu, Huacheng Yu

We resolve the space complexity of linear sketches for approximating the maximum matching problem in dynamic graph streams where the stream may include both edge insertion and deletion. Specifically, we show that for any $\epsilon > 0$,…

Data Structures and Algorithms · Computer Science 2015-05-07 Sepehr Assadi, Sanjeev Khanna, Yang Li, Grigory Yaroslavtsev

An $\varepsilon$-approximate quantile sketch over a stream of $n$ inputs approximates the rank of any query point $q$ (that is, the number of input points less than $q$) up to an additive error of $\varepsilon n$, generally with some…

Data Structures and Algorithms · Computer Science 2023-04-18 Nicholas Schiefer, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal, Tal Wagner

Estimating quantiles is one of the foundational problems of data sketching. Given $n$ elements $x_1, x_2, \dots, x_n$ from some universe of size $U$ arriving in a data stream, a quantile sketch estimates the rank of any element with…

Data Structures and Algorithms · Computer Science 2024-04-08 Meghal Gupta, Mihir Singhal, Hongxun Wu

Frequency estimation is one of the most fundamental problems in streaming algorithms. Given a stream $S$ of elements from some universe $U=\{1 \ldots n\}$, the goal is to compute, in a single pass, a short sketch of $S$ so that for any…

Data Structures and Algorithms · Computer Science 2021-11-09 Piotr Indyk, Shyam Narayanan, David P. Woodruff

Maximum coverage and minimum set cover problems (collectively called coverage problems) have been studied extensively in streaming models. However, previous research not only achieves sub-optimal approximation factors and space…

Data Structures and Algorithms · Computer Science 2017-03-13 Mohammadhossein Bateni, Hossein Esfandiari, Vahab Mirrokni

We initiate a broad study of classical problems in the streaming model with insertions and deletions in the setting where we allow the approximation factor $\alpha$ to be much larger than $1$. Such algorithms can use significantly less…

Data Structures and Algorithms · Computer Science 2022-07-19 Yi Li, Honghao Lin, David P. Woodruff, Yuheng Zhang

We consider the problem of finding a minimum cut of a weighted graph presented as a single-pass stream. While graph sparsification in streams has been intensively studied, the specific application of finding minimum cuts in streams is less…

Data Structures and Algorithms · Computer Science 2024-12-09 Matthew Ding, Alexandro Garces, Jason Li, Honghao Lin, Jelani Nelson, Vihan Shah, David P. Woodruff

Sketching and streaming algorithms are at the forefront of current research directions for cut problems in graphs. In the streaming model, we show that $(1-\epsilon)$-approximation for Max-Cut must use $n^{1-O(\epsilon)}$ space; moreover,…

Data Structures and Algorithms · Computer Science 2026-02-23 Dmitry Kogan, Robert Krauthgamer

The distinct elements problem is one of the fundamental problems in streaming algorithms: given a stream of integers in the range $\{1,\ldots,n\}$, we wish to provide a $(1+\varepsilon)$ approximation to the number of distinct elements…

Data Structures and Algorithms · Computer Science 2019-01-07 Jarosław Błasiok

We develop a new algorithmic technique that allows one to transfer some constant-time approximation algorithms for general graphs into random-order streaming algorithms. We illustrate our technique by proving that in random order streams with…

Data Structures and Algorithms · Computer Science 2017-11-15 Pan Peng, Christian Sohler

We consider the problem of estimating the value of max cut in a graph in the streaming model of computation. At one extreme, there is a trivial $2$-approximation for this problem that uses only $O(\log n)$ space, namely, count the number of…

Data Structures and Algorithms · Computer Science 2014-09-09 Michael Kapralov, Sanjeev Khanna, Madhu Sudan

Space-efficient streaming estimation of quantiles in massive datasets is a fundamental problem with numerous applications in data monitoring and analysis. While theoretical research led to optimal algorithms, such as the Greenwald-Khanna…

Data Structures and Algorithms · Computer Science 2025-09-12 Aleksander Łukasiewicz, Jakub Tětek, Pavel Veselý

Quantiles, such as the median or percentiles, provide concise and useful information about the distribution of a collection of items, drawn from a totally ordered universe. We study data structures, called quantile summaries, which keep…

Data Structures and Algorithms · Computer Science 2026-01-27 Graham Cormode, Pavel Veselý

We give the first $L_1$-sketching algorithm for integer vectors which produces nearly optimal sized sketches in nearly linear time. This answers the first open problem in the list of open problems from the 2006 IITK Workshop on Algorithms for…

Data Structures and Algorithms · Computer Science 2009-04-15 Jelani Nelson, David P. Woodruff

Approximate Nearest Neighbor (ANN) search and Approximate Kernel Density Estimation (A-KDE) are fundamental problems at the core of modern machine learning, with broad applications in data analysis, information systems, and large-scale…

Machine Learning · Computer Science 2025-10-28 Ved Danait, Srijan Das, Sujoy Bhore

The majority of streaming problems are defined and analyzed in a static setting, where the data stream is any worst-case sequence of insertions and deletions that is fixed in advance. However, many real-world applications require a more…

Data Structures and Algorithms · Computer Science 2024-09-25 Elena Gribelyuk, Honghao Lin, David P. Woodruff, Huacheng Yu, Samson Zhou

Stream monitoring is fundamental in many data stream applications, such as financial data trackers, security, anomaly detection, and load balancing. In that respect, quantiles are of particular interest, as they often capture the user's…

Data Structures and Algorithms · Computer Science 2022-01-07 Rana Shahout, Roy Friedman, Ran Ben Basat