English
Related papers

Related papers: Streaming Algorithms for Support-Aware Histograms

200 papers

In many problems in data mining and machine learning, data items that need to be clustered or classified are not points in a high-dimensional space, but are distributions (points on a high dimensional simplex). For distributions, natural…

Data Structures and Algorithms · Computer Science 2007-07-13 Sudipto Guha , Andrew McGregor , Suresh Venkatasubramanian

We consider the problem of computing distance between a pattern of length $n$ and all $n$-length subwords of a text in the streaming model. In the streaming setting, only the Hamming distance ($L_0$) has been studied. It is known that…

Data Structures and Algorithms · Computer Science 2020-11-10 Tatiana Starikovskaya , Michal Svagerka , Przemysław Uznański

In this paper, we study streaming algorithms that minimize the number of changes made to their internal state (i.e., memory contents). While the design of streaming algorithms typically focuses on minimizing space and update time, these…

Data Structures and Algorithms · Computer Science 2024-06-12 Rajesh Jayaram , David P. Woodruff , Samson Zhou

The problem of estimating the pth moment F_p (p nonnegative and real) in data streams is as follows. There is a vector x which starts at 0, and many updates of the form x_i <-- x_i + v come sequentially in a stream. The algorithm also…

Data Structures and Algorithms · Computer Science 2009-04-09 Daniel M. Kane , Jelani Nelson , David P. Woodruff

Work on approximate linear algebra has led to efficient distributed and streaming algorithms for problems such as approximate matrix multiplication, low rank approximation, and regression, primarily for the Euclidean norm $\ell_2$. We study…

Data Structures and Algorithms · Computer Science 2018-07-10 Graham Cormode , Charlie Dickens , David P. Woodruff

We consider streaming algorithms for approximating a product of input probabilities up to multiplicative error of $1-\epsilon$. It is shown that every randomized streaming algorithm for this problem needs space $\Omega(\log n + \log b -…

Data Structures and Algorithms · Computer Science 2025-10-02 Markus Lohrey , Leon Rische , Louisa Seelbach Benkner , Julio Xochitemol

We study high-dimensional robust statistics tasks in the streaming model. A recent line of work obtained computationally efficient algorithms for a range of high-dimensional robust estimation tasks. Unfortunately, all previous algorithms…

Data Structures and Algorithms · Computer Science 2023-05-04 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia , Thanasis Pittas

In this thesis, we explore streaming algorithms for approximating constraint satisfaction problems (CSPs). The setup is roughly the following: A computer has limited memory space, sees a long "stream" of local constraints on a set of…

Data Structures and Algorithms · Computer Science 2023-04-14 Noah G. Singer

We introduce a new computational model for data streams: asymptotically exact streaming algorithms. These algorithms have an approximation ratio that tends to one as the length of the stream goes to infinity while the memory used by the…

Data Structures and Algorithms · Computer Science 2014-08-11 Marc Heinrich , Alexander Munteanu , Christian Sohler

We characterize the streaming space complexity of every symmetric norm $l$ (a norm on $\mathbb{R}^n$ invariant under sign-flips and coordinate-permutations), by relating this space complexity to the measure-concentration characteristics of…

Data Structures and Algorithms · Computer Science 2017-06-27 Jaroslaw Blasiok , Vladimir Braverman , Stephen R. Chestnut , Robert Krauthgamer , Lin F. Yang

The Hierarchical Heavy Hitters problem extends the notion of frequent items to data arranged in a hierarchy. This problem has applications to network traffic monitoring, anomaly detection, and DDoS detection. We present a new streaming…

Data Structures and Algorithms · Computer Science 2011-08-10 Michael Mitzenmacher , Thomas Steinke , Justin Thaler

Most known algorithms in the streaming model of computation aim to approximate a single function such as an $\ell_p$-norm. In 2009, Nelson [\url{https://sublinear.info}, Open Problem 30] asked if it possible to design \emph{universal…

Data Structures and Algorithms · Computer Science 2020-04-07 Vladimir Braverman , Robert Krauthgamer , Lin F. Yang

We initiate a broad study of classical problems in the streaming model with insertions and deletions in the setting where we allow the approximation factor $\alpha$ to be much larger than $1$. Such algorithms can use significantly less…

Data Structures and Algorithms · Computer Science 2022-07-19 Yi Li , Honghao Lin , David P. Woodruff , Yuheng Zhang

In this paper, we design the first streaming algorithms for the problem of multitasking scheduling on parallel machines with shared processing. In one pass, our streaming approximation schemes can provide an approximate value of the optimal…

Data Structures and Algorithms · Computer Science 2022-04-06 Bin Fu , Yumei Huo , Hairong Zhao

Tracking and approximating data matrices in streaming fashion is a fundamental challenge. The problem requires more care and attention when data comes from multiple distributed sites, each receiving a stream of data. This paper considers…

Databases · Computer Science 2014-05-01 Mina Ghashami , Jeff M. Phillips , Feifei Li

A technique introduced by Indyk and Woodruff [STOC 2005] has inspired several recent advances in data-stream algorithms. We show that a number of these results follow easily from the application of a single probabilistic method called…

Data Structures and Algorithms · Computer Science 2011-04-26 Alexandr Andoni , Robert Krauthgamer , Krzysztof Onak

We investigate the problem of testing the equivalence between two discrete histograms. A {\em $k$-histogram} over $[n]$ is a probability distribution that is piecewise constant over some set of $k$ intervals over $[n]$. Histograms have been…

Data Structures and Algorithms · Computer Science 2017-03-07 Ilias Diakonikolas , Daniel M. Kane , Vladimir Nikishkin

Estimating the number of subgraphs in data streams is a fundamental problem that has received great attention in the past decade. In this paper, we give improved streaming algorithms for approximately counting the number of occurrences of…

Data Structures and Algorithms · Computer Science 2022-03-29 Hendrik Fichtenberger , Pan Peng

Frequency estimation in data streams is one of the classical problems in streaming algorithms. Following much research, there are now almost matching upper and lower bounds for the trade-off needed between the number of samples and the…

Computational Complexity · Computer Science 2023-01-16 Shachar Lovett , Jiapeng Zhang

In this paper, we develop the first one-pass streaming algorithm for submodular maximization that does not evaluate the entire stream even once. By carefully subsampling each element of data stream, our algorithm enjoys the tightest…

Machine Learning · Computer Science 2018-02-21 Moran Feldman , Amin Karbasi , Ehsan Kazemi
‹ Prev 1 2 3 10 Next ›