English
Related papers

Related papers: A Near-Optimal Algorithm for L1-Difference

200 papers

We study sketching and streaming algorithms for the Longest Common Subsequence problem (LCS) on strings of small alphabet size $|\Sigma|$. For the problem of deciding whether the LCS of strings $x,y$ has length at least $L$, we obtain a…

Data Structures and Algorithms · Computer Science 2018-10-03 Karl Bringmann , Bhaskar Ray Chaudhury

This paper resolves one of the longest standing basic problems in the streaming computational model. Namely, optimal construction of quantile sketches. An $\varepsilon$ approximate quantile sketch receives a stream of items $x_1,\ldots,x_n$…

Data Structures and Algorithms · Computer Science 2016-04-07 Zohar Karnin , Kevin Lang , Edo Liberty

We resolve the space complexity of linear sketches for approximating the maximum matching problem in dynamic graph streams where the stream may include both edge insertion and deletion. Specifically, we show that for any $\epsilon > 0$,…

Data Structures and Algorithms · Computer Science 2015-05-07 Sepehr Assadi , Sanjeev Khanna , Yang Li , Grigory Yaroslavtsev

We initiate a broad study of classical problems in the streaming model with insertions and deletions in the setting where we allow the approximation factor $\alpha$ to be much larger than $1$. Such algorithms can use significantly less…

Data Structures and Algorithms · Computer Science 2022-07-19 Yi Li , Honghao Lin , David P. Woodruff , Yuheng Zhang

We consider statistical as well as algorithmic aspects of solving large-scale least-squares (LS) problems using randomized sketching algorithms. For a LS problem with input data $(X, Y) \in \mathbb{R}^{n \times p} \times \mathbb{R}^n$,…

Machine Learning · Statistics 2015-08-26 Garvesh Raskutti , Michael Mahoney

Maximum coverage and minimum set cover problems --collectively called coverage problems-- have been studied extensively in streaming models. However, previous research not only achieve sub-optimal approximation factors and space…

Data Structures and Algorithms · Computer Science 2017-03-13 Mohammadhossein Bateni , Hossein Esfandiari , Vahab Mirrokni

We consider the problem of computing a $(1+\epsilon)$-approximation of the Hamming distance between a pattern of length $n$ and successive substrings of a stream. We first look at the one-way randomised communication complexity of this…

Data Structures and Algorithms · Computer Science 2016-02-24 Raphael Clifford , Tatiana Starikovskaya

Edit distance is an important measure of string similarity. It counts the number of insertions, deletions and substitutions one has to make to a string $x$ to get a string $y$. In this paper we design an almost linear-size sketching scheme…

Data Structures and Algorithms · Computer Science 2024-06-18 Michal Koucký , Michael Saks

The metric sketching problem is defined as follows. Given a metric on $n$ points, and $\epsilon>0$, we wish to produce a small size data structure (sketch) that, given any pair of point indices, recovers the distance between the points up…

Computational Geometry · Computer Science 2016-11-30 Piotr Indyk , Tal Wagner

We study the problem of solving semidefinite programs (SDP) in the streaming model. Specifically, $m$ constraint matrices and a target matrix $C$, all of size $n\times n$ together with a vector $b\in \mathbb{R}^m$ are streamed to us…

Data Structures and Algorithms · Computer Science 2023-09-12 Zhao Song , Mingquan Ye , Lichen Zhang

The majority of streaming problems are defined and analyzed in a static setting, where the data stream is any worst-case sequence of insertions and deletions that is fixed in advance. However, many real-world applications require a more…

Data Structures and Algorithms · Computer Science 2024-09-25 Elena Gribelyuk , Honghao Lin , David P. Woodruff , Huacheng Yu , Samson Zhou

We present faster algorithms for approximate maximum flow in undirected graphs with good separator structures, such as bounded genus, minor free, and geometric graphs. Given such a graph with $n$ vertices, $m$ edges along with a recursive…

Data Structures and Algorithms · Computer Science 2012-10-19 Gary Miller , Richard Peng

We adapt a well known streaming algorithm for approximating item frequencies to the matrix sketching setting. The algorithm receives the rows of a large matrix $A \in \R^{n \times m}$ one after the other in a streaming fashion. It maintains…

Data Structures and Algorithms · Computer Science 2012-07-12 Edo Liberty

We present the first sublinear memory sketch that can be queried to find the nearest neighbors in a dataset. Our online sketching algorithm compresses an N element dataset to a sketch of size $O(N^b \log^3 N)$ in $O(N^{(b+1)} \log^3 N)$…

Data Structures and Algorithms · Computer Science 2020-09-15 Benjamin Coleman , Richard G. Baraniuk , Anshumali Shrivastava

Estimating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of $n$ items from a data universe equipped with a total order, the task is to compute a sketch (data…

Data Structures and Algorithms · Computer Science 2023-08-25 Graham Cormode , Zohar Karnin , Edo Liberty , Justin Thaler , Pavel Veselý

We improve upon previous oblivious sketching and turnstile streaming results for $\ell_1$ and logistic regression, giving a much smaller sketching dimension achieving $O(1)$-approximation and yielding an efficient optimization problem in…

Data Structures and Algorithms · Computer Science 2023-04-05 Alexander Munteanu , Simon Omlor , David Woodruff

We develop an algorithm for estimating the values of a vector x in R^n over a support S of size k from a randomized sparse binary linear sketch Ax of size O(k). Given Ax and S, we can recover x' with ||x' - x_S||_2 <= eps ||x - x_S||_2 with…

Data Structures and Algorithms · Computer Science 2010-11-22 Eric Price

Sketching is one of the most fundamental tools in large-scale machine learning. It enables runtime and memory saving via randomly compressing the original large problem into lower dimensions. In this paper, we propose a novel sketching…

Machine Learning · Computer Science 2023-06-08 Zhao Song , Yitan Wang , Zheng Yu , Lichen Zhang

The problem of estimating frequency moments of a data stream has attracted a lot of attention since the onset of streaming algorithms [AMS99]. While the space complexity for approximately computing the $p^{\rm th}$ moment, for $p\in(0,2]$…

Data Structures and Algorithms · Computer Science 2013-06-27 Alexandr Andoni , Huy L. Nguyen , Yury Polyanskiy , Yihong Wu

LP-type problems such as the Minimum Enclosing Ball (MEB), Linear Support Vector Machine (SVM), Linear Programming (LP), and Semidefinite Programming (SDP) are fundamental combinatorial optimization problems, with many important…

Data Structures and Algorithms · Computer Science 2025-07-17 N. Efe Çekirge , William Gay , David P. Woodruff
‹ Prev 1 2 3 10 Next ›