English
Related papers

Related papers: Streaming $k$-edit approximate pattern matching vi…

200 papers

In this work, we revisit the fundamental and well-studied problem of approximate pattern matching under edit distance. Given an integer $k$, a pattern $P$ of length $m$, and a text $T$ of length $n \ge m$, the task is to find substrings of…

Data Structures and Algorithms · Computer Science 2021-06-14 Tomasz Kociumaka , Ely Porat , Tatiana Starikovskaya

We present a new streaming algorithm for the $k$-Mismatch problem, one of the most basic problems in pattern matching. Given a pattern and a text, the task is to find all substrings of the text that are at the Hamming distance at most $k$…

Data Structures and Algorithms · Computer Science 2019-04-24 Jakub Radoszewski , Tatiana Starikovskaya

The edit distance is a way of quantifying how similar two strings are to one another by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. In this paper we…

Data Structures and Algorithms · Computer Science 2016-07-14 Diptarka Chakraborty , Elazar Goldenberg , Michal Koucký

We consider the streaming complexity of a fundamental task in approximate pattern matching: the $k$-mismatch problem. It asks to compute Hamming distances between a pattern of length $n$ and all length-$n$ substrings of a text for which the…

Data Structures and Algorithms · Computer Science 2018-04-10 Raphaël Clifford , Tomasz Kociumaka , Ely Porat

In the $k$-mismatch problem we are given a pattern of length $n$ and a text and must find all locations where the Hamming distance between the pattern and the text is at most $k$. A series of recent breakthroughs have resulted in an…

Data Structures and Algorithms · Computer Science 2021-06-22 Paweł Gawrychowski , Tatiana Starikovskaya

We consider the approximate pattern matching problem under edit distance. In this problem we are given a pattern $P$ of length $w$ and a text $T$ of length $n$ over some alphabet $\Sigma$, and a positive integer $k$. The goal is to find all…

Data Structures and Algorithms · Computer Science 2018-11-06 Diptarka Chakraborty , Debarati Das , Michal Koucky

We study the classic NP-Hard problem of finding the maximum $k$-set coverage in the data stream model: given a set system of $m$ sets that are subsets of a universe $\{1,\ldots,n \}$, find the $k$ sets that cover the most number of distinct…

Data Structures and Algorithms · Computer Science 2018-05-11 Andrew McGregor , Hoa T. Vu

We revisit the $k$-mismatch problem in the streaming model on a pattern of length $m$ and a streaming text of length $n$, both over a size-$\sigma$ alphabet. The current state-of-the-art algorithm for the streaming $k$-mismatch problem, by…

Data Structures and Algorithms · Computer Science 2020-04-28 Shay Golan , Tomasz Kociumaka , Tsvi Kopelowitz , Ely Porat

We present streaming algorithms for the graph $k$-matching problem in both the insert-only and dynamic models. Our algorithms, with space complexity matching the best upper bounds, have optimal or near-optimal update time, significantly…

Data Structures and Algorithms · Computer Science 2023-10-18 Jianer Chen , Qin Huang , Iyad Kanj , Qian Li , Ge Xia

We revisit the complexity of one of the most basic problems in pattern matching. In the k-mismatch problem we must compute the Hamming distance between a pattern of length m and every m-length substring of a text of length n, as long as…

Data Structures and Algorithms · Computer Science 2015-08-28 Raphaël Clifford , Allyx Fontaine , Ely Porat , Benjamin Sach , Tatiana Starikovskaya

We investigate the problem of deterministic pattern matching in multiple streams. In this model, one symbol arrives at a time and is associated with one of s streaming texts. The task at each time step is to report if there is a new match…

Data Structures and Algorithms · Computer Science 2012-04-26 Raphael Clifford , Markus Jalsenius , Ely Porat , Benjamin Sach

We consider the approximate pattern matching problem under the edit distance. Given a text $T$ of length $n$, a pattern $P$ of length $m$, and a threshold $k$, the task is to find the starting positions of all substrings of $T$ that can be…

Data Structures and Algorithms · Computer Science 2022-04-08 Panagiotis Charalampopoulos , Tomasz Kociumaka , Philip Wellnitz

Approximate pattern matching is a natural and well-studied problem on strings: Given a text $T$, a pattern $P$, and a threshold $k$, find (the starting positions of) all substrings of $T$ that are at distance at most $k$ from $P$. We…

Data Structures and Algorithms · Computer Science 2020-11-17 Panagiotis Charalampopoulos , Tomasz Kociumaka , Philip Wellnitz

In this paper we present improved bounds for approximating maximum matchings in bipartite graphs in the streaming model. First, we consider the question of how well maximum matching can be approximated in a single pass over the input using…

Data Structures and Algorithms · Computer Science 2021-03-18 Michael Kapralov

We show that both clustering and subspace embeddings can be performed in the streaming model with the same asymptotic efficiency as in the central/offline setting. For $(k, z)$-clustering in the streaming model, we achieve a number of words…

Data Structures and Algorithms · Computer Science 2025-04-24 Vincent Cohen-Addad , Liudeng Wang , David P. Woodruff , Samson Zhou

The equidistant subsequence pattern matching problem is considered. Given a pattern string $P$ and a text string $T$, we say that $P$ is an \emph{equidistant subsequence} of $T$ if $P$ is a subsequence of the text such that consecutive…

Data Structures and Algorithms · Computer Science 2020-02-18 Mitsuru Funakoshi , Yuto Nakashima , Shunsuke Inenaga , Hideo Bannai , Masayuki Takeda , Ayumi Shinohara

We consider the problem of computing distance between a pattern of length $n$ and all $n$-length subwords of a text in the streaming model. In the streaming setting, only the Hamming distance ($L_0$) has been studied. It is known that…

Data Structures and Algorithms · Computer Science 2020-11-10 Tatiana Starikovskaya , Michal Svagerka , Przemysław Uznański

We consider the problem of estimating the value of max cut in a graph in the streaming model of computation. At one extreme, there is a trivial $2$-approximation for this problem that uses only $O(\log n)$ space, namely, count the number of…

Data Structures and Algorithms · Computer Science 2014-09-09 Michael Kapralov , Sanjeev Khanna , Madhu Sudan

Analyzing patterns in data streams generated by network traffic, sensor networks, or satellite feeds is a challenge for systems in which the available storage is limited. In addition, real data is noisy, which makes designing data stream…

Data Structures and Algorithms · Computer Science 2017-11-15 Elena Grigorescu , Erfan Sadeqi Azer , Samson Zhou

We study the $k$-connectivity augmentation problem ($k$-CAP) in the single-pass streaming model. Given a $(k-1)$-edge connected graph $G=(V,E)$ that is stored in memory, and a stream of weighted edges $L$ with weights in $\{0,1,\dots,W\}$,…

Data Structures and Algorithms · Computer Science 2024-02-19 Ce Jin , Michael Kapralov , Sepideh Mahabadi , Ali Vakilian
‹ Prev 1 2 3 10 Next ›