English
Related papers

Related papers: Faster Sublinear-Time Edit Distance

200 papers

We revisit the task of computing the edit distance in sublinear time. In the $(k,K)$-gap edit distance problem the task is to distinguish whether the edit distance of two strings is at most $k$ or at least $K$. It has been established by…

Data Structures and Algorithms · Computer Science 2023-03-17 Karl Bringmann , Alejandro Cassis , Nick Fischer , Vasileios Nakos

We study the problem of approximating the edit distance of two strings in sublinear time, in a setting where one or both string(s) are preprocessed, as initiated by Goldenberg, Rubinstein, Saha (STOC '20). Specifically, in the $(k, K)$-gap…

Data Structures and Algorithms · Computer Science 2022-10-13 Karl Bringmann , Alejandro Cassis , Nick Fischer , Vasileios Nakos

In this paper, we design new sublinear-time algorithms for solving the gap edit distance problem and for embedding edit distance to Hamming distance. For the gap edit distance problem, we give an $\tilde{O}(\frac{n}{k}+k^2)$-time greedy…

Data Structures and Algorithms · Computer Science 2020-11-17 Tomasz Kociumaka , Barna Saha

We study the problem of estimating the edit distance between two $n$-character strings. While exact computation in the worst case is believed to require near-quadratic time, previous work showed that in certain regimes it is possible to…

Data Structures and Algorithms · Computer Science 2020-07-29 Joshua Brakensiek , Moses Charikar , Aviad Rubinstein

The edit distance is a way of quantifying how similar two strings are to one another by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. A simple dynamic…

Computational Complexity · Computer Science 2019-10-03 Elazar Goldenberg , Robert Krauthgamer , Barna Saha

We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if…

Data Structures and Algorithms · Computer Science 2022-10-04 Elazar Goldenberg , Tomasz Kociumaka , Robert Krauthgamer , Barna Saha

We show how to compute the edit distance between two strings of length n up to a factor of 2^{\~O(sqrt(log n))} in n^(1+o(1)) time. This is the first sub-polynomial approximation algorithm for this problem that runs in near-linear time,…

Data Structures and Algorithms · Computer Science 2011-09-27 Alexandr Andoni , Krzysztof Onak

We present an algorithm for approximating the edit distance between two strings of length $n$ in time $n^{1+\varepsilon}$ up to a constant factor, for any $\varepsilon>0$. Our result completes a research direction set forth in the recent…

Data Structures and Algorithms · Computer Science 2022-07-18 Alexandr Andoni , Negev Shekel Nosatzki

We present a near-linear time algorithm that approximates the edit distance between two strings within a polylogarithmic factor; specifically, for strings of length n and every fixed epsilon>0, it can compute a (log n)^O(1/epsilon)…

Data Structures and Algorithms · Computer Science 2010-05-24 Alexandr Andoni , Robert Krauthgamer , Krzysztof Onak

The problem of approximate string matching is important in many different areas such as computational biology, text processing and pattern recognition. A great effort has been made to design efficient algorithms addressing several variants…

Data Structures and Algorithms · Computer Science 2008-07-29 Dimitris Papamichail , Georgios Papamichail

The edit distance is a fundamental measure of sequence similarity, defined as the minimum number of character insertions, deletions, and substitutions needed to transform one string into the other. Given two strings of length at most $n$,…

Data Structures and Algorithms · Computer Science 2023-07-17 Tomasz Kociumaka , Anish Mukherjee , Barna Saha

We consider the approximate pattern matching problem under the edit distance. Given a text $T$ of length $n$, a pattern $P$ of length $m$, and a threshold $k$, the task is to find the starting positions of all substrings of $T$ that can be…

Data Structures and Algorithms · Computer Science 2022-04-08 Panagiotis Charalampopoulos , Tomasz Kociumaka , Philip Wellnitz

Edit distance is a measure of similarity of two strings based on the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. The edit distance can be computed exactly using a…

Data Structures and Algorithms · Computer Science 2021-02-17 Diptarka Chakraborty , Debarati Das , Elazar Goldenberg , Michal Koucky , Michael Saks

We revisit a fundamental problem in string matching: given a pattern of length m and a text of length n, both over an alphabet of size $\sigma$, compute the Hamming distance between the pattern and the text at every location. Several…

Data Structures and Algorithms · Computer Science 2020-01-03 Timothy M. Chan , Shay Golan , Tomasz Kociumaka , Tsvi Kopelowitz , Ely Porat

The edit distance is a way of quantifying how similar two strings are to one another by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. In this paper we…

Data Structures and Algorithms · Computer Science 2016-07-14 Diptarka Chakraborty , Elazar Goldenberg , Michal Koucký

Given a context free language $\mathcal{L(G)}$ over alphabet $\Sigma$ and a string $s \in \Sigma^*$, {\em the language edit distance} problem seeks the minimum number of edits (insertions, deletions and substitutions) required to convert…

Data Structures and Algorithms · Computer Science 2024-10-25 Tomasz Kociumaka , Barna Saha

Computing the edit distance of two strings is one of the most basic problems in computer science and combinatorial optimization. Tree edit distance is a natural generalization of edit distance in which the task is to compute a measure of…

Data Structures and Algorithms · Computer Science 2022-09-16 Debarati Das , Jacob Gilbert , MohammadTaghi Hajiaghayi , Tomasz Kociumaka , Barna Saha , Hamed Saleh

The edit distance of two strings is the minimum number of insertions, deletions, and substitutions of characters needed to transform one string into the other. The textbook dynamic-programming algorithm computes the edit distance of two…

Data Structures and Algorithms · Computer Science 2023-10-25 Alejandro Cassis , Tomasz Kociumaka , Philip Wellnitz

The edit distance of two strings is the minimum number of insertions, deletions, and substitutions needed to transform one string into the other. The textbook algorithm determines the edit distance of length-$n$ strings in $O(n^2)$ time,…

Data Structures and Algorithms · Computer Science 2025-02-04 Egor Gorbachev , Tomasz Kociumaka

Edit distance is a measurement of similarity between two sequences such as strings, point sequences, or polygonal curves. Many matching problems from a variety of areas, such as signal analysis, bioinformatics, etc., need to be solved in a…

Computational Geometry · Computer Science 2020-09-10 Kyle Fox , Xinyi Li
‹ Prev 1 2 3 10 Next ›