English
Related papers

Related papers: An Improved Sketching Algorithm for Edit Distance

200 papers

We show that in the document exchange problem, where Alice holds $x \in \{0,1\}^n$ and Bob holds $y \in \{0,1\}^n$, Alice can send Bob a message of size $O(K(\log^2 K+\log n))$ bits such that Bob can recover $x$ using the message and his…

Data Structures and Algorithms · Computer Science 2016-07-15 Djamal Belazzougui , Qin Zhang

Edit distance is an important measure of string similarity. It counts the number of insertions, deletions and substitutions one has to make to a string $x$ to get a string $y$. In this paper we design an almost linear-size sketching scheme…

Data Structures and Algorithms · Computer Science 2024-06-18 Michal Koucký , Michael Saks

We present an algorithm for approximating the edit distance $\operatorname{ed}(x, y)$ between two strings $x$ and $y$ in time parameterized by the degree to which one of the strings $x$ satisfies a natural pseudorandomness property. The…

Data Structures and Algorithms · Computer Science 2018-11-13 William Kuszmaul

We consider an efficient two-party protocol for securely computing the similarity of strings w.r.t. an extended edit distance measure. Here, two parties possessing strings $x$ and $y$, respectively, want to jointly compute an approximate…

Cryptography and Security · Computer Science 2019-12-02 Yohei Yoshimoto , Masaharu Kataoka , Yoshimasa Takabatake , Tomohiro I , Kilho Shin , Hiroshi Sakamoto

The edit distance is a way of quantifying how similar two strings are to one another by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. In this paper we…

Data Structures and Algorithms · Computer Science 2016-07-14 Diptarka Chakraborty , Elazar Goldenberg , Michal Koucký

The edit distance of two strings is the minimum number of insertions, deletions, and substitutions of characters needed to transform one string into the other. The textbook dynamic-programming algorithm computes the edit distance of two…

Data Structures and Algorithms · Computer Science 2023-10-25 Alejandro Cassis , Tomasz Kociumaka , Philip Wellnitz

We study the robust communication complexity of maximum matching. Edges of an arbitrary $n$-vertex graph $G$ are randomly partitioned between Alice and Bob independently and uniformly. Alice has to send a single message to Bob such that Bob…

Data Structures and Algorithms · Computer Science 2023-05-03 Amir Azarmehr , Soheil Behnezhad

The shift distance $\mathsf{sh}(S_1,S_2)$ between two strings $S_1$ and $S_2$ of the same length is defined as the minimum Hamming distance between $S_1$ and any rotation (cyclic shift) of $S_2$. We study the problem of sketching the shift…

Data Structures and Algorithms · Computer Science 2020-06-25 Shay Golan , Tomasz Kociumaka , Tsvi Kopelowitz , Ely Porat , Przemysław Uznański

Suppose that we have two parties that possess each a binary string. Suppose that the length of the first string (document) is $n$ and that the two strings (documents) have edit distance (minimal number of deletes, inserts and substitutions…

Data Structures and Algorithms · Computer Science 2015-12-04 Djamal Belazzougui

We present novel randomized approximation schemes for the Edit Distance (ED) problem and the Longest Common Subsequence (LCS) problem that, for any constant $\epsilon>0$, compute a $(1+\epsilon)$-approximation for ED and a…

Data Structures and Algorithms · Computer Science 2026-04-01 Xiao Mao , Aviad Rubinstein

The edit distance $ed(X,Y)$ of two strings $X,Y\in \Sigma^*$ is the minimum number of character edits (insertions, deletions, and substitutions) needed to transform $X$ into $Y$. Its weighted counterpart $ed^w(X,Y)$ minimizes the total cost…

Data Structures and Algorithms · Computer Science 2025-07-04 Itai Boneh , Egor Gorbachev , Tomasz Kociumaka

We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if…

Data Structures and Algorithms · Computer Science 2022-10-04 Elazar Goldenberg , Tomasz Kociumaka , Robert Krauthgamer , Barna Saha

Given a pair of strings, the problems of computing their Longest Common Subsequence and Edit Distance have been extensively studied for decades. For exact algorithms, LCS and Edit Distance (with character insertions and deletions) are…

Data Structures and Algorithms · Computer Science 2019-04-12 Aviad Rubinstein , Zhao Song

The edit distance (a.k.a. the Levenshtein distance) between two strings is defined as the minimum number of insertions, deletions or substitutions of symbols needed to transform one string into another. The problem of computing the edit…

Computational Complexity · Computer Science 2017-08-17 Arturs Backurs , Piotr Indyk

The edit distance of two strings is the minimum number of insertions, deletions, and substitutions needed to transform one string into the other. The textbook algorithm determines the edit distance of length-$n$ strings in $O(n^2)$ time,…

Data Structures and Algorithms · Computer Science 2025-02-04 Egor Gorbachev , Tomasz Kociumaka

The normalized edit distance is one of the distances derived from the edit distance. It is useful in some applications because it takes into account the lengths of the two strings compared. The normalized edit distance is not defined in…

Neural and Evolutionary Computing · Computer Science 2013-12-09 Muhammad Marwan Muhammad Fuad

In this paper we provide a new locally consistent decomposition of strings. Each string $x$ is decomposed into blocks that can be described by grammars of size $\widetilde{O}(k)$ (using some amount of randomness). If we take two strings $x$…

Data Structures and Algorithms · Computer Science 2023-11-28 Sudatta Bhattacharya , Michal Koucký

The edit distance (ED) and longest common subsequence (LCS) are two fundamental problems which quantify how similar two strings are to one another. In this paper, we consider these problems in the asymmetric streaming model introduced by…

Data Structures and Algorithms · Computer Science 2020-04-17 Alireza Farhadi , MohammadTaghi Hajiaghayi , Aviad Rubinstein , Saeed Seddighin

We present an algorithm for approximating the edit distance between two strings of length $n$ in time $n^{1+\varepsilon}$ up to a constant factor, for any $\varepsilon>0$. Our result completes a research direction set forth in the recent…

Data Structures and Algorithms · Computer Science 2022-07-18 Alexandr Andoni , Negev Shekel Nosatzki

Edit distance is a fundamental measure of distance between strings and has been widely studied in computer science. While the problem of estimating edit distance has been studied extensively, the equally important question of actually…

Data Structures and Algorithms · Computer Science 2018-05-08 Moses Charikar , Ofir Geri , Michael P. Kim , William Kuszmaul
‹ Prev 1 2 3 10 Next ›