Related papers: Faster Pattern Matching under Edit Distance

200 papers

In the $k$-Edit Circular Pattern Matching ($k$-Edit CPM) problem, we are given a length-$n$ text $T$, a length-$m$ pattern $P$, and a positive integer threshold $k$, and we are to report all starting positions of the substrings of $T$ that…

Data Structures and Algorithms · Computer Science 2024-02-23 Panagiotis Charalampopoulos, Solon P. Pissis, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń, Wiktor Zuba

Approximate Pattern Matching is among the most fundamental string-processing tasks. Given a text $T$ of length $n$, a pattern $P$ of length $m$, and a threshold $k$, the task is to identify the fragments of $T$ that are at distance at most…

Data Structures and Algorithms · Computer Science 2024-10-10 Tomasz Kociumaka, Jakob Nogler, Philip Wellnitz

In this work, we revisit the fundamental and well-studied problem of approximate pattern matching under edit distance. Given an integer $k$, a pattern $P$ of length $m$, and a text $T$ of length $n \ge m$, the task is to find substrings of…

Data Structures and Algorithms · Computer Science 2021-06-14 Tomasz Kociumaka, Ely Porat, Tatiana Starikovskaya

The decades-old Pattern Matching with Edits problem, given a length-$n$ string $T$ (the text), a length-$m$ string $P$ (the pattern), and a positive integer $k$ (the threshold), asks to list all fragments of $T$ that are at edit distance at…

Data Structures and Algorithms · Computer Science 2024-10-10 Tomasz Kociumaka, Jakob Nogler, Philip Wellnitz

We study the classic Text-to-Pattern Hamming Distances problem: given a pattern $P$ of length $m$ and a text $T$ of length $n$, both over a polynomial-size alphabet, compute the Hamming distance between $P$ and $T[i \mathinner{.\,.} i+m-1]$ for…

Data Structures and Algorithms · Computer Science 2024-12-20 Timothy M. Chan, Ce Jin, Virginia Vassilevska Williams, Yinzhan Xu

We study the classical approximate string matching problem, that is, given strings $P$ and $Q$ and an error threshold $k$, find all ending positions of substrings of $Q$ whose edit distance to $P$ is at most $k$. Let $P$ and $Q$ have…

Data Structures and Algorithms · Computer Science 2011-03-21 Philip Bille

We consider the approximate pattern matching problem under edit distance. In this problem we are given a pattern $P$ of length $w$ and a text $T$ of length $n$ over some alphabet $\Sigma$, and a positive integer $k$. The goal is to find all…

Data Structures and Algorithms · Computer Science 2018-11-06 Diptarka Chakraborty, Debarati Das, Michal Koucky

The problem of approximate string matching is important in many different areas such as computational biology, text processing and pattern recognition. A great effort has been made to design efficient algorithms addressing several variants…

Data Structures and Algorithms · Computer Science 2008-07-29 Dimitris Papamichail, Georgios Papamichail

We study the fundamental problem of approximating the edit distance of two strings. After an extensive line of research led to the development of a constant-factor approximation algorithm in almost-linear time, recent years have witnessed a…

Data Structures and Algorithms · Computer Science 2023-12-05 Karl Bringmann, Alejandro Cassis, Nick Fischer, Tomasz Kociumaka

In the decades-old Pattern Matching with Edits problem, given a length-$n$ string $T$ (the text), a length-$m$ string $P$ (the pattern), and a positive integer $k$ (the threshold), the task is to list the $k$-error occurrences of $P$ in…

Data Structures and Algorithms · Computer Science 2026-04-20 Tomasz Kociumaka, Jakob Nogler, Philip Wellnitz

We revisit a fundamental problem in string matching: given a pattern of length $m$ and a text of length $n$, both over an alphabet of size $\sigma$, compute the Hamming distance between the pattern and the text at every location. Several…

Data Structures and Algorithms · Computer Science 2020-01-03 Timothy M. Chan, Shay Golan, Tomasz Kociumaka, Tsvi Kopelowitz, Ely Porat

We consider approximate circular pattern matching (CPM, in short) under the Hamming and edit distance, in which we are given a length-$n$ text $T$, a length-$m$ pattern $P$, and a threshold $k>0$, and we are to report all starting positions…

Data Structures and Algorithms · Computer Science 2025-06-13 Panagiotis Charalampopoulos, Tomasz Kociumaka, Jakub Radoszewski, Solon P. Pissis, Wojciech Rytter, Tomasz Waleń, Wiktor Zuba

The $k$-mismatch problem consists in computing the Hamming distance between a pattern $P$ of length $m$ and every length-$m$ substring of a text $T$ of length $n$, if this distance is no more than $k$. In many real-world applications, any…

Approximate pattern matching is a natural and well-studied problem on strings: Given a text $T$, a pattern $P$, and a threshold $k$, find (the starting positions of) all substrings of $T$ that are at distance at most $k$ from $P$. We…

Data Structures and Algorithms · Computer Science 2020-11-17 Panagiotis Charalampopoulos, Tomasz Kociumaka, Philip Wellnitz

The text-to-pattern Hamming distances problem asks to compute the Hamming distances between a given pattern of length $m$ and all length-$m$ substrings of a given text of length $n\ge m$. We focus on the $k$-mismatch version of the problem,…

Data Structures and Algorithms · Computer Science 2022-03-30 Raphaël Clifford, Paweł Gawrychowski, Tomasz Kociumaka, Daniel P. Martin, Przemysław Uznański

Given a pattern of length $m$ and a text of length $n$, the goal in $k$-mismatch pattern matching is to compute, for every $m$-substring of the text, the exact Hamming distance to the pattern or report that it exceeds $k$. This can be…

Data Structures and Algorithms · Computer Science 2017-04-06 Paweł Gawrychowski, Przemysław Uznański

The classic exact pattern matching problem, given two strings -- a pattern $P$ of length $m$ and a text $T$ of length $n$ -- asks whether $P$ occurs as a substring of $T$. A property tester for the problem needs to distinguish (with high…

Data Structures and Algorithms · Computer Science 2025-10-21 Ce Jin, Tomasz Kociumaka

Given a text $T$ of length $n$ and a pattern $P$ of length $m$, the string matching problem is a task to find all occurrences of $P$ in $T$. In this study, we propose an algorithm that solves this problem in $O((n + m)q)$ time considering…

Data Structures and Algorithms · Computer Science 2020-04-14 Satoshi Kobayashi, Diptarama Hendrian, Ryo Yoshinaka, Ayumi Shinohara

In this paper, we design new sublinear-time algorithms for solving the gap edit distance problem and for embedding edit distance to Hamming distance. For the gap edit distance problem, we give an $\tilde{O}(\frac{n}{k}+k^2)$-time greedy…

Data Structures and Algorithms · Computer Science 2020-11-17 Tomasz Kociumaka, Barna Saha

In this work, we address the problem of approximate pattern matching with wildcards. Given a pattern $P$ of length $m$ containing $D$ wildcards, a text $T$ of length $n$, and an integer $k$, our objective is to identify all fragments of $T$…

Data Structures and Algorithms · Computer Science 2024-05-22 Gabriel Bathie, Panagiotis Charalampopoulos, Tatiana Starikovskaya