English
Related papers

Related papers: Efficient Computation of Sequence Mappability

200 papers

In the k-mappability problem, we are given a string x of length n and integers m and k, and we are asked to count, for each length-m factor y of x, the number of other factors of length m of x that are at Hamming distance at most k from y.…

Data Structures and Algorithms · Computer Science 2017-05-12 Mai Alzamel , Panagiotis Charalampopoulos , Costas S. Iliopoulos , Solon P. Pissis , Jakub Radoszewski , Wing-Kin Sung

The $k$-mappability problem has two integers parameters $m$ and $k$. For every subword of size $m$ in a text $S$, we wish to report the number of indices in $S$ in which the word occurs with at most $k$ mismatches. The problem was lately…

Data Structures and Algorithms · Computer Science 2021-06-15 Amihood Amir , Itai Boneh , Eitan Kondratovsky

The $k$-mismatch problem consists in computing the Hamming distance between a pattern $P$ of length $m$ and every length-$m$ substring of a text $T$ of length $n$, if this distance is no more than $k$. In many real-world applications, any…

Given a pattern of length $m$ and a text of length $n$, the goal in $k$-mismatch pattern matching is to compute, for every $m$-substring of the text, the exact Hamming distance to the pattern or report that it exceeds $k$. This can be…

Data Structures and Algorithms · Computer Science 2017-04-06 Paweł Gawrychowski , Przemysław Uznański

The text-to-pattern Hamming distances problem asks to compute the Hamming distances between a given pattern of length $m$ and all length-$m$ substrings of a given text of length $n\ge m$. We focus on the $k$-mismatch version of the problem,…

Data Structures and Algorithms · Computer Science 2022-03-30 Raphaël Clifford , Paweł Gawrychowski , Tomasz Kociumaka , Daniel P. Martin , Przemysław Uznański

A central task in string processing is text indexing, where the goal is to preprocess a text (a string of length $n$) into an efficient index (a data structure) supporting queries about the text. Cole, Gottlieb, and Lewenstein (STOC 2004)…

Data Structures and Algorithms · Computer Science 2025-10-31 Tomasz Kociumaka , Jakub Radoszewski

One of the most fundamental problems in Computer Science is the Knapsack problem. Given a set of n items with different weights and values, it asks to pick the most valuable subset whose total weight is below a capacity threshold T. Despite…

Data Structures and Algorithms · Computer Science 2018-07-16 Kyriakos Axiotis , Christos Tzamos

The longest common substring with $k$-mismatches problem is to find, given two strings $S_1$ and $S_2$, a longest substring $A_1$ of $S_1$ and $A_2$ of $S_2$ such that the Hamming distance between $A_1$ and $A_2$ is $\le k$. We introduce a…

Data Structures and Algorithms · Computer Science 2015-04-08 Tomas Flouri , Emanuele Giaquinta , Kassian Kobert , Esko Ukkonen

In the $k$-mismatch problem we are given a pattern of length $n$ and a text and must find all locations where the Hamming distance between the pattern and the text is at most $k$. A series of recent breakthroughs have resulted in an…

Data Structures and Algorithms · Computer Science 2021-06-22 Paweł Gawrychowski , Tatiana Starikovskaya

We consider the approximate pattern matching problem under the edit distance. Given a text $T$ of length $n$, a pattern $P$ of length $m$, and a threshold $k$, the task is to find the starting positions of all substrings of $T$ that can be…

Data Structures and Algorithms · Computer Science 2022-04-08 Panagiotis Charalampopoulos , Tomasz Kociumaka , Philip Wellnitz

The algorithmic tasks of computing the Hamming distance between a given pattern of length $m$ and each location in a text of length $n$ is one of the most fundamental algorithmic tasks in string algorithms. Unfortunately, there is evidence…

Data Structures and Algorithms · Computer Science 2015-12-15 Tsvi Kopelowitz , Ely Porat

In this paper, we design new sublinear-time algorithms for solving the gap edit distance problem and for embedding edit distance to Hamming distance. For the gap edit distance problem, we give an $\tilde{O}(\frac{n}{k}+k^2)$-time greedy…

Data Structures and Algorithms · Computer Science 2020-11-17 Tomasz Kociumaka , Barna Saha

The classical pattern matching asks for locating all occurrences of one string, called the pattern, in another, called the text, where a string is simply a sequence of characters. Due to the potential practical applications, it is desirable…

Data Structures and Algorithms · Computer Science 2024-10-30 Jonas Ellert , Paweł Gawrychowski , Adam Górkiewicz , Tatiana Starikovskaya

We revisit the complexity of one of the most basic problems in pattern matching. In the k-mismatch problem we must compute the Hamming distance between a pattern of length m and every m-length substring of a text of length n, as long as…

Data Structures and Algorithms · Computer Science 2015-08-28 Raphaël Clifford , Allyx Fontaine , Ely Porat , Benjamin Sach , Tatiana Starikovskaya

Intervals have been generated in many applications (e.g., temporal databases), and they are often associated with weights, such as prices. This paper addresses the problem of processing top-k weighted stabbing queries on interval data.…

Databases · Computer Science 2024-05-24 Daichi Amagata , Junya Yamada , Yuchen Ji , Takahiro Hara

Permutation patterns and pattern avoidance have been intensively studied in combinatorics and computer science, going back at least to the seminal work of Knuth on stack-sorting (1968). Perhaps the most natural algorithmic question in this…

Data Structures and Algorithms · Computer Science 2019-08-14 Benjamin Aram Berendsohn , László Kozma , Dániel Marx

Let us call a sequence of numbers heapable if they can be sequentially inserted to form a binary tree with the heap property, where each insertion subsequent to the first occurs at a leaf of the tree, i.e. below a previously placed number.…

Data Structures and Algorithms · Computer Science 2010-07-15 John Byers , Brent Heeringa , Michael Mitzenmacher , Georgios Zervas

In this paper we study lower bounds for the fundamental problem of text indexing with mismatches and differences. In this problem we are given a long string of length $n$, the "text", and the task is to preprocess it into a data structure…

Data Structures and Algorithms · Computer Science 2018-12-24 Vincent Cohen-Addad , Laurent Feuilloley , Tatiana Starikovskaya

Palindromes are strings that read the same forward and backward. Problems of computing palindromic structures in strings have been studied for many years with a motivation of their application to biology. The longest palindrome problem is…

Data Structures and Algorithms · Computer Science 2023-06-21 Kazuki Mitani , Takuya Mieno , Kazuhisa Seto , Takashi Horiyama

We consider the streaming complexity of a fundamental task in approximate pattern matching: the $k$-mismatch problem. It asks to compute Hamming distances between a pattern of length $n$ and all length-$n$ substrings of a text for which the…

Data Structures and Algorithms · Computer Science 2018-04-10 Raphaël Clifford , Tomasz Kociumaka , Ely Porat
‹ Prev 1 2 3 10 Next ›