Related papers: Closed Repeats

200 papers

Repeat finding in strings has important applications in subfields such as computational biology. Surprisingly, no prior work on repeat finding has considered the constraint on the locality of repeats. In this paper, we propose and study…

Data Structures and Algorithms · Computer Science 2015-01-27 Atalay Mert İleri, M. Oğuzhan Külekci, Bojian Xu

In this paper we initiate the study of computing a maximal (not necessarily maximum) repeating pattern in a single input string, where the corresponding problems have been studied (e.g., a maximal common subsequence) only in two or more…

Data Structures and Algorithms · Computer Science 2026-01-21 Mingyang Gong, Adiesha Liyanage, Braeden Sopp, Binhai Zhu

A string is closed if it has length 1 or has a nonempty border without internal occurrences. In this paper we introduce the definition of a maximal closed substring (MCS), which is an occurrence of a closed substring that cannot be…

Data Structures and Algorithms · Computer Science 2024-05-15 Golnaz Badkobeh, Alessandro De Luca, Gabriele Fici, Simon Puglisi
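
The definition of a closed string quoted in the entry above can be checked directly. The following is a minimal brute-force sketch for illustration only, not the algorithm of any listed paper; the function name `is_closed` is hypothetical, and the empty string is treated as not closed because the quoted definition does not cover it.

```python
def is_closed(s: str) -> bool:
    """Check the quoted definition: length 1, or a nonempty border
    (a proper prefix that is also a suffix) with no internal occurrence."""
    n = len(s)
    if n == 1:
        return True
    # Try every proper, nonempty prefix as a candidate border.
    for k in range(1, n):
        border = s[:k]
        if not s.endswith(border):
            continue
        # The first occurrence starting at index >= 1 must be the suffix
        # itself (at index n - k); otherwise there is an internal occurrence.
        if s.find(border, 1) == n - k:
            return True
    return False


if __name__ == "__main__":
    for word in ["a", "abca", "aaa", "abaca", "ab"]:
        print(word, is_closed(word))
```

This sketch runs in quadratic time or worse and is meant only to make the definition concrete.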

Motivated by computing duplication patterns in sequences, a new fundamental problem called the longest subsequence-repeated subsequence (LSRS) is proposed. Given a sequence $S$ of length $n$, a letter-repeated subsequence is a subsequence…

Data Structures and Algorithms · Computer Science 2023-09-01 Manuel Lafond, Wenfeng Lai, Adiesha Liyanage, Binhai Zhu

The cornerstone of any algorithm computing all repetitions in a string of length n in O(n) time is the fact that the number of runs (or maximal repetitions) is O(n). We give a simple proof of this result. As a consequence of our approach,…

Data Structures and Algorithms · Computer Science 2008-02-21 Maxime Crochemore, Lucian Ilie

Following (Kolpakov et al., 2013; Gawrychowski and Manea, 2015), we continue the study of $\alpha$-gapped repeats in strings, defined as factors $uvu$ with $|uv|\leq \alpha |u|$. Our main result is the $O(\alpha n)$ bound on the…

Formal Languages and Automata Theory · Computer Science 2015-10-05 Maxime Crochemore, Roman Kolpakov, Gregory Kucherov

A longest repeat query on a string, motivated by its applications in many subfields including computational biology, asks for the longest repetitive substring(s) covering a particular string position (point query). In this paper, we extend…

Data Structures and Algorithms · Computer Science 2015-11-10 Bojian Xu

We give a new characterization of maximal repetitions (or runs) in strings based on Lyndon words. The characterization leads to a proof of what was known as the "runs" conjecture (Kolpakov & Kucherov (FOCS '99)), which states that the…

Discrete Mathematics · Computer Science 2018-07-03 Hideo Bannai, Tomohiro I, Shunsuke Inenaga, Yuto Nakashima, Masayuki Takeda, Kazuya Tsuruta

In this paper we study the fundamental problem of maintaining a dynamic collection of strings under the following operations: concat - concatenates two strings, split - splits a string into two at a given position, compare - finds the…

Data Structures and Algorithms · Computer Science 2016-04-11 Paweł Gawrychowski, Adam Karczmarz, Tomasz Kociumaka, Jakub Łącki, Piotr Sankowski

A maximal repetition, or run, in a string, is a maximal periodic substring whose smallest period is at most half the length of the substring. In this paper, we consider runs that correspond to a path on a trie, or in other words, on a…

Data Structures and Algorithms · Computer Science 2021-04-21 Ryo Sugahara, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
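
The periodicity condition in the entry above (smallest period at most half the length) can be illustrated with a short sketch. It computes the smallest period with the standard KMP failure function; the names `smallest_period` and `is_periodic` are hypothetical, and the sketch tests only the periodicity condition on a given string, not the maximality that makes a substring a run, nor the trie setting of the paper.

```python
def smallest_period(s: str) -> int:
    """Smallest p >= 1 such that s[i] == s[i + p] for all valid i.
    Uses the KMP failure function: period = len(s) - longest border."""
    n = len(s)
    if n == 0:
        return 0
    fail = [0] * n  # fail[i] = length of the longest proper border of s[:i+1]
    k = 0
    for i in range(1, n):
        while k > 0 and s[i] != s[k]:
            k = fail[k - 1]
        if s[i] == s[k]:
            k += 1
        fail[i] = k
    return n - fail[-1]


def is_periodic(s: str) -> bool:
    """True if the smallest period is at most half the length of s."""
    return len(s) > 0 and 2 * smallest_period(s) <= len(s)


if __name__ == "__main__":
    for word in ["abab", "aba", "aabaabaa", "abc"]:
        print(word, smallest_period(word), is_periodic(word))
```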

A gapped repeat is a factor of the form $uvu$ where $u$ and $v$ are nonempty words. The period of the gapped repeat is defined as $|u|+|v|$. The gapped repeat is maximal if it cannot be extended to the left or to the right by at least one…

Formal Languages and Automata Theory · Computer Science 2013-10-01 Roman Kolpakov, Mikhail Podolskiy, Mikhail Posypkin, Nickolay Khrapov

The classic string indexing problem is to preprocess a string $S$ into a compact data structure that supports efficient subsequent pattern matching queries, that is, given a pattern string $P$, report all occurrences of $P$ within $S$. In…

Data Structures and Algorithms · Computer Science 2024-02-15 Philip Bille, Inge Li Gørtz, Max Rishøj Pedersen, Eva Rotenberg, Teresa Anna Steiner

Repeat finding in strings has important applications in subfields such as computational biology. The challenge of finding the longest repeats covering particular string positions was recently proposed and solved by İleri et al., using a…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-01-28 Yun Tian, Bojian Xu

We solve the problems of detecting and counting various forms of regularities in a string represented as a Straight Line Program (SLP). Given an SLP of size $n$ that represents a string $s$ of length $N$, our algorithm computes all runs and…

Data Structures and Algorithms · Computer Science 2013-04-29 Tomohiro I, Wataru Matsubara, Kouji Shimohira, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda, Kazuyuki Narisawa, Ayumi Shinohara

This paper provides an upper bound for several subsets of maximal repeats and maximal pairs in compressed strings and also presents a formerly unknown relationship between maximal pairs and the run-length Burrows-Wheeler transform. This…

Data Structures and Algorithms · Computer Science 2020-02-18 Julian Pape-Lange

A closed string $u$ is either of length one or contains a border that occurs only as a prefix and as a suffix in $u$ and nowhere else within $u$. In this paper, we present fast $\mathcal{O}(n\log n)$ time algorithms to compute all…

Data Structures and Algorithms · Computer Science 2026-01-12 Samkith K Jain, Neerja Mhaskar

An occurrence of a repeated substring $u$ in a string $S$ is called a net occurrence if extending the occurrence to the left or to the right decreases the number of occurrences to 1. The net frequency (NF) of a repeated substring $u$ in a…

Data Structures and Algorithms · Computer Science 2026-01-01 Shunsuke Inenaga

The classic string indexing problem is to preprocess a string S into a compact data structure that supports efficient pattern matching queries. Typical queries include existential queries (decide if the pattern occurs in S), reporting…

Data Structures and Algorithms · Computer Science 2021-02-05 Philip Bille, Inge Li Gørtz, Max Rishøj Pedersen, Teresa Anna Steiner

We study the problem of computing a longest increasing subsequence in a sequence $S$ of $n$ distinct elements in the presence of persistent comparison errors. In this model, every comparison between two elements can return the wrong result…

Data Structures and Algorithms · Computer Science 2018-08-13 Barbara Geissmann

The Karp-Rabin fingerprint of a string is a type of hash value that due to its strong properties has been used in many string algorithms. In this paper we show how to construct a data structure for a string $S$ of size $N$ compressed by a…

Data Structures and Algorithms · Computer Science 2013-05-17 Philip Bille, Patrick Hagge Cording, Inge Li Gørtz, Benjamin Sach, Hjalte Wedel Vildhøj, Søren Vind