English
Related papers

Related papers: Maximal Closed Substrings

200 papers

Much research in stringology focuses on structures that can, in a way, ``grasp'' repeats (substrings that occur multiple times) as, for example, the so-called runs, a.k.a. maximal repetitions, compactly describe all tandem repeats. In this…

Data Structures and Algorithms · Computer Science 2024-10-02 Dmitry Kosolobov

A closed string $u$ is either of length one or contains a border that occurs only as a prefix and as a suffix in $u$ and nowhere else within $u$. In this paper, we present fast $\mathcal{O}(n\log n)$ time algorithms to compute all…

Data Structures and Algorithms · Computer Science 2026-01-12 Samkith K Jain , Neerja Mhaskar

Finding the common subsequences of $L$ multiple strings has many applications in the area of bioinformatics, computational linguistics, and information retrieval. A well-known result states that finding a Longest Common Subsequence (LCS)…

Data Structures and Algorithms · Computer Science 2020-09-09 Jin Cao , Dewei Zhong

Maximal repetition of a string is the maximal length of a repeated substring. This paper investigates maximal repetition of strings drawn from stochastic processes. Strengthening previous results, two new bounds for the almost sure growth…

Information Theory · Computer Science 2020-03-11 Łukasz Dębowski

Maximal Common Subsequences (MCSs) between two strings X and Y are subsequences of both X and Y that are maximal under inclusion. MCSs relax and generalize the well known and widely used concept of Longest Common Subsequences (LCSs), which…

Data Structures and Algorithms · Computer Science 2023-07-26 Alessio Conte , Roberto Grossi , Giulia Punzi , Takeaki Uno

We propose efficient algorithms for enumerating maximal common subsequences (MCSs) of two strings. Efficiency of the algorithms are estimated by the preprocessing-time, space, and delay-time complexities. One algorithm prepares a…

Data Structures and Algorithms · Computer Science 2023-07-21 Miyuji Hirota , Yoshifumi Sakai

A string $w$ is said to be a minimal unique substring (MUS) of a string $T$ if $w$ occurs exactly once in $T$, and any proper substring of $w$ occurs at least twice in $T$. It is known that the number of MUSs in a string $T$ of length $n$…

Data Structures and Algorithms · Computer Science 2025-08-25 Hiroto Fujimaru , Takuya Mieno , Shunsuke Inenaga

Finding an Approximate Longest Common Substring (ALCS) within a given set $S=\{s_1,s_2,\ldots,s_m\}$ of $m \ge 2$ strings is a key problem in computational biology, such as identifying related mutations across multiple genetic sequences. We…

Data Structures and Algorithms · Computer Science 2025-09-22 Hamed Hasibi , Neerja Mhaskar , W. F. Smyth

Let $S$ be a string of length $n$. In this paper we introduce the notion of \emph{string attractor}: a subset of the string's positions $[1,n]$ such that every distinct substring of $S$ has an occurrence crossing one of the attractor's…

Data Structures and Algorithms · Computer Science 2017-09-20 Nicola Prezza

A border of a string is a non-empty prefix of the string that is also a suffix of the string, and a string is unbordered if it has no border other than itself. Loptev, Kucherov, and Starikovskaya [CPM 2015] conjectured the following: If we…

Data Structures and Algorithms · Computer Science 2018-12-18 Patrick Hagge Cording , Travis Gagie , Mathias Bæk Tejs Knudsen , Tomasz Kociumaka

In this paper we initiate the study of computing a maximal (not necessarily maximum) repeating pattern in a single input string, where the corresponding problems have been studied (e.g., a maximal common subsequence) only in two or more…

Data Structures and Algorithms · Computer Science 2026-01-21 Mingyang Gong , Adiesha Liyanage , Braeden Sopp , Binhai Zhu

This paper performs the analysis necessary to bound the running time of known, efficient algorithms for generating all longest common subsequences. That is, we bound the running time as a function of input size for algorithms with time…

Discrete Mathematics · Computer Science 2007-05-23 Ronald I. Greenberg

We consider the longest common subsequence (LCS) problem with the restriction that the common subsequence is required to consist of at least $k$ length substrings. First, we show an $O(mn)$ time algorithm for the problem which gives a…

Data Structures and Algorithms · Computer Science 2017-02-07 Yohei Ueki , Diptarama , Masatoshi Kurihara , Yoshiaki Matsuoka , Kazuyuki Narisawa , Ryo Yoshinaka , Hideo Bannai , Shunsuke Inenaga , Ayumi Shinohara

A run is an inclusion maximal occurrence in a string (as a subinterval) of a repetition $v$ with a period $p$ such that $2p \le |v|$. The exponent of a run is defined as $|v|/p$ and is $\ge 2$. We show new bounds on the maximal sum of…

Discrete Mathematics · Computer Science 2015-05-18 Maxime Crochemore , Marcin Kubica , Jakub Radoszewski , Wojciech Rytter , Tomasz Walen

A border of a string is a non-empty proper prefix of the string that is also a suffix. A string is unbordered if it has no border. The longest unbordered factor is a fundamental notion in stringology, closely related to string periodicity.…

Data Structures and Algorithms · Computer Science 2025-07-23 Shoma Sekizaki , Takuya Mieno

A run is a maximal occurrence of a repetition $v$ with a period $p$ such that $2p \le |v|$. The maximal number of runs in a string of length $n$ was studied by several authors and it is known to be between $0.944 n$ and $1.029 n$. We…

Data Structures and Algorithms · Computer Science 2009-07-14 Maxime Crochemore , Costas Iliopoulos , Marcin Kubica , Jakub Radoszewski , Wojciech Rytter , Tomasz Walen

We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1 .. m]$ on a large repetitive text collection $T[1 .. n]$, which is represented as a (hopefully much smaller) run-length context-free grammar of…

Data Structures and Algorithms · Computer Science 2023-09-06 Gonzalo Navarro

The longest common subsequence (LCS) is a fundamental problem in string processing which has numerous algorithmic studies, extensions, and applications. A sequence $u_1, \ldots, u_f$ of $f$ strings s said to be an ($f$-)segmentation of a…

Data Structures and Algorithms · Computer Science 2025-02-27 Yuki Yonemoto , Takuya Mieno , Shunsuke Inenaga , Ryo Yoshinaka , Ayumi Shinohara

A longest repeat query on a string, motivated by its applications in many subfields including computational biology, asks for the longest repetitive substring(s) covering a particular string position (point query). In this paper, we extend…

Data Structures and Algorithms · Computer Science 2015-11-10 Bojian Xu

The cornerstone of any algorithm computing all repetitions in a string of length n in O(n) time is the fact that the number of runs (or maximal repetitions) is O(n). We give a simple proof of this result. As a consequence of our approach,…

Data Structures and Algorithms · Computer Science 2008-02-21 Maxime Crochemore , Lucian Ilie
‹ Prev 1 2 3 10 Next ›