The Longest Subsequence-Repeated Subsequence Problem
Abstract
Motivated by computing duplication patterns in sequences, a new fundamental problem called the longest subsequence-repeated subsequence (LSRS) is proposed. Given a sequence of length , a letter-repeated subsequence is a subsequence of in the form of with a subsequence of , and for all in and in . We first present an time algorithm to compute the longest cubic subsequences of all the substrings of , improving the trivial bound. Then, an time algorithm for computing the longest subsequence-repeated subsequence (LSRS) of is obtained. Finally we focus on two variants of this problem. We first consider the constrained version when is unbounded, each letter appears in at most times and all the letters in must appear in the solution. We show that the problem is NP-hard for , via a reduction from a special version of SAT (which is obtained from 3-COLORING). We then show that when each letter appears in at most times, then the problem is solvable in time.
Cite
@article{arxiv.2304.06862,
title = {The Longest Subsequence-Repeated Subsequence Problem},
author = {Manuel Lafond and Wenfeng Lai and Adiesha Liyanage and Binhai Zhu},
journal= {arXiv preprint arXiv:2304.06862},
year = {2023}
}
Comments
15 pages, 1 figure