English
Related papers

Related papers: On The Closest String and Substring Problems

200 papers

We study the fundamental problem of finding the best string to represent a given set, in the form of the Closest String problem: Given a set $X \subseteq \Sigma^d$ of $n$ strings, find the string $x^*$ minimizing the radius of the smallest…

Computational Complexity · Computer Science 2023-05-30 Amir Abboud , Nick Fischer , Elazar Goldenberg , Karthik C. S. , Ron Safier

Finding an Approximate Longest Common Substring (ALCS) within a given set $S=\{s_1,s_2,\ldots,s_m\}$ of $m \ge 2$ strings is a key problem in computational biology, such as identifying related mutations across multiple genetic sequences. We…

Data Structures and Algorithms · Computer Science 2025-09-22 Hamed Hasibi , Neerja Mhaskar , W. F. Smyth

The problem of approximate string matching is important in many different areas such as computational biology, text processing and pattern recognition. A great effort has been made to design efficient algorithms addressing several variants…

Data Structures and Algorithms · Computer Science 2008-07-29 Dimitris Papamichail , Georgios Papamichail

The Shortest Common Superstring problem (SCS) consists, for a set of strings S = {s_1,...,s_n}, in finding a minimum length string that contains all s_i, 1<= i <= n, as substrings. While a 2+11/30 approximation ratio algorithm has recently…

Data Structures and Algorithms · Computer Science 2018-05-02 Tristan Braquelaire , Marie Gasparoux , Mathieu Raffinot , Raluca Uricaru

In this paper we consider the $p$-Norm Hamming Centroid problem which asks to determine whether some given binary strings have a centroid with a bound on the $p$-norm of its Hamming distances to the strings. Specifically, given a set of…

Computational Complexity · Computer Science 2019-06-28 Jiehua Chen , Danny Hermelin , Manuel Sorge

The approximate string matching is a fundamental and recurrent problem that arises in most computer science fields. This problem can be defined as follows: Let $D=\{x_1,x_2,\ldots x_d\}$ be a set of $d$ words defined on an alphabet…

Data Structures and Algorithms · Computer Science 2017-01-31 Ibrahim Chegrane

We report (to our knowledge) the first evaluation of Constraint Satisfaction as a computational framework for solving closest string problems. We show that careful consideration of symbol occurrences can provide search heuristics that…

Artificial Intelligence · Computer Science 2010-05-04 Tom Kelsey , Lars Kotthoff

The Closest String Problem is an NP-hard problem that aims to find a string that has the minimum distance from all sequences that belong to the given set of strings. Its applications can be found in coding theory, computational biology, and…

Artificial Intelligence · Computer Science 2024-07-19 Alireza Abdi , Marko Djukanovic , Hesam Tahmasebi Boldaji , Hadis Salehi , Aleksandar Kartelj

The Shortest Common Superstring (SCS) problem is a fundamental task in sequence analysis. In genome assembly, however, the double-stranded nature of DNA implies that each fragment may occur either in its original orientation or as its…

Data Structures and Algorithms · Computer Science 2026-03-30 Ryosuke Yamano , Tetsuo Shibuya

The problem of finding longest common subsequence (LCS) is one of the fundamental problems in computer science, which finds application in fields such as computational biology, text processing, information retrieval, data compression etc.…

Computational Complexity · Computer Science 2020-06-25 Amey Bhangale , Diptarka Chakraborty , Rajendra Kumar

The closest string problem is an NP-hard problem, whose task is to find a string that minimizes maximum Hamming distance to a given set of strings. This can be reduced to an integer program (IP). However, to date, there exists no known…

Data Structures and Algorithms · Computer Science 2011-05-12 Jing-Chao Chen

This study investigates whether reoptimization can help in solving the closest substring problem. We are dealing with the following reoptimization scenario. Suppose, we have an optimal l-length closest substring of a given set of sequences…

Data Structures and Algorithms · Computer Science 2017-03-21 Jhoirene B. Clemente , Henry N. Adorna

Many problems in bioinformatics are about finding strings that approximately represent a collection of given strings. We look at more general problems where some input strings can be classified as outliers. The Close to Most Strings problem…

Data Structures and Algorithms · Computer Science 2012-02-14 Christina Boucher , Gad M. Landau , Avivit Levy , David Pritchard , Oren Weimann

In the Shortest Superstring problem, we are given a set of strings and we are asking for a common superstring, which has the minimum number of characters. The Shortest Superstring problem is NP-hard and several constant-factor approximation…

Data Structures and Algorithms · Computer Science 2021-11-09 Matthias Englert , Nicolaos Matsakis , Pavel Veselý

String consensus problems aim at finding a string that minimizes some given distance with respect to an input set of strings. In particular, in the Closest string problem, we are given a set of strings of equal length and a radius $d$. The…

Data Structures and Algorithms · Computer Science 2025-07-29 Estéban Gabory , Laurent Bulteau , Gabriele Fici , Hilde Verbeek

This paper investigates the approximability of the Longest Common Subsequence (LCS) problem. The fastest algorithm for solving the LCS problem exactly runs in essentially quadratic time in the length of the input, and it is known that under…

Data Structures and Algorithms · Computer Science 2021-05-10 Shyan Akmal , Virginia Vassilevska Williams

The {\em shortest common superstring} and the {\em shortest common supersequence} are two well studied problems having a wide range of applications. In this paper we consider both problems with resource constraints, denoted as the…

Data Structures and Algorithms · Computer Science 2010-06-29 Raphaël Clifford , Zvi Gotthilf , Moshe Lewenstein , Alexandru Popa

The Longest Common Subsequence (LCS) is a fundamental string similarity measure, and computing the LCS of two strings is a classic algorithms question. A textbook dynamic programming algorithm gives an exact algorithm in quadratic time, and…

Data Structures and Algorithms · Computer Science 2023-02-13 Xiaoyu He , Ray Li

The Longest Common Subsequence (LCS) of two strings is a fundamental string similarity measure with a classical dynamic programming solution taking quadratic time. Despite significant efforts, little progress was made in improving the…

Data Structures and Algorithms · Computer Science 2021-12-17 Negev Shekel Nosatzki

In the Maximum Duo-Preservation String Mapping problem we are given two strings and wish to map the letters of the former to the letters of the latter so as to maximise the number of duos. A duo is a pair of consecutive letters that is…

Data Structures and Algorithms · Computer Science 2017-05-31 Bartłomiej Dudek , Paweł Gawrychowski , Piotr Ostropolski-Nalewaja
‹ Prev 1 2 3 10 Next ›