English

On the String Consensus Problem and the Manhattan Sequence Consensus Problem

Data Structures and Algorithms 2014-07-24 v1

Abstract

In the Manhattan Sequence Consensus problem (MSC problem) we are given kk integer sequences, each of length ll, and we are to find an integer sequence xx of length ll (called a consensus sequence), such that the maximum Manhattan distance of xx from each of the input sequences is minimized. For binary sequences Manhattan distance coincides with Hamming distance, hence in this case the string consensus problem (also called string center problem or closest string problem) is a special case of MSC. Our main result is a practically efficient O(l)O(l)-time algorithm solving MSC for k5k\le 5 sequences. Practicality of our algorithms has been verified experimentally. It improves upon the quadratic algorithm by Amir et al.\ (SPIRE 2012) for string consensus problem for k=5k=5 binary strings. Similarly as in Amir's algorithm we use a column-based framework. We replace the implied general integer linear programming by its easy special cases, due to combinatorial properties of the MSC for k5k\le 5. We also show that for a general parameter kk any instance can be reduced in linear time to a kernel of size k!k!, so the problem is fixed-parameter tractable. Nevertheless, for k4k\ge 4 this is still too large for any naive solution to be feasible in practice.

Keywords

Cite

@article{arxiv.1407.6144,
  title  = {On the String Consensus Problem and the Manhattan Sequence Consensus Problem},
  author = {Tomasz Kociumaka and Jakub W. Pachocki and Jakub Radoszewski and Wojciech Rytter and Tomasz Waleń},
  journal= {arXiv preprint arXiv:1407.6144},
  year   = {2014}
}

Comments

accepted to SPIRE 2014

R2 v1 2026-06-22T05:10:43.885Z