English

Quantum algorithm for position weight matrix matching

Quantum Physics 2023-03-08 v1 Quantitative Methods

Abstract

We propose two quantum algorithms for a problem in bioinformatics, position weight matrix (PWM) matching, which aims to find segments (sequence motifs) in a biological sequence such as DNA and protein that have high scores defined by the PWM and are thus of informational importance related to biological function. The two proposed algorithms, the naive iteration method and the Monte-Carlo-based method, output matched segments, given the oracular accesses to the entries in the biological sequence and the PWM. The former uses quantum amplitude amplification (QAA) for sequence motif search, resulting in the query complexity scaling on the sequence length nn, the sequence motif length mm and the number of the PWMs KK as O~(mKn)\widetilde{O}\left(m\sqrt{Kn}\right), which means speedup over existing classical algorithms with respect to nn and KK. The latter also uses QAA, and further, quantum Monte Carlo integration for segment score calculation, instead of iteratively operating quantum circuits for arithmetic in the naive iteration method; then it provides the additional speedup with respect to mm in some situation. As a drawback, these algorithms use quantum random access memories and their initialization takes O(n)O(n) time. Nevertheless, our algorithms keep the advantage especially when we search matches in a sequence for many PWMs in parallel.

Keywords

Cite

@article{arxiv.2303.03569,
  title  = {Quantum algorithm for position weight matrix matching},
  author = {Koichi Miyamoto and Naoki Yamamoto and Yasubumi Sakakibara},
  journal= {arXiv preprint arXiv:2303.03569},
  year   = {2023}
}