English

Accelerating DNA Read Mapping with Digital Processing-in-Memory

Hardware Architecture 2024-11-22 v2 Distributed, Parallel, and Cluster Computing Quantitative Methods

Abstract

Genome analysis has revolutionized fields such as personalized medicine and forensics. Modern sequencing machines generate vast amounts of fragmented strings of genome data called reads. The alignment of these reads into a complete DNA sequence of an organism (the read mapping process) requires extensive data transfer between processing units and memory, leading to execution bottlenecks. Prior studies have primarily focused on accelerating specific stages of the read-mapping task. Conversely, this paper introduces a holistic framework called DART-PIM that accelerates the entire read-mapping process. DART-PIM facilitates digital processing-in-memory (PIM) for an end-to-end acceleration of the entire read-mapping process, from indexing using a unique data organization schema to filtering and read alignment with an optimized Wagner Fischer algorithm. A comprehensive performance evaluation with real genomic data shows that DART-PIM achieves a 5.7x and 257x improvement in throughput and a 92x and 27x energy efficiency enhancement compared to state-of-the-art GPU and PIM implementations, respectively.

Keywords

Cite

@article{arxiv.2411.03832,
  title  = {Accelerating DNA Read Mapping with Digital Processing-in-Memory},
  author = {Rotem Ben-Hur and Orian Leitersdorf and Ronny Ronen and Lidor Goldshmidt and Idan Magram and Lior Kaplun and Leonid Yavitz and Shahar Kvatinsky},
  journal= {arXiv preprint arXiv:2411.03832},
  year   = {2024}
}