English
Related papers

Related papers: Sam2bam: High-Performance Framework for NGS Data P…

200 papers

Motivation: Illumina DNA sequencing is now the predominant source of raw genomic data, and data volumes are growing rapidly. Bioinformatic analysis pipelines are having trouble keeping pace. A common bottleneck in such pipelines is the…

Genomics · Quantitative Biology 2014-09-09 Gregory G. Faust , Ira M. Hall

DNA sequencing, especially of microbial genomes and metagenomes, has been at the core of recent research advances in large-scale comparative genomics. The data deluge has resulted in exponential growth in genomic datasets over the past…

Genome sequence analysis has enabled significant advancements in medical and scientific areas such as personalized medicine, outbreak tracing, and the understanding of evolution. Unfortunately, it is currently bottlenecked by the…

Genome sequence analysis plays a pivotal role in enabling many medical and scientific advancements in personalized medicine, outbreak tracing, and forensics. However, the analysis of genome sequencing data is currently bottlenecked by the…

Hardware Architecture · Computer Science 2021-11-04 Damla Senol Cali

Sequence alignment data is often ordered by coordinate (id of the reference sequence plus position on the sequence where the fragment was mapped) when stored in BAM files, as this simplifies the extraction of variants between the mapped…

Genomics · Quantitative Biology 2014-06-24 German Tischler , Steven Leonard

DNA sequencing is the physical/biochemical process of identifying the location of the four bases (Adenine, Guanine, Cytosine, Thymine) in a DNA strand. As semiconductor technology revolutionized computing, modern DNA sequencing technology…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-06 S. Karen Khatamifard , Zamshed Chowdhury , Nakul Pande , Meisam Razaviyayn , Chris Kim , Ulya R. Karpuzcu

Processing-using-DRAM has been proposed for a limited set of basic operations (i.e., logic operations, addition). However, in order to enable the full adoption of processing-using-DRAM, it is necessary to provide support for more complex…

Next Generation Sequencing (NGS) platforms and, more generally, high-throughput technologies are giving rise to an exponential growth in the size of nucleotide sequence databases. Moreover, many emerging applications of nucleotide datasets…

Databases · Computer Science 2019-10-11 Ferdinando Montecuollo , Giovannni Schmid , Roberto Tagliaferri

Background: Identifying all possible mapping locations of next-generation sequencing (NGS) reads is highly essential in several applications such as prediction of genomic variants or protein binding motifs located in repeat regions, isoform…

Genomics · Quantitative Biology 2020-03-25 Ngoc Hieu Tran , Xin Chen

Recent DNA pre-alignment filter designs employ DRAM for storing the reference genome and its associated meta-data. However, DRAM incurs increasingly high energy consumption background and refresh energy as devices scale. To overcome this…

Emerging Technologies · Computer Science 2022-12-27 Fazal Hameed , Asif Ali Khan , Sebastien Ollivier , Alex K. Jones , Jeronimo Castrillon

In the rapidly evolving domain of next generation sequencing and bioinformatics analysis, data generation is one aspect that is increasing at a concomitant rate. The burden associated with processing large amounts of sequencing data has…

Genomics · Quantitative Biology 2015-03-11 Matthew J. Wade , Thomas P. Curtis , Russell J. Davenport

Next-generation sequencing (NGS) is a pivotal technique in genome sequencing due to its high throughput, rapid results, cost-effectiveness, and enhanced accuracy. Its significance extends across various domains, playing a crucial role in…

Genomics · Quantitative Biology 2025-04-28 Fathima Nuzla Ismail , Shanika Amarasoma

Motivation: Modern genomics laboratories generate massive volumes of sequencing data, often resulting in significant storage costs. Genomics storage consists of duplicate files, temporary processing files, and redundant intermediate data.…

Genomics · Quantitative Biology 2025-11-27 Margot Celerie , Andrew Oldfield , William Ritchie

Processing-using-DRAM has been proposed for a limited set of basic operations (i.e., logic operations, addition). However, in order to enable full adoption of processing-using-DRAM, it is necessary to provide support for more complex…

A genome read data set can be quickly and efficiently remapped from one reference to another similar reference (e.g., between two reference versions or two similar species) using a variety of tools, e.g., the commonly-used CrossMap tool.…

Genomics · Quantitative Biology 2023-11-21 Jeremie S. Kim , Can Firtina , Meryem Banu Cavlak , Damla Senol Cali , Can Alkan , Onur Mutlu

Large-scale genomic workflows used in precision medicine can process datasets spanning tens to hundreds of gigabytes per sample, leading to high memory spikes, intensive disk I/O, and task failures due to out-of-memory errors. Simple static…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Daniel Mas Montserrat , Ray Verma , Míriam Barrabés , Francisco M. de la Vega , Carlos D. Bustamante , Alexander G. Ioannidis

With small-scale quantum processors transitioning from experimental physics labs to industrial products, these processors allow us to efficiently compute important algorithms in various fields. In this paper, we propose a quantum algorithm…

Quantum Physics · Physics 2020-05-22 Aritra Sarkar , Zaid Al-Ars , Carmen G. Almudever , Koen Bertels

Deep learning demonstrates effectiveness across a wide range of tasks. However, the dense and over-parameterized nature of these models results in significant resource consumption during deployment. In response to this issue, weight…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-05 Cong Ma , Du Wu , Zhelang Deng , Jiang Chen , Xiaowen Huang , Jintao Meng , Wenxi Zhu , Bingqiang Wang , Amelie Chi Zhou , Peng Chen , Minwen Deng , Yanjie Wei , Shengzhong Feng , Yi Pan

As the amount of data produced in society continues to grow at an exponential rate, modern applications are incurring significant performance and energy penalties due to high data movement between the CPU and memory/storage. While…

Hardware Architecture · Computer Science 2024-03-12 Ryan Wong , Nikita Kim , Kevin Higgs , Sapan Agarwal , Engin Ipek , Saugata Ghose , Ben Feinberg

Segment Anything Model 2 (SAM 2) serves as a core foundation model in the field of video segmentation. Building upon the original SAM model, it introduces a memory bank mechanism and demonstrates outstanding performance in tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Zhaoyuan Ding , Yijing Yang , Han Shu , Xinghao Chen
‹ Prev 1 2 3 10 Next ›