English
Related papers

Related papers: wgatools: an ultrafast toolkit for manipulating wh…

200 papers

Motivation: The rapid growth in genome-wide association studies (GWAS) in plants and animals has brought about the need for a central resource that facilitates i) performing GWAS, ii) accessing data and results of other GWAS, and iii)…

Background: While the importance of gene-gene interactions in human diseases has been well recognized, identifying them has been a great challenge, especially through association studies with millions of genetic markers and thousands of…

Quantitative Methods · Quantitative Biology 2015-05-07 Changshuai Wei , Qing Lu

The AGP format is a tab-separated table format describing how components of a genome assembly fit together. A standard submission format for genome assemblies is a fasta file giving the sequence of contigs along with an AGP file showing how…

Genomics · Quantitative Biology 2025-03-27 Edward S. Ricemeyer , Rachel A. Carroll , Wesley C. Warren

The effective visualization of genomic data is crucial for exploring and interpreting complex relationships within and across genes and genomes. Despite advances in developing dedicated bioinformatics software, common visualization tools…

Genomics · Quantitative Biology 2024-11-22 Thomas Hackl , Markus Ankenbrand , Bart van Adrichem , David Wilkins , Kristina Haslinger

Motivation: A pan-genome graph represents a collection of genomes and encodes sequence variations between them. It is a powerful data structure for studying multiple similar genomes. Sequence-to-graph alignment is an essential step for the…

Genomics · Quantitative Biology 2022-06-29 Haowen Zhang , Shiqi Wu , Srinivas Aluru , Heng Li

In recent years, Whole Genome Sequencing (WGS) evolved from a futuristic-sounding research project to an increasingly affordable technology for determining complete genome sequences of complex organisms, including humans. This prompts a…

Cryptography and Security · Computer Science 2015-02-17 Erman Ayday , Emiliano De Cristofaro , Jean-Pierre Hubaux , Gene Tsudik

Metabolomic data sets provide a direct read-out of cellular phenotypes and are increasingly generated to study biological questions. Our previous work revealed the potential of analyzing extracellular metabolomic data in the context of the…

Molecular Networks · Quantitative Biology 2016-06-10 Maike K. Aurich , Ronan M. T. Fleming , Ines Thiele

Next-generation sequencing (NGS) is a pivotal technique in genome sequencing due to its high throughput, rapid results, cost-effectiveness, and enhanced accuracy. Its significance extends across various domains, playing a crucial role in…

Genomics · Quantitative Biology 2025-04-28 Fathima Nuzla Ismail , Shanika Amarasoma

The generation of high-quality assemblies, even for large eukaryotic genomes, has become a routine task for many biologists thanks to recent advances in sequencing technologies. However, the annotation of these assemblies - a crucial step…

Genomics · Quantitative Biology 2021-04-07 Roman Martin , Thomas Hackl , Georges Hattab , Matthias G. Fischer , Dominik Heider

A genome read data set can be quickly and efficiently remapped from one reference to another similar reference (e.g., between two reference versions or two similar species) using a variety of tools, e.g., the commonly-used CrossMap tool.…

Genomics · Quantitative Biology 2023-11-21 Jeremie S. Kim , Can Firtina , Meryem Banu Cavlak , Damla Senol Cali , Can Alkan , Onur Mutlu

How to compare whole genome sequences at large scale has not been achieved via conventional methods based on pair-wisely base-to-base comparison; nevertheless, no attention was paid to handle in-one-sitting a number of genomes crossing…

Genomics · Quantitative Biology 2014-03-05 Yuncan Ai , Hannan Ai , Fanmei Meng , Lei Zhao

Genome sequence analysis plays a pivotal role in enabling many medical and scientific advancements in personalized medicine, outbreak tracing, and forensics. However, the analysis of genome sequencing data is currently bottlenecked by the…

Hardware Architecture · Computer Science 2021-11-04 Damla Senol Cali

Summary: BWA-MEM is a new alignment algorithm for aligning sequence reads or long query sequences against a large reference genome such as human. It automatically chooses between local and end-to-end alignments, supports paired-end reads…

Genomics · Quantitative Biology 2013-05-28 Heng Li

Motivation: The multiple sequence alignment (MSA) problem has been extensively studied, with numerous approaches developed over recent years. With the rapid growth of sequence data, there is an increasing need for fast and accurate MSA…

Computational Engineering, Finance, and Science · Computer Science 2026-01-23 Emily G. Light , Morgan Prior , Noah M. Daniels , Najib Ishaq

Motivation: Protein-to-genome alignment is critical to annotating genes in non-model organisms. While there are a few tools for this purpose, all of them were developed over ten years ago and did not incorporate the latest advances in…

Genomics · Quantitative Biology 2022-12-29 Heng Li

Tool learning is increasingly important for large language models (LLMs) to effectively coordinate and utilize a diverse set of tools in order to solve complex real-world tasks. By selecting and integrating appropriate tools, LLMs extend…

Machine Learning · Computer Science 2026-01-21 Zheng Fang , Wolfgang Mayer , Zeyu Zhang , Jian Wang , Hong-Yu Zhang , Wanli Li , Zaiwen Feng

Aligning millions of short DNA or RNA reads, of 75 to 250 base pairs each, to a reference genome is a significant computation problem in bioinformatics. We present a flexible and fast FPGA-based short read alignment tool. Our aligner makes…

Genomics · Quantitative Biology 2018-05-02 Nathaniel McVicar , Akina Hoshino , Anna La Torre , Thomas A. Reh , Walter L. Ruzzo , Scott Hauck

We now need more than ever to make genome analysis more intelligent. We need to read, analyze, and interpret our genomes not only quickly, but also accurately and efficiently enough to scale the analysis to population level. There currently…

Massively parallel sequencing techniques have revolutionized biological and medical sciences by providing unprecedented insight into the genomes of humans, animals, and microbes. Modern sequencing platforms generate enormous amounts of…

To uncover the genetic basis of complex disease, individuals are often measured at a large number of genetic variants (usually SNPs) across the genome. GemTools provides computationally efficient tools for modeling genetic ancestry based on…

Applications · Statistics 2011-04-07 Lambertus Klei , Brian P. Kent , Nadine Melhem , Bernie Devlin , Kathryn Roeder
‹ Prev 1 2 3 10 Next ›