English
Related papers

Related papers: Fast genomic optical map assembly algorithm using …

200 papers

Genome assembly from the high-throughput sequencing (HTS) reads is a fundamental yet challenging computational problem. An intrinsic challenge is the uncertainty caused by the widespread repetitive elements. Here we get around the…

Genomics · Quantitative Biology 2016-09-13 Anqi Wang , Zheng Li , Zhanyu Wang , Lei M. Li

De novo genome assembly is challenging in highly repetitive regions; however, reference-guided assemblers often suffer from bias. We propose a framework for pangenome-guided sequence assembly, which can resolve short-read data in complex…

Quantum Physics · Physics 2026-02-11 Josh Cudby , James Bonfield , Chenxi Zhou , Richard Durbin , Sergii Strelchuk

Genome sequencing is essential to decode genetic information, identify organisms, understand diseases and advance personalized medicine. A critical step in any genome sequencing technique is genome assembly. However, de novo genome…

(An updated version of this manuscript has been accepted to Scientific Reports in 2016, please refer to http://www.nature.com/articles/srep31900) The highly anticipated transition from next generation sequencing (NGS) to third generation…

Genomics · Quantitative Biology 2016-09-06 Chengxi Ye , Chris Hill , Shigang Wu , Jue Ruan , Zhanshan , Ma

Recent genomic foundation models largely adopt large language model architectures that treat DNA as a one-dimensional token sequence. However, exhaustive sequential reading is structurally misaligned with sparse and discontinuous genomic…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Hongxin Xiang , Pengsen Ma , Yunkang Cao , Di Yu , Haowen Chen , Xinyu Yang , Xiangxiang Zeng

Over the past two decades, a series of works have aimed at studying the problem of genome assembly: the process of reconstructing a genome from sequence reads. An early formulation of the genome assembly problem showed that genome…

Genomics · Quantitative Biology 2013-12-30 Henry Lin

The first step in any genome assembly algorithm entails the conversion from the domain of strings and overlaps to the language of graphs and paths, typically using one of the two conventional methods: de Bruijn graphs or overlap graphs.…

Genomics · Quantitative Biology 2026-04-27 Anton Bankevich

Genome assembly asks to reconstruct an unknown string from many shorter substrings of it. Even though it is one of the key problems in Bioinformatics, it is generally lacking major theoretical advances. Its hardness stems both from…

Discrete Mathematics · Computer Science 2020-11-10 Massimo Cairo , Romeo Rizzi , Alexandru I. Tomescu , Elia C. Zirondelli

Genome assembly using high throughput data with short reads, arguably, remains an unresolvable task in repetitive genomes, since when the length of a repeat exceeds the read length, it becomes difficult to unambiguously connect the flanking…

Quantitative Methods · Quantitative Biology 2013-07-31 Viraj Deshpande , Eric DK Fung , Son Pham , Vineet Bafna

The de novo assembly of large, complex genomes is a significant challenge with currently available DNA sequencing technology. While many de novo assembly software packages are available, comparatively little attention has been paid to…

Genomics · Quantitative Biology 2013-07-31 Jared T. Simpson

A crucial problem in genome assembly is the discovery and correction of misassembly errors in draft genomes. We develop a method that will enhance the quality of draft genomes by identifying and removing misassembly errors using paired…

Genomics · Quantitative Biology 2014-11-24 Martin D. Muggli , Simon J. Puglisi , Roy Ronen , Christina Boucher

Motivation: Single Molecule Real-Time (SMRT) sequencing technology and Oxford Nanopore technologies (ONT) produce reads over 10kbp in length, which have enabled high-quality genome assembly at an affordable cost. However, at present, long…

Genomics · Quantitative Biology 2017-08-07 Heng Li

Recent emergence of next-generation DNA sequencing technology has enabled acquisition of genetic information at unprecedented scales. In order to determine the genetic blueprint of an organism, sequencing platforms typically employ…

Genomics · Quantitative Biology 2015-06-19 Xiaohu Shen , Manohar Shamaiah , Haris Vikalo

High read depth can be used to assemble short sequence repeats. The existing genome assemblers fail in repetitive regions of longer than average read. I propose a new algorithm for a DNA assembly which uses the relative frequency of reads…

Genomics · Quantitative Biology 2015-01-08 Robert M. Nowak

Today, machine learning tools, particularly artificial neural networks, have become crucial for diverse applications. However, current digital computing tools to train and deploy artificial neural networks often struggle with massive data…

Emerging Technologies · Computer Science 2025-02-14 Bora Çarpınlıoğlu , Uğur Teğin

Motivation: New long read sequencers promise to transform sequencing and genome assembly by producing reads tens of kilobases long. However their high error rate significantly complicates assembly and requires expensive correction steps to…

Genomics · Quantitative Biology 2017-07-18 Antoine Recanati , Thomas Brüls , Alexandre d'Aspremont

Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose order and/or…

Genomics · Quantitative Biology 2022-05-27 Sergey Aganezov , Pavel Avdeyev , Nikita Alexeev , Yongwu Rong , Max A. Alekseyev

Advances in data collecting technologies in genomics have significantly increased the need for tools designed to study the genetic basis of many diseases. Effective statistical methods should excel in both prediction accuracy and biomarker…

Methodology · Statistics 2025-11-13 Anthony-Alexander Christidis , Stefan Van Aelst , Ruben Zamar

In the past several years, the problem of genome assembly has received considerable attention from both biologists and computer scientists. An important component of current assembly methods is the scaffolding process. This process involves…

Genomics · Quantitative Biology 2013-07-03 Karl R. B. Schmitt , Aleksey V. Zimin , Guillaume Marcaçs , James A. Yorke , Michelle Girvan

Binary embedding of high-dimensional data aims to produce low-dimensional binary codes while preserving discriminative power. State-of-the-art methods often suffer from high computation and storage costs. We present a simple and fast…

Information Theory · Computer Science 2016-01-26 Sung-Hsien Hsieh , Chun-Shien Lu , Soo-Chang Pei
‹ Prev 1 2 3 10 Next ›