English
Related papers

Related papers: Do Read Errors Matter for Genome Assembly?

200 papers

High read depth can be used to assemble short sequence repeats. The existing genome assemblers fail in repetitive regions of longer than average read. I propose a new algorithm for a DNA assembly which uses the relative frequency of reads…

Genomics · Quantitative Biology 2015-01-08 Robert M. Nowak

The prevalent technique for DNA sequencing consists of two main steps: shotgun sequencing, where many randomly located fragments, called reads, are extracted from the overall sequence, followed by an assembly algorithm that aims to…

Genomics · Quantitative Biology 2016-01-28 Shirshendu Ganguly , Elchanan Mossel , Miklos Z. Racz

The study of functional genomics--particularly in non-model organisms has been dramatically improved over the last few years by use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally…

Genomics · Quantitative Biology 2013-07-25 Matthew D MacManes , Michael B Eisen

DNA sequencing is the basic workhorse of modern day biology and medicine. Shotgun sequencing is the dominant technique used: many randomly located short fragments called reads are extracted from the DNA sequence, and these reads are…

Information Theory · Computer Science 2013-02-15 Abolfazl Motahari , Guy Bresler , David Tse

The de novo assembly of large, complex genomes is a significant challenge with currently available DNA sequencing technology. While many de novo assembly software packages are available, comparatively little attention has been paid to…

Genomics · Quantitative Biology 2013-07-31 Jared T. Simpson

Recent emergence of next-generation DNA sequencing technology has enabled acquisition of genetic information at unprecedented scales. In order to determine the genetic blueprint of an organism, sequencing platforms typically employ…

Genomics · Quantitative Biology 2015-06-19 Xiaohu Shen , Manohar Shamaiah , Haris Vikalo

Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption…

Genomics · Quantitative Biology 2014-02-28 Ka-Kit Lam , Asif Khalak , David Tse

In this paper, fundamental limits in sequencing of a set of closely related DNA molecules are addressed. This problem is called pooled-DNA sequencing which encompasses many interesting problems such as haplotype phasing, metageomics, and…

Information Theory · Computer Science 2016-04-20 Amir Najafi , Damoun Nashta-ali , Seyed Abolfazl Motahari , Mehrdad Khani , Babak H. Khalaj , Hamid R. Rabiee

Genome assembly using high throughput data with short reads, arguably, remains an unresolvable task in repetitive genomes, since when the length of a repeat exceeds the read length, it becomes difficult to unambiguously connect the flanking…

Quantitative Methods · Quantitative Biology 2013-07-31 Viraj Deshpande , Eric DK Fung , Son Pham , Vineet Bafna

Genome sequencing is the basis for many modern biological and medicinal studies. With recent technological advances, metagenomics has become a problem of interest. This problem entails the analysis and reconstruction of multiple DNA…

Probability · Mathematics 2022-01-14 Marlee Herring

(An updated version of this manuscript has been accepted to Scientific Reports in 2016, please refer to http://www.nature.com/articles/srep31900) The highly anticipated transition from next generation sequencing (NGS) to third generation…

Genomics · Quantitative Biology 2016-09-06 Chengxi Ye , Chris Hill , Shigang Wu , Jue Ruan , Zhanshan , Ma

De novo assembly is the process of reconstructing the genome sequence of an organism from sequencing reads. Genome sequences are essential to biology, and assembly has been a central problem in bioinformatics for four decades. Until…

Genomics · Quantitative Biology 2023-08-16 Heng Li , Richard Durbin

Nanopore sequencing technology has the potential to render other sequencing technologies obsolete with its ability to generate long reads and provide portability. However, high error rates of the technology pose a challenge while generating…

Genomics · Quantitative Biology 2019-12-20 Damla Senol Cali , Jeremie S. Kim , Saugata Ghose , Can Alkan , Onur Mutlu

Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to…

Genomics · Quantitative Biology 2016-07-26 Yang Liao , Gordon K Smyth , Wei Shi

DNA is a leading candidate as the next archival storage media due to its density, durability and sustainability. To read (and write) data DNA storage exploits technology that has been developed over decades to sequence naturally occurring…

Emerging Technologies · Computer Science 2022-05-12 Jasmine Quah , Omer Sella , Thomas Heinis

The shotgun sequencing process involves fragmenting a long DNA sequence (input string) into numerous shorter, unordered, and overlapping segments (referred to as \emph{reads}). The reads are sequenced, and later aligned to reconstruct the…

Information Theory · Computer Science 2025-09-26 Mohammed Ihsan Ali , Hrishi Narayanan , Prasad Krishnan

Over the past two decades, a series of works have aimed at studying the problem of genome assembly: the process of reconstructing a genome from sequence reads. An early formulation of the genome assembly problem showed that genome…

Genomics · Quantitative Biology 2013-12-30 Henry Lin

Earlier formulations of the DNA assembly problem were all in the context of perfect assembly; i.e., given a set of reads from a long genome sequence, is it possible to perfectly reconstruct the original sequence? In practice, however, it is…

Information Theory · Computer Science 2016-05-09 Ilan Shomorony , Govinda M. Kamath , Fei Xia , Thomas A. Courtade , David N. Tse

Most DNA sequencing technologies are based on the shotgun paradigm: many short reads are obtained from random unknown locations in the DNA sequence. A fundamental question, studied in arXiv:1203.6233, is what read length and coverage depth…

Information Theory · Computer Science 2022-02-09 Aditya Narayan Ravi , Alireza Vahid , Ilan Shomorony

Short-read DNA sequencing instruments can yield over 1e+12 bases per run, typically composed of reads 150 bases long. Despite this high throughput, de novo assembly algorithms have difficulty reconstructing contiguous genome sequences using…

Genomics · Quantitative Biology 2023-06-09 Eric Chen , Justin Chu , Jessica Zhang , Rene L. Warren , Inanc Birol
‹ Prev 1 2 3 10 Next ›