Related papers: Codes for DNA Sequence Profiles

200 papers

We consider the problem of assembling a sequence based on a collection of its substrings observed through a noisy channel. The mathematical basis of the problem is the construction and design of sequences that may be discriminated based on…

Information Theory · Computer Science 2015-11-04 Han Mao Kiah, Gregory J. Puleo, Olgica Milenkovic

We consider the problem of coding for the substring channel, in which information strings are observed only through their (multisets of) substrings. Due to existing DNA sequencing techniques and applications in DNA-based storage systems,…

Information Theory · Computer Science 2024-03-27 Yonatan Yehezkeally, Nikita Polyanskii

DNA as a data storage medium has several advantages, including far greater data density compared to electronic media. We propose that schemes for data storage in the DNA of living organisms may benefit from studying the reconstruction…

Information Theory · Computer Science 2019-09-10 Yonatan Yehezkeally, Moshe Schwartz

DNA storage is now being considered as a new archival storage method for its durability and high information density, but it still faces challenges such as high costs and low throughput. By reducing sequencing sample size for decoding…

Information Theory · Computer Science 2025-04-22 Ruiying Cao, Xin Chen

This paper studies the problem of encoding messages into sequences which can be uniquely recovered from some noisy observations about their substrings. The observed reads comprise consecutive substrings with some given minimum overlap. This…

Information Theory · Computer Science 2023-12-11 Hengjia Wei, Moshe Schwartz, Gennian Ge

We provide an overview of current approaches to DNA-based storage system design and accompanying synthesis, sequencing and editing methods. We also introduce and analyze a suite of new constrained coding schemes for both archival and random…

Emerging Technologies · Computer Science 2015-07-08 S. M. Hossein Tabatabaei Yazdi, Han Mao Kiah, Eva Ruiz Garcia, Jian Ma, Huimin Zhao, Olgica Milenkovic

In this review paper, we delve into the nascent field of molecular data storage, focusing on system implementations and code constructions. We start by providing an overview of basic concepts in synthetic and computational biology.…

Emerging Technologies · Computer Science 2023-10-10 Olgica Milenkovic, Chao Pan

Due to its longevity and enormous information density, DNA is an attractive medium for archival data storage. Thanks to rapid technological advances, DNA storage is becoming practically feasible, as demonstrated by a number of experimental…

Information Theory · Computer Science 2022-11-11 Ilan Shomorony, Reinhard Heckel

Due to the redundant nature of DNA synthesis and sequencing technologies, a basic model for a DNA storage system is a multi-draw "shuffling-sampling" channel. In this model, a random number of noisy copies of each sequence is observed at…

Information Theory · Computer Science 2021-12-06 Kel Levick, Reinhard Heckel, Ilan Shomorony

Recent experiments have demonstrated the feasibility of storing digital information in macromolecules such as DNA and protein. However, the DNA storage channel is prone to errors such as deletions, insertions, and substitutions. During the…

Information Theory · Computer Science 2024-10-22 Aryan Abbasian, Mahtab Mirmohseni, Masoumeh Nasiri Kenari

Synthetic DNA approaches 227.5 exabytes per gram of storage density with stability over millennial timescales. Realising this capacity requires error-correction codes that recover data from substantial synthesis and sequencing errors.…

Information Theory · Computer Science 2026-04-23 James L. Banal

Motivated by DNA-based storage applications, we study the problem of reconstructing a coded sequence from multiple traces. We consider the model where the traces are outputs of independent deletion channels, where each channel deletes each…

Information Theory · Computer Science 2022-07-13 Serge Kas Hanna

DNA storage has emerged as a promising solution for large-scale and long-term data preservation. Among various error types, insertions are the most frequent errors occurring in DNA sequences, where the inserted symbol is often identical or…

Information Theory · Computer Science 2026-04-07 Hengfeng Liu, Chunming Tang, Cuiling Fan

DNA storage has emerged as an important area of research. The reliability of a DNA storage system depends on designing DNA strings (called DNA codes) that are sufficiently dissimilar. In this work, we introduce DNA codes that satisfy a…

Information Theory · Computer Science 2022-11-28 Krishna Gopal Benerjee, Sourav Deb, Manish K Gupta

We study the amount of reliable information that can be stored in a DNA-based storage system with noisy sequencing, where each codeword is composed of short DNA molecules. We analyze a concatenated coding scheme, where the outer code is…

Information Theory · Computer Science 2026-05-19 Ran Tamir, Nir Weinberger, Albert Guillén i Fàbregas

The process of DNA-based data storage (DNA storage for short) can be mathematically modelled as a communication channel, termed DNA storage channel, whose inputs and outputs are sets of unordered sequences. To design error correcting codes…

Information Theory · Computer Science 2020-06-11 Wentu Song, Kui Cai, Kees A. Schouhamer Immink

To increase the information capacity of DNA storage, composite DNA letters were introduced. We propose a novel channel model for composite DNA in which composite sequences are decomposed into ordered standard non-composite sequences. The…

Information Theory · Computer Science 2025-10-31 Besart Dollma, Ohad Elishco, Eitan Yaakobi

Due to its longevity and enormous information density, DNA is an attractive medium for archival storage. In this work, we study the fundamental limits and trade-offs of DNA-based storage systems by introducing a new channel model, which we…

Information Theory · Computer Science 2020-01-20 Ilan Shomorony, Reinhard Heckel

DNA has immense potential as an emerging data storage medium. The principle of DNA storage is the conversion and flow of digital information between binary code stream, quaternary base, and actual DNA fragments. This process will inevitably…

Information Retrieval · Computer Science 2022-10-21 Yun Qin, Fei Zhu, Bo Xi

The synthesis of DNA strands remains the most costly part of a DNA storage system. Thus, to make DNA storage systems more practical, the time and materials used in the synthesis process have to be optimized. We consider the most common…

Information Theory · Computer Science 2023-05-15 Johan Chrisnata, Han Mao Kiah, Van Long Phuoc Pham