Related papers: Coding for Synthesis Defects

200 papers

The synthesis of DNA strands remains the most costly part of a DNA storage system. Thus, to make DNA storage systems more practical, the time and materials used in the synthesis process have to be optimized. We consider the most common…

Information Theory · Computer Science 2023-05-15 Johan Chrisnata, Han Mao Kiah, Van Long Phuoc Pham

Due to their sequential nature, traditional DNA synthesis methods are expensive in terms of time and resources. They also fabricate multiple copies of the same strand, introducing redundancy. This redundancy can be leveraged to enhance the…

Information Theory · Computer Science 2025-10-29 Frederik Walter, Yonatan Yehezkeally

DNA synthesis is considered one of the most expensive components in current DNA storage systems. In this paper, focusing on a common synthesis machine, which generates multiple DNA strands in parallel following a fixed supersequence, we…

Information Theory · Computer Science 2025-05-13 Yajuan Liu, Tolga M. Duman

Due to its higher data density, longevity, energy efficiency, and ease of generating copies, DNA is considered a promising storage technology for satisfying future needs. However, a diverse set of errors including deletions, insertions,…

Information Theory · Computer Science 2022-08-05 Yuanyuan Tang, Shuche Wang, Hao Lou, Ryan Gabrys, Farzad Farnoud

Due to its high data density and longevity, DNA is considered a promising medium for satisfying ever-increasing data storage needs. However, the diversity of errors that occur in DNA sequences makes efficient error-correction a challenging…

Information Theory · Computer Science 2020-11-12 Yuanyuan Tang, Farzad Farnoud

The ability to store data in the DNA of a living organism has applications in a variety of areas including synthetic biology and watermarking of patented genetically-modified organisms. Data stored in this medium is subject to errors…

Information Theory · Computer Science 2016-11-17 Siddharth Jain, Farzad Farnoud, Moshe Schwartz, Jehoshua Bruck

DNA sequences are prone to forming secondary structures by folding back on themselves through non-specific hybridization among their nucleotides. The formation of secondary structures makes the sequences chemically inactive towards synthesis and…

Information Theory · Computer Science 2022-11-30 Siddhartha Siddhiprada Bhoi, Paramapalli Udaya, Abhay Kumar Singh

In this paper, we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where data is represented by an unordered set of $M$ sequences, each of length $L$. Errors…

Information Theory · Computer Science 2018-05-10 Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi

Composite DNA is a recent method to increase the base alphabet size in DNA-based data storage. This paper models synthesizing and sequencing of composite DNA and introduces coding techniques to correct substitutions, losses of entire…

Information Theory · Computer Science 2025-10-29 Frederik Walter, Omer Sabary, Antonia Wachter-Zeh, Eitan Yaakobi

Synthetic DNA can in principle be used for the archival storage of arbitrary data. Because errors are introduced during DNA synthesis, storage, and sequencing, an error-correcting code (ECC) is necessary for error-free recovery of the data.…

Quantitative Methods · Quantitative Biology 2018-12-05 William H. Press, John A. Hawkins

Correcting insertions/deletions as well as substitution errors simultaneously plays an important role in DNA-based storage systems as well as in classical communications. This paper deals with the fundamental task of constructing codes that…

Information Theory · Computer Science 2020-05-20 Ilia Smagloy, Lorenz Welter, Antonia Wachter-Zeh, Eitan Yaakobi

DNA-based storage offers unprecedented density and durability, but its scalability is fundamentally limited by the efficiency of parallel strand synthesis. Existing methods either allow unconstrained nucleotide additions to individual…

Information Theory · Computer Science 2025-10-27 Boaz Moav, Ryan Gabrys, Eitan Yaakobi

Because of its high data density and longevity, DNA is emerging as a promising candidate for satisfying increasing data storage needs. Compared to conventional storage media, however, data stored in DNA is subject to a wider range of errors…

Information Theory · Computer Science 2020-08-20 Yuanyuan Tang, Farzad Farnoud

We study the theoretical problem of synthesizing multiple DNA strands under spatial constraints, motivated by large-scale DNA synthesis technologies. In this setting, strands are arranged in an array and synthesized according to a fixed…

Information Theory · Computer Science 2026-01-14 Boaz Moav, Ryan Gabrys, Eitan Yaakobi

Labeling of DNA molecules is a fundamental technique for DNA visualization and analysis. This process was mathematically modeled in [1], where the received sequence indicates the positions of the used labels. In this work, we develop error…

Information Theory · Computer Science 2025-11-04 Dganit Hanania, Eitan Yaakobi

Error-correcting codes over sets, with applications to DNA storage, are studied. The DNA-storage channel receives a set of sequences, and produces a corrupted version of the set, including sequence loss, symbol substitution, symbol…

Information Theory · Computer Science 2021-07-12 Hengjia Wei, Moshe Schwartz

Storing digital data in synthetic DNA faces challenges in ensuring data reliability in the presence of edit errors--deletions, insertions, and substitutions--that occur randomly during various stages of the storage process. Current…

Information Theory · Computer Science 2025-09-11 Serge Kas Hanna

Motivated by mutation processes occurring in in-vivo DNA-storage applications, a channel that mutates stored strings by duplicating substrings as well as substituting symbols is studied. Two models of such a channel are considered: one in…

Information Theory · Computer Science 2020-07-01 Yuanyuan Tang, Yonatan Yehezkeally, Moshe Schwartz, Farzad Farnoud

In array-based DNA synthesis, multiple strands of DNA are synthesized in parallel to reduce the time cost from the sum of their lengths to the length of their shortest common supersequence. To maximize the amount of information that can be…

Information Theory · Computer Science 2024-05-29 Hsin-Po Wang, Chi-Wei Chin

In this paper we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where a data set is represented by an unordered set of $M$ sequences, each of length $L$. Errors…

Information Theory · Computer Science 2020-02-13 Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi