Related papers: Capacity-Approaching Constrained Codes with Error …

200 papers

In DNA-based data storage, DNA codes with biochemical constraints and error correction are designed to protect data reliability. Single-stranded DNA sequences with secondary structure avoidance (SSA) help to avoid undesirable secondary…

Information Theory · Computer Science 2023-07-04 Shu Liu, Chaoping Xing, Yaqian Zhang

DNA data storage has recently attracted much attention due to its durable preservation and extremely high information density (bits per gram) properties. In this work, we propose a hybrid coding strategy comprising generalized…

Information Theory · Computer Science 2021-12-20 Yixin Wang, Li Deng, Md. Noor-A-Rahim, Erry Gunawan, Yong L. Guan, Zhi P. Shi, Chueh L. Poh

DNA has emerged as a promising alternative for long-term data storage due to its high capacity, durability, and low-energy potential. However, storing data in DNA presents several challenges. First, it requires complex and costly…

Other Quantitative Biology · Quantitative Biology 2025-11-20 Sara Al Sayyed, Aline Roumy, Thomas Maugey

As a medium for cold data storage, DNA stands out as it promises significant gains in storage capacity and lifetime. However, it comes with its own data processing challenges to overcome. Constrained codes over the DNA alphabet…

Information Theory · Computer Science 2025-10-08 Canberk İrimağzı, Ahmed Hareedy

We describe properties and constructions of constraint-based codes for DNA-based data storage which account for the maximum repetition length and AT/GC balance. We present algorithms for computing the number of sequences with maximum…

Information Theory · Computer Science 2018-12-18 Kees A. Schouhamer Immink, Kui Cai

DNA synthesis is considered one of the most expensive components in current DNA storage systems. In this paper, focusing on a common synthesis machine, which generates multiple DNA strands in parallel following a fixed supersequence, we…

Information Theory · Computer Science 2025-05-13 Yajuan Liu, Tolga M. Duman

DNA is an attractive medium for digital data storage. When data is stored on DNA, errors occur, which makes error-correcting coding techniques critical for reliable DNA data storage. To reduce the errors, a common technique is to include…

Information Theory · Computer Science 2024-06-27 Franziska Weindel, Andreas L. Gimpel, Robert N. Grass, Reinhard Heckel

Composite DNA is a recent novel method to increase the information capacity of DNA-based data storage above the theoretical limit of 2 bits/symbol. In this method, every composite symbol does not store a single DNA nucleotide but a mixture…

Information Theory · Computer Science 2025-01-22 Tuan Thanh Nguyen, Chen Wang, Kui Cai, Yiwei Zhang, Zohar Yakhini

DNA strands serve as a storage medium for $4$-ary data over the alphabet $\{A,T,G,C\}$. DNA data storage promises formidable information density, long-term durability, and ease of replicability. However, information in this intriguing…

Information Theory · Computer Science 2024-08-06 Canberk İrimağzı, Yusuf Uslan, Ahmed Hareedy

Storing digital data in synthetic DNA faces challenges in ensuring data reliability in the presence of edit errors--deletions, insertions, and substitutions--that occur randomly during various stages of the storage process. Current…

Information Theory · Computer Science 2025-09-11 Serge Kas Hanna

DNA storage has emerged as an important area of research. The reliability of a DNA storage system depends on designing DNA strings (called DNA codes) that are sufficiently dissimilar. In this work, we introduce DNA codes that satisfy a…

Information Theory · Computer Science 2022-11-28 Krishna Gopal Benerjee, Sourav Deb, Manish K Gupta

We describe a strategy for constructing codes for DNA-based information storage by serial composition of weighted finite-state transducers. The resulting state machines can integrate correction of substitution errors; synchronization by…

Information Theory · Computer Science 2016-11-18 Ian Holmes

In this paper, we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where data is represented by an unordered set of $M$ sequences, each of length $L$. Errors…

Information Theory · Computer Science 2018-05-10 Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi

DNA, with remarkable properties of high density, durability, and replicability, is one of the most appealing storage media. Emerging DNA storage technologies use composite DNA letters, where information is represented by probability…

Information Theory · Computer Science 2025-03-26 Wenkai Zhang, Zhiying Wang

Synthetic DNA can in principle be used for the archival storage of arbitrary data. Because errors are introduced during DNA synthesis, storage, and sequencing, an error-correcting code (ECC) is necessary for error-free recovery of the data.…

Quantitative Methods · Quantitative Biology 2018-12-05 William H. Press, John A. Hawkins

DNA is an attractive candidate for data storage. Its millennial durability and nanometer scale offer exceptional data density and longevity. Its relevance to medical applications also drives advances in DNA-related biotechnology. To protect…

Information Theory · Computer Science 2025-11-25 Yu-Ting Lin, Hsin-Po Wang, Venkatesan Guruswami

In this paper, we consider a concatenated coding based class of DNA storage codes in which the selected molecules are constrained to be taken from an "inner" codebook associated with the sequencing channel. This codebook is used in a…

Information Theory · Computer Science 2025-06-19 Yan Hao Ling, Jonathan Scarlett

Due to its high data density and longevity, DNA is considered a promising medium for satisfying ever-increasing data storage needs. However, the diversity of errors that occur in DNA sequences makes efficient error-correction a challenging…

Information Theory · Computer Science 2020-11-12 Yuanyuan Tang, Farzad Farnoud

Labeling of DNA molecules is a fundamental technique for DNA visualization and analysis. This process was mathematically modeled in [1], where the received sequence indicates the positions of the used labels. In this work, we develop error…

Information Theory · Computer Science 2025-11-04 Dganit Hanania, Eitan Yaakobi

We study the amount of reliable information that can be stored in a DNA-based storage system with noisy sequencing, where each codeword is composed of short DNA molecules. We analyze a concatenated coding scheme, where the outer code is…

Information Theory · Computer Science 2026-05-19 Ran Tamir, Nir Weinberger, Albert Guillén i Fàbregas