Related papers: Coding over Sets for DNA Storage

200 papers

In this paper, we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where data is represented by an unordered set of $M$ sequences, each of length $L$. Errors…

Information Theory · Computer Science 2018-05-10 Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi

Error-correcting codes over sets, with applications to DNA storage, are studied. The DNA-storage channel receives a set of sequences, and produces a corrupted version of the set, including sequence loss, symbol substitution, symbol…

Information Theory · Computer Science 2021-07-12 Hengjia Wei, Moshe Schwartz

The process of DNA-based data storage (DNA storage for short) can be mathematically modelled as a communication channel, termed DNA storage channel, whose inputs and outputs are sets of unordered sequences. To design error correcting codes…

Information Theory · Computer Science 2020-06-11 Wentu Song, Kui Cai, Kees A. Schouhamer Immink

DNA-based storage is an emerging storage technology that provides high information density and long duration. Due to the physical constraints in the reading and writing processes, error correction in DNA storage poses several interesting…

Information Theory · Computer Science 2023-10-04 Jin Sima, Netanel Raviv, Moshe Schwartz, Jehoshua Bruck

In this paper, we study achievable rates of concatenated coding schemes over a deoxyribonucleic acid (DNA) storage channel. Our channel model incorporates the main features of DNA-based data storage. First, information is stored on many,…

Information Theory · Computer Science 2020-05-04 Andreas Lenz, Lorenz Welter, Sven Puchinger

To increase the information capacity of DNA storage, composite DNA letters were introduced. We propose a novel channel model for composite DNA in which composite sequences are decomposed into ordered standard non-composite sequences. The…

Information Theory · Computer Science 2025-10-31 Besart Dollma, Ohad Elishco, Eitan Yaakobi

The DNA storage channel is considered, in which the $M$ deoxyribonucleic acid (DNA) molecules comprising each codeword are stored without order, sampled $N$ times with replacement, and then sequenced over a discrete memoryless channel. For…

Information Theory · Computer Science 2022-02-15 Nir Weinberger, Neri Merhav

Data storage in DNA is developing as a possible solution for archival digital data. Recently, to further increase the potential capacity of DNA-based data storage systems, the combinatorial composite DNA synthesis method was suggested. This…

Information Theory · Computer Science 2024-05-28 Omer Sabary, Inbal Preuss, Ryan Gabrys, Zohar Yakhini, Leon Anavy, Eitan Yaakobi

DNA is a leading candidate as the next archival storage media due to its density, durability and sustainability. To read (and write) data DNA storage exploits technology that has been developed over decades to sequence naturally occurring…

Emerging Technologies · Computer Science 2022-05-12 Jasmine Quah, Omer Sella, Thomas Heinis

In this paper, we consider a concatenated coding based class of DNA storage codes in which the selected molecules are constrained to be taken from an "inner" codebook associated with the sequencing channel. This codebook is used in a…

Information Theory · Computer Science 2025-06-19 Yan Hao Ling, Jonathan Scarlett

We consider error-correcting coding for deoxyribonucleic acid (DNA)-based storage using nanopore sequencing. We model the DNA storage channel as a sampling noise channel where the input data is chunked into $M$ short DNA strands, which are…

Information Theory · Computer Science 2024-06-21 Lorenz Welter, Roman Sokolovskii, Thomas Heinis, Antonia Wachter-Zeh, Eirik Rosnes, Alexandre Graell i Amat

This paper introduces a new solution to DNA storage that integrates all three steps of retrieval, namely clustering, reconstruction, and error correction. DNA-correcting codes are presented as a unique solution to the problem of ensuring…

Information Theory · Computer Science 2024-07-02 Avital Boruchovsky, Daniella Bar-Lev, Eitan Yaakobi

DNA, with remarkable properties of high density, durability, and replicability, is one of the most appealing storage media. Emerging DNA storage technologies use composite DNA letters, where information is represented by probability…

Information Theory · Computer Science 2025-03-26 Wenkai Zhang, Zhiying Wang

Storing digital data in synthetic DNA faces challenges in ensuring data reliability in the presence of edit errors--deletions, insertions, and substitutions--that occur randomly during various stages of the storage process. Current…

Information Theory · Computer Science 2025-09-11 Serge Kas Hanna

In [1], the authors proposed a new model of DNA storage system that integrates all three steps of retrieval and introduced the concept of DNA-correcting codes, which guarantees that the output of the storage system can be decoded to the…

Information Theory · Computer Science 2023-11-17 Huawei Wu

Composite DNA is a recent method to increase the base alphabet size in DNA-based data storage. This paper models synthesizing and sequencing of composite DNA and introduces coding techniques to correct substitutions, losses of entire…

Information Theory · Computer Science 2025-10-29 Frederik Walter, Omer Sabary, Antonia Wachter-Zeh, Eitan Yaakobi

Motivated by DNA-based data storage, we investigate a system where digital information is stored in an unordered set of several vectors over a finite alphabet. Each vector begins with a unique index that represents its position in the whole…

Information Theory · Computer Science 2019-01-23 Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi

The ability to store data in the DNA of a living organism has applications in a variety of areas including synthetic biology and watermarking of patented genetically-modified organisms. Data stored in this medium is subject to errors…

Information Theory · Computer Science 2016-11-17 Siddharth Jain, Farzad Farnoud, Moshe Schwartz, Jehoshua Bruck

The DNA storage channel is considered, in which a codeword is comprised of $M$ unordered DNA molecules. At reading time, $N$ molecules are sampled with replacement, and then each molecule is sequenced. A coded-index concatenated-coding…

Information Theory · Computer Science 2022-05-23 Nir Weinberger

We provide an overview of current approaches to DNA-based storage system design and accompanying synthesis, sequencing and editing methods. We also introduce and analyze a suite of new constrained coding schemes for both archival and random…

Emerging Technologies · Computer Science 2015-07-08 S. M. Hossein Tabatabaei Yazdi, Han Mao Kiah, Eva Ruiz Garcia, Jian Ma, Huimin Zhao, Olgica Milenkovic