Related papers: Coding for Polymer-Based Data Storage

200 papers

Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for unique string…

Information Theory · Computer Science 2019-04-22 Srilakshmi Pattabiraman, Ryan Gabrys, Olgica Milenkovic

We consider the problem of correcting mass readout errors in information encoded in binary polymer strings. Our work builds on results for string reconstruction problems using composition multisets [Acharya et al., 2015] and the unique…

Information Theory · Computer Science 2020-01-15 Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

Synthetic polymer-based storage seems to be a particularly promising candidate that could help to cope with the ever-increasing demand for archival storage requirements. It involves designing molecules of distinct masses to represent the…

Information Theory · Computer Science 2026-04-02 Anisha Banerjee, Antonia Wachter-Zeh, Eitan Yaakobi

The problem of reconstructing strings from their substring spectra has a long history and, in its simplest incarnation, asks under which conditions the spectrum uniquely determines the string. We study the problem of coded…

Information Theory · Computer Science 2019-04-24 Ryan Gabrys, Olgica Milenkovic

Motivated by studies of data retrieval in polymer-based storage systems, we consider the problem of reconstructing a multiset of binary strings that have the same length and the same weight from the compositions of their prefixes and…

Discrete Mathematics · Computer Science 2024-11-07 Yaoyu Yang, Zitan Chen

Motivated by applications in polymer-based data storage, we study the problem of reconstructing a string from part of its composition multiset. We give a full description of the structure of the strings that cannot be uniquely reconstructed…

Information Theory · Computer Science 2022-10-18 Zuo Ye, Ohad Elishco

The problem of string reconstruction from substring information has found many applications due to its relevance in DNA- and polymer-based data storage. One practically important and challenging paradigm requires reconstructing mixtures of…

Information Theory · Computer Science 2020-10-22 Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

The problem of storing large amounts of information safely for a long period of time has become essential. Among the most promising new data storage media are polymer-based data storage systems, such as the DNA-storage system. These…

Information Theory · Computer Science 2025-04-21 Ville Junnila, Tero Laihonen, Tuomo Lehtilä

The problem of reconstructing strings from substring information has found many applications due to its importance in genomic data sequencing and DNA- and polymer-based data storage. One practically important and challenging paradigm…

Information Theory · Computer Science 2021-10-07 Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

Motivated by mass-spectrometry protein sequencing, we consider a simply-stated problem of reconstructing a string from the multiset of its substring compositions. We show that all strings of length 7, one less than a prime, or one less than…

Discrete Mathematics · Computer Science 2014-03-12 Jayadev Acharya, Hirakendu Das, Olgica Milenkovic, Alon Orlitsky, Shengjun Pan

We propose a new compression scheme for genomic data given as sequence fragments called reads. The scheme uses a reference genome at the decoder side only, freeing the encoder from the burdens of storing references and performing…

Information Theory · Computer Science 2023-02-10 Yotam Gershon, Yuval Cassuto

This paper studies the problem of encoding messages into sequences which can be uniquely recovered from some noisy observations about their substrings. The observed reads comprise consecutive substrings with some given minimum overlap. This…

Information Theory · Computer Science 2023-12-11 Hengjia Wei, Moshe Schwartz, Gennian Ge

In the trace reconstruction problem, an unknown source string $x \in \{0,1\}^n$ is transmitted through a probabilistic deletion channel which independently deletes each bit with some fixed probability $\delta$ and concatenates…

Data Structures and Algorithms · Computer Science 2020-12-09 Xi Chen, Anindya De, Chin Ho Lee, Rocco A. Servedio, Sandip Sinha

The number of zeros and the number of ones in a binary string are referred to as the composition of the string, and the prefix-suffix compositions of a string are a multiset formed by the compositions of the prefixes and suffixes of all…

Information Theory · Computer Science 2025-03-18 Zitan Chen

DNA as a data storage medium has several advantages, including far greater data density compared to electronic media. We propose that schemes for data storage in the DNA of living organisms may benefit from studying the reconstruction…

Information Theory · Computer Science 2019-09-10 Yonatan Yehezkeally, Moshe Schwartz

DNA data storage has recently attracted much attention due to its durability and extremely high information density (bits per gram). In this work, we propose a hybrid coding strategy comprising generalized…

Information Theory · Computer Science 2021-12-20 Yixin Wang, Li Deng, Md. Noor-A-Rahim, Erry Gunawan, Yong L. Guan, Zhi P. Shi, Chueh L. Poh

A system is proposed for imitation-resistant transmission of encrypted information in wireless communication networks, based on redundant residue polynomial codes. The distinguishing feature of this solution is the combination of methods for…

Cryptography and Security · Computer Science 2018-09-10 Dmitry Samoylenko, Mikhail Eremeev, Oleg Finko, Sergey Dichenko

Reconstruction codes are generalizations of error-correcting codes that can correct errors by a given number of noisy reads. The study of such codes was initiated by Levenshtein in 2001 and developed recently due to applications in modern…

Information Theory · Computer Science 2023-02-21 Zuo Ye, Xin Liu, Xiande Zhang, Gennian Ge

Motivated by applications in polymer-based data storage, we introduced the new problem of characterizing the code rate of, and designing, constant-weight binary $B_2$-sequences. Binary $B_2$-sequences are collections of binary strings of length…

Information Theory · Computer Science 2023-03-24 Jin Sima, Yun-Han Li, Ilan Shomorony, Olgica Milenkovic

We present a construction of subspace codes along with an efficient algorithm for list decoding from both insertions and deletions, handling an information-theoretically maximum fraction of these with polynomially small rate. Our…

Information Theory · Computer Science 2012-02-03 Venkatesan Guruswami, Srivatsan Narayanan, Carol Wang