English
Related papers

Related papers: Improved Lower Bounds for Constant GC-Content DNA …

200 papers

We derive theoretical upper and lower bounds on the maximum size of DNA codes of length n with constant GC-content w and minimum Hamming distance d, both with and without the additional constraint that the minimum Hamming distance between…

Combinatorics · Mathematics 2007-05-23 Oliver D. King

DNA storage has emerged as an important area of research. The reliability of DNA storage system depends on designing the DNA strings (called DNA codes) that are sufficiently dissimilar. In this work, we introduce DNA codes that satisfy a…

Information Theory · Computer Science 2022-11-28 Krishna Gopal Benerjee , Sourav Deb , Manish K Gupta

DNA Data storage has recently attracted much attention due to its durable preservation and extremely high information density (bits per gram) properties. In this work, we propose a hybrid coding strategy comprising of generalized…

Information Theory · Computer Science 2021-12-20 Yixin Wang , Li Deng , Md. Noor-A-Rahim , Erry Gunawan , Yong L. Guan , Zhi P. Shi , Chueh L. Poh

We propose coding techniques that limit the length of homopolymers runs, ensure the GC-content constraint, and are capable of correcting a single edit error in strands of nucleotides in DNA-based data storage systems. In particular, for…

Information Theory · Computer Science 2020-01-10 Tuan Thanh Nguyen , Kui Cai , Kees A. Schouhamer Immink , Han Mao Kiah

As DNA data storage moves closer to practical deployment, minimizing sequencing coverage depth is essential to reduce both operational costs and retrieval latency. This paper addresses the recently studied Random Access Problem, which…

Information Theory · Computer Science 2026-01-13 Chen Wang , Eitan Yaakobi

Linear error-correcting codes form the mathematical backbone of modern digital communication and storage systems, but identifying champion linear codes (linear codes achieving or exceeding the best known minimum Hamming distance) remains…

Information Theory · Computer Science 2025-12-16 Yang-Hui He , Alexander Kasprzyk , Q Le , Dmitrii Riabchenko

Composite DNA is a recent novel method to increase the information capacity of DNA-based data storage above the theoretical limit of 2 bits/symbol. In this method, every composite symbol does not store a single DNA nucleotide but a mixture…

Information Theory · Computer Science 2025-01-22 Tuan Thanh Nguyen , Chen Wang , Kui Cai , Yiwei Zhang , Zohar Yakhini

Local Search problem, which finds a local minimum of a black-box function on a given graph, is of both practical and theoretical importance to combinatorial optimization, complexity theory and many other areas in theoretical computer…

Quantum Physics · Physics 2007-05-23 Shengyu Zhang

We describe properties and constructions of constraint-based codes for DNA-based data storage which account for the maximum repetition length and AT/GC balance. We present algorithms for computing the number of sequences with maximum…

Information Theory · Computer Science 2018-12-18 Kees A. Schouhamer Immink , Kui Cai

Constrained clustering leverages limited domain knowledge to improve clustering performance and interpretability, but incorporating pairwise must-link and cannot-link constraints is an NP-hard challenge, making global optimization…

Machine Learning · Computer Science 2025-10-28 Pedro Chumpitaz-Flores , My Duong , Cristobal Heredia , Kaixun Hua

In DNA-based data storage, DNA codes with biochemical constraints and error correction are designed to protect data reliability. Single-stranded DNA sequences with secondary structure avoidance (SSA) help to avoid undesirable secondary…

Information Theory · Computer Science 2023-07-04 Shu Liu , Chaoping Xing , Yaqian Zhang

DNA strings and their properties are widely studied since last 20 years due to its applications in DNA computing. In this area, one designs a set of DNA strings (called DNA code) which satisfies certain thermodynamic and combinatorial…

Information Theory · Computer Science 2025-10-03 Krishna Gopal Benerjee , Manish K Gupta

DNA codes have garnered significant interest due to their utilization in digital media storage, cryptography, and DNA computing. In this paper, we first extend the results of constructing reversible group codes \cite{Cengellenmis} and…

Information Theory · Computer Science 2024-09-11 Guodong Wang , Hongwei Liu , Xueyan Chen

We consider the problem of efficiently designing sets (codes) of equal-length DNA strings (words) that satisfy certain combinatorial constraints. This problem has numerous motivations including DNA computing and DNA self-assembly. Previous…

Data Structures and Algorithms · Computer Science 2007-05-23 Ming-Yang Kao , Manan Sanghi , Robert Schweller

Storing digital data in synthetic DNA faces challenges in ensuring data reliability in the presence of edit errors--deletions, insertions, and substitutions--that occur randomly during various stages of the storage process. Current…

Information Theory · Computer Science 2025-09-11 Serge Kas Hanna

We study the amount of reliable information that can be stored in a DNA-based storage system composed of short DNA molecules. In this regime, Shomorony and Heckel (2022) put forward a conjecture on the scaling of the number of information…

Information Theory · Computer Science 2025-11-19 Ran Tamir , Nir Weinberger , Albert Guillén i Fàbregas

Hopping cyclic codes (HCCs) are (non-linear) cyclic codes with the additional property that the $n$ cyclic shifts of every given codeword are all distinct, where $n$ is the code length. Constant weight binary hopping cyclic codes are also…

Information Theory · Computer Science 2023-01-06 Chenyang Zhang , Chong Shangguan , Gennian Ge

Regenerating codes allow distributed storage systems to recover from the loss of a storage node while transmitting the minimum possible amount of data across the network. We present a systematic computer search for optimal systematic…

Information Theory · Computer Science 2009-10-14 Daniel Cullina , Alexandros G. Dimakis , Tracey Ho

In this paper we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where a data set is represented by an unordered set of $M$ sequences, each of length $L$. Errors…

Information Theory · Computer Science 2020-02-13 Andreas Lenz , Paul H. Siegel , Antonia Wachter-Zeh , Eitan Yaakobi

The problem of fast items retrieval from a fixed collection is often encountered in most computer science areas, from operating system components to databases and user interfaces. We present an approach based on hash tables that focuses on…

Neural and Evolutionary Computing · Computer Science 2020-07-17 Dan Domnita , Ciprian Oprisa
‹ Prev 1 2 3 10 Next ›