English
Related papers

Related papers: String Reconstruction from Substring Compositions

200 papers

Motivated by applications in polymer-based data storage, we study the problem of reconstructing a string from part of its composition multiset. We give a full description of the structure of the strings that cannot be uniquely reconstructed…

Information Theory · Computer Science 2022-10-18 Zuo Ye , Ohad Elishco

The problem of string reconstruction from substring information has found many applications due to its relevance in DNA- and polymer-based data storage. One practically important and challenging paradigm requires reconstructing mixtures of…

Information Theory · Computer Science 2020-10-22 Ryan Gabrys , Srilakshmi Pattabiraman , Olgica Milenkovic

We consider the problem of binary string reconstruction from the multiset of its substring compositions, i.e., referred to as the substring composition multiset, first introduced and studied by Acharya et al. We introduce a new algorithm…

Information Theory · Computer Science 2023-06-05 Utkarsh Gupta , Hessam Mahdavifar

Motivated by studies of data retrieval in polymer-based storage systems, we consider the problem of reconstructing a multiset of binary strings that have the same length and the same weight from the compositions of their prefixes and…

Discrete Mathematics · Computer Science 2024-11-07 Yaoyu Yang , Zitan Chen

The problem of reconstructing strings from substring information has found many applications due to its importance in genomic data sequencing and DNA- and polymer-based data storage. One practically important and challenging paradigm…

Information Theory · Computer Science 2021-10-07 Ryan Gabrys , Srilakshmi Pattabiraman , Olgica Milenkovic

The problem of reconstructing strings from their substring spectra has a long history and in its most simple incarnation asks for determining under which conditions the spectrum uniquely determines the string. We study the problem of coded…

Information Theory · Computer Science 2019-04-24 Ryan Gabrys , Olgica Milenkovic

The problem of string reconstruction based on its substrings spectrum has received significant attention recently due to its applicability to DNA data storage and sequencing. In contrast to previous works, we consider in this paper a setup…

Information Theory · Computer Science 2022-05-11 Yonatan Yehezkeally , Sagi Marcovich , Eitan Yaakobi

The problem called "String reconstruction from substrings" is a mathematical model of sequencing by hybridization that plays an important role in DNA sequencing. In this problem, we are given a blackbox oracle holding an unknown string…

Data Structures and Algorithms · Computer Science 2018-08-03 Kazuo Iwama , Junichi Teruyama , Shuntaro Tsuyama

This paper studies reconstruction of strings based upon their substrings spectrum. Under this paradigm, it is assumed that all substrings of some fixed length are received and the goal is to reconstruct the string. While many existing works…

Information Theory · Computer Science 2021-06-02 Sagi Marcovich , Eitan Yaakobi

The problem of reconstructing a sequence from the set of its length-$k$ substrings has received considerable attention due to its various applications in genomics. We study an uncoded version of this problem where multiple random sources…

Information Theory · Computer Science 2023-05-11 Kel Levick , Ilan Shomorony

In the trace reconstruction problem, one seeks to reconstruct a binary string $s$ from a collection of traces, each of which is obtained by passing $s$ through a deletion channel. It is known that $\exp(\tilde O(n^{1/5}))$ traces suffice to…

Information Theory · Computer Science 2022-10-21 Kayvon Mazooji , Ilan Shomorony

The number of zeros and the number of ones in a binary string are referred to as the composition of the string, and the prefix-suffix compositions of a string are a multiset formed by the compositions of the prefixes and suffixes of all…

Information Theory · Computer Science 2025-03-18 Zitan Chen

Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for unique string…

Information Theory · Computer Science 2019-04-22 Srilakshmi Pattabiraman , Ryan Gabrys , Olgica Milenkovic

In the usual trace reconstruction problem, the goal is to exactly reconstruct an unknown string of length $n$ after it passes through a deletion channel many times independently, producing a set of traces (i.e., random subsequences of the…

Data Structures and Algorithms · Computer Science 2020-12-17 Sami Davies , Miklos Z. Racz , Cyrus Rashtchian , Benjamin G. Schiffer

This paper introduces a new family of reconstruction codes which is motivated by applications in DNA data storage and sequencing. In such applications, DNA strands are sequenced by reading some subset of their substrings. While previous…

Information Theory · Computer Science 2023-04-21 Yonatan Yehezkeally , Daniella Bar-Lev , Sagi Marcovich , Eitan Yaakobi

Graphs are a powerful tool for analyzing large data sets, but many real-world phenomena involve interactions that go beyond the simple pairwise relationships captured by a graph. In this paper we introduce and study a simple combinatorial…

Data Structures and Algorithms · Computer Science 2025-11-25 Elise Tate , Joshua A. Grochow

This paper investigates the number of quantum queries made to solve the problem of reconstructing an unknown string from its substrings in a certain query model. More concretely, the goal of the problem is to identify an unknown string $S$…

We generalize the problem of reconstructing strings from their substring compositions first introduced by Acharya et al. in 2015 motivated by polymer-based advanced data storage systems utilizing mass spectrometry. Namely, we see strings as…

Combinatorics · Mathematics 2025-04-02 Antoine Dailly , Tuomo Lehtilä

This paper introduces a new family of reconstruction codes which is motivated by applications in DNA data storage and sequencing. In such applications, DNA strands are sequenced by reading some subset of their substrings. While previous…

Information Theory · Computer Science 2022-05-10 Yonatan Yehezkeally , Daniella Bar-Lev , Sagi Marcovich , Eitan Yaakobi

Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for both unique string…

Information Theory · Computer Science 2021-06-29 Srilakshmi Pattabiraman , Ryan Gabrys , Olgica Milenkovic
‹ Prev 1 2 3 10 Next ›