English
Related papers

Related papers: Compressibility-Aware Quantum Algorithms on String…

200 papers

We propose algorithms that, given the input string of length $n$ over integer alphabet of size $\sigma$, construct the Burrows-Wheeler transform (BWT), the permuted longest-common-prefix (PLCP) array, and the LZ77 parsing in…

Data Structures and Algorithms · Computer Science 2020-12-09 Dominik Kempa

The compression of highly repetitive strings (i.e., strings with many repetitions) has been a central research topic in string processing, and quite a few compression methods for these strings have been proposed thus far. Among them, an…

Data Structures and Algorithms · Computer Science 2022-02-17 Takaaki Nishimoto , Shunsuke Kanda , Yasuo Tabei

Indexing highly repetitive strings (i.e., strings with many repetitions) for fast queries has become a central research topic in string processing, because it has a wide variety of applications in bioinformatics and natural language…

Data Structures and Algorithms · Computer Science 2021-04-19 Takaaki Nishimoto , Yasuo Tabei

The Lempel-Ziv factorization (LZ77) and the Run-Length encoded Burrows-Wheeler Transform (RLBWT) are two important tools in text compression and indexing, being their sizes $z$ and $r$ closely related to the amount of text…

Data Structures and Algorithms · Computer Science 2017-02-07 Alberto Policriti , Nicola Prezza

Burrows-Wheeler transform (BWT) is an invertible text transformation that, given a text $T$ of length $n$, permutes its symbols according to the lexicographic order of suffixes of $T$. BWT is one of the most heavily studied algorithms in…

Data Structures and Algorithms · Computer Science 2020-12-09 Dominik Kempa , Tomasz Kociumaka

We study quantum algorithms for several fundamental string problems, including Longest Common Substring, Lexicographically Minimal String Rotation, and Longest Square Substring. These problems have been widely studied in the stringology…

Data Structures and Algorithms · Computer Science 2021-10-22 Shyan Akmal , Ce Jin

Run-length encoding Burrows-Wheeler Transformed strings, resulting in Run-Length BWT (RLBWT), is a powerful tool for processing highly repetitive strings. We propose a new algorithm for online RLBWT working in run-compressed space, which…

Data Structures and Algorithms · Computer Science 2017-10-17 Tatsuya Ohno , Yoshimasa Takabatake , Tomohiro I , Hiroshi Sakamoto

Lempel-Ziv (LZ77) factorization is a fundamental problem in string processing: Greedily partition a given string $T$ from left to right into blocks (called phrases) so that each phrase is either the leftmost occurrence of a letter or the…

Data Structures and Algorithms · Computer Science 2025-06-19 Dominik Kempa , Tomasz Kociumaka

Classically, the edit distance of two length-$n$ strings can be computed in $O(n^2)$ time, whereas an $O(n^{2-\epsilon})$-time procedure would falsify the Orthogonal Vectors Hypothesis. If the edit distance does not exceed $k$, the running…

Data Structures and Algorithms · Computer Science 2023-11-06 Daniel Gibney , Ce Jin , Tomasz Kociumaka , Sharma V. Thankachan

Computing the LZ factorization (or LZ77 parsing) of a string is a computational bottleneck in many diverse applications, including data compression, text indexing, and pattern discovery. We describe new linear time LZ factorization…

Data Structures and Algorithms · Computer Science 2020-12-11 Juha Kärkkäinen , Dominik Kempa , Simon J. Puglisi

The Burrows-Wheeler Transform (BWT) is an invertible text transformation that permutes symbols of a text according to the lexicographical order of its suffixes. BWT is the main component of popular lossless compression programs (such as…

Data Structures and Algorithms · Computer Science 2021-04-13 Dominik Kempa , Tomasz Kociumaka

The Longest Common Substring (LCS) and Longest Palindromic Substring (LPS) are classical problems in computer science, representing fundamental challenges in string processing. Both problems can be solved in linear time using a classical…

Data Structures and Algorithms · Computer Science 2023-09-06 Domenico Cantone , Simone Faro , Arianna Pavone , Caterina Viola

In this paper, we show that the LZ77 factorization of a text T {\in\Sigma^n} can be computed in O(R log n) bits of working space and O(n log R) time, R being the number of runs in the Burrows-Wheeler transform of T reversed. For extremely…

Data Structures and Algorithms · Computer Science 2015-10-22 Nicola Prezza , Alberto Policriti

The Burrows-Wheeler-Transform (BWT), a reversible string transformation, is one of the fundamental components of many current data structures in string processing. It is central in data compression, as well as in efficient query algorithms…

Data Structures and Algorithms · Computer Science 2024-04-16 Sara Giuliani , Shunsuke Inenaga , Zsuzsanna Lipták , Nicola Prezza , Marinella Sciortino , Anna Toffanello

The field of succinct data structures has flourished over the last 16 years. Starting from the compressed suffix array (CSA) by Grossi and Vitter (STOC 2000) and the FM-index by Ferragina and Manzini (FOCS 2000), a number of generalizations…

Data Structures and Algorithms · Computer Science 2016-09-22 Djamal Belazzougui , Fabio Cunial , Juha Kärkkäinen , Veli Mäkinen

The Lempel-Ziv 77 (LZ77) factorization is a fundamental compression scheme widely used in text processing and data compression. In this work, we investigate the time complexity of maintaining the LZ77 factorization of a dynamic string. By…

Data Structures and Algorithms · Computer Science 2025-10-28 Itai Boneh , Shay Golan , Matan Kraus

Longest common substring (LCS), longest palindrome substring (LPS), and Ulam distance (UL) are three fundamental string problems that can be classically solved in near linear time. In this work, we present sublinear time quantum algorithms…

Quantum Physics · Physics 2023-12-29 François Le Gall , Saeed Seddighin

Converting a compressed format of a string into another compressed format without an explicit decompression is one of the central research topics in string processing. We discuss the problem of converting the run-length Burrows-Wheeler…

Data Structures and Algorithms · Computer Science 2019-02-15 Takaaki Nishimoto , Yasuo Tabei

We raise the question of approximating the compressibility of a string with respect to a fixed compression scheme, in sublinear time. We study this question in detail for two popular lossless compression schemes: run-length encoding (RLE)…

Data Structures and Algorithms · Computer Science 2007-06-11 Sofya Raskhodnikova , Dana Ron , Ronitt Rubinfeld , Adam Smith

We present a new semi-external algorithm that builds the Burrows--Wheeler transform variant of Bauer et al. (a.k.a., BCR BWT) in linear expected time. Our method uses compression techniques to reduce computational costs when the input is…

Data Structures and Algorithms · Computer Science 2023-08-15 Diego Díaz-Domínguez , Gonzalo Navarro
‹ Prev 1 2 3 10 Next ›