Related papers: Generalized Straight-Line Programs

Iterated Straight-Line Programs

We explore an extension to straight-line programs (SLPs) that outperforms, for some text families, the measure $\delta$ based on substring complexity, a lower bound for most measures and compressors exploiting repetitiveness (which are…

Data Structures and Algorithms · Computer Science 2024-02-16 Gonzalo Navarro , Cristian Urbina

Balancing Run-Length Straight-Line Programs*

It was recently proved that any SLP generating a given string $w$ can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We show that this result also holds for RLSLPs, which are SLPs extended with…

Data Structures and Algorithms · Computer Science 2022-06-28 Gonzalo Navarro , Francisco Olivares , Cristian Urbina

Compression by Contracting Straight-Line Programs

In grammar-based compression a string is represented by a context-free grammar, also called a straight-line program (SLP), that generates only that string. We refine a recent balancing result stating that one can transform an SLP of size…

Data Structures and Algorithms · Computer Science 2021-07-02 Moses Ganardi

Detecting regularities on grammar-compressed strings

We solve the problems of detecting and counting various forms of regularities in a string represented as a Straight Line Program (SLP). Given an SLP of size $n$ that represents a string $s$ of length $N$, our algorithm compute all runs and…

Data Structures and Algorithms · Computer Science 2013-04-29 Tomohiro I , Wataru Matsubara , Kouji Shimohira , Shunsuke Inenaga , Hideo Bannai , Masayuki Takeda , Kazuyuki Narisawa , Ayumi Shinohara

Solving Classical String Problems on Compressed Texts

Here we study the complexity of string problems as a function of the size of a program that generates input. We consider straight-line programs (SLP), since all algorithms on SLP-generated strings could be applied to processing…

Data Structures and Algorithms · Computer Science 2007-05-23 Yury Lifshits

Balancing Two-Dimensional Straight-Line Programs

We consider building, given a straight-line program (SLP) consisting of $g$ productions deriving a two-dimensional string $T$ of size $N\times N$, a structure capable of providing random access to any character of $T$. For one-dimensional…

Data Structures and Algorithms · Computer Science 2025-12-01 Itai Boneh , Estéban Gabory , Paweł Gawrychowski , Adam Górkiewicz

Speeding-up $q$-gram mining on grammar-based compressed texts

We present an efficient algorithm for calculating $q$-gram frequencies on strings represented in compressed form, namely, as a straight line program (SLP). Given an SLP $\mathcal{T}$ of size $n$ that represents string $T$, the algorithm…

Data Structures and Algorithms · Computer Science 2013-05-27 Keisuke Goto , Hideo Bannai , Shunsuke Inenaga , Masayuki Takeda

Faster fully compressed pattern matching by recompression

In this paper, a fully compressed pattern matching problem is studied. The compression is represented by straight-line programs (SLPs), i.e. a context-free grammars generating exactly one string; the term fully means that both the pattern…

Data Structures and Algorithms · Computer Science 2013-06-26 Artur Jeż

On the Smallest Size of Internal Collage Systems

A Straight-Line Program (SLP) for a string $T$ is a context-free grammar in Chomsky normal form that derives $T$ only, which can be seen as a compressed form of $T$. Kida et al.\ introduced collage systems [Theor. Comput. Sci., 2003] to…

Data Structures and Algorithms · Computer Science 2026-01-27 Soichiro Migita , Kyotaro Uehata , Tomohiro I

Efficient Lyndon factorization of grammar compressed text

We present an algorithm for computing the Lyndon factorization of a string that is given in grammar compressed form, namely, a Straight Line Program (SLP). The algorithm runs in $O(n^4 + mn^3h)$ time and $O(n^2)$ space, where $m$ is the…

Data Structures and Algorithms · Computer Science 2013-04-29 Tomohiro I , Yuto Nakashima , Shunsuke Inenaga , Hideo Bannai , Masayuki Takeda

Space-efficient conversions from SLPs

We give algorithms that, given a straight-line program (SLP) with $g$ rules that generates (only) a text $T [1..n]$, builds within $O(g)$ space the Lempel-Ziv (LZ) parse of $T$ (of $z$ phrases) in time $O(n\log^2 n)$ or in time…

Data Structures and Algorithms · Computer Science 2023-10-11 Travis Gagie , Adrián Goga , Artur Jeż , Gonzalo Navarro

Logarithmic-Time Internal Pattern Matching Queries in Compressed and Dynamic Texts

Internal Pattern Matching (IPM) queries on a text $T$, given two fragments $X$ and $Y$ of $T$ such that $|Y|<2|X|$, ask to compute all exact occurrences of $X$ within $Y$. IPM queries have been introduced by Kociumaka, Radoszewski, Rytter,…

Data Structures and Algorithms · Computer Science 2025-03-06 Anouk Duyster , Tomasz Kociumaka

Computing convolution on grammar-compressed text

The convolution between a text string $S$ of length $N$ and a pattern string $P$ of length $m$ can be computed in $O(N \log m)$ time by FFT. It is known that various types of approximate string matching problems are reducible to…

Data Structures and Algorithms · Computer Science 2013-03-19 Toshiya Tanaka , Tomohiro I , Shunsuke Inenaga , Hideo Bannai , Masayuki Takeda

Grammar-compressed Self-index with Lyndon Words

We introduce a new class of straight-line programs (SLPs), named the Lyndon SLP, inspired by the Lyndon trees (Barcelo, 1990). Based on this SLP, we propose a self-index data structure of $O(g)$ words of space that can be built from a…

Data Structures and Algorithms · Computer Science 2020-04-28 Kazuya Tsuruta , Dominik Köppl , Yuto Nakashima , Shunsuke Inenaga , Hideo Bannai , Masayuki Takeda

Efficient LZ78 factorization of grammar compressed text

We present an efficient algorithm for computing the LZ78 factorization of a text, where the text is represented as a straight line program (SLP), which is a context free grammar in the Chomsky normal form that generates a single string.…

Data Structures and Algorithms · Computer Science 2013-05-27 Hideo Bannai , Shunsuke Inenaga , Masayuki Takeda

Tree compression using string grammars

We study the compressed representation of a ranked tree by a (string) straight-line program (SLP) for its preorder traversal, and compare it with the well-studied representation by straight-line context free tree grammars (which are also…

Formal Languages and Automata Theory · Computer Science 2015-09-29 Moses Ganardi , Danny Hucke , Markus Lohrey , Eric Noeth

A Succinct Grammar Compression

We solve an open problem related to an optimal encoding of a straight line program (SLP), a canonical form of grammar compression deriving a single string deterministically. We show that an information-theoretic lower bound for representing…

Data Structures and Algorithms · Computer Science 2013-06-18 Yasuo Tabei , Yoshimasa Takabatake , Hiroshi Sakamoto

Universal Length Generalization with Turing Programs

Length generalization refers to the ability to extrapolate from short training sequences to long test sequences and is a challenge for current large language models. While prior work has proposed some architecture or data format changes to…

Machine Learning · Computer Science 2024-07-04 Kaiying Hou , David Brandfonbrener , Sham Kakade , Samy Jelassi , Eran Malach

Sublinear-time Reductions for Big Data Computing

With the rapid popularization of big data, the dichotomy between tractable and intractable problems in big data computing has been shifted. Sublinear time, rather than polynomial time, has recently been regarded as the new standard of…

Computational Complexity · Computer Science 2021-12-01 Xiangyu Gao , Jianzhong Li , Dongjing Miao

Faster subsequence recognition in compressed strings

Computation on compressed strings is one of the key approaches to processing massive data sets. We consider local subsequence recognition problems on strings compressed by straight-line programs (SLP), which is closely related to…

Data Structures and Algorithms · Computer Science 2011-11-10 Alexander Tiskin