Testing Suffixient Sets
Data Structures and Algorithms
2025-06-11 v1
Abstract
Suffixient sets are a novel prefix array (PA) compression technique based on subsampling PA (rather than compressing the entire array like previous techniques used to do): by storing very few entries of PA (in fact, a compressed number of entries), one can prove that pattern matching via binary search is still possible provided that random access is available on the text. In this paper, we tackle the problems of determining whether a given subset of text positions is (1) a suffixient set or (2) a suffixient set of minimum cardinality. We provide linear-time algorithms solving these problems.
Cite
@article{arxiv.2506.08225,
title = {Testing Suffixient Sets},
author = {Davide Cenzato and Francisco Olivares and Nicola Prezza},
journal= {arXiv preprint arXiv:2506.08225},
year = {2025}
}