Related papers: Faster Wavelet Tree Queries

200 papers

Rank and select queries are basic operations on sequences, with applications in compressed text indexes and other space-efficient data structures. One of the standard data structures supporting these queries is the wavelet tree. In this…

Data Structures and Algorithms · Computer Science 2026-04-14 Eric Chiu , Dominik Kempa

Wavelet trees are widely used in the representation of sequences, permutations, text collections, binary relations, discrete points, and other succinct data structures. We show, however, that this still falls short of exploiting all of the…

Data Structures and Algorithms · Computer Science 2010-11-23 Travis Gagie , Gonzalo Navarro , Simon J. Puglisi

The wavelet tree (Grossi et al. [SODA, 2003]) and wavelet matrix (Claude et al. [Inf. Syst., 47:15–32, 2015]) are compact indices for texts over an alphabet $[0,\sigma)$ that support rank, select and access queries in $O(\lg \sigma)$ time.…

Data Structures and Algorithms · Computer Science 2017-11-13 Johannes Fischer , Florian Kurpicz , Marvin Löbel

Rank and select queries on bitmaps are essential building bricks of many compressed data structures, including text indexes, membership and range supporting spatial data structures, compressed graphs, and more. Theoretically considered yet…

Data Structures and Algorithms · Computer Science 2016-05-13 Szymon Grabowski , Marcin Raniszewski

Bit vectors are fundamental building blocks of many succinct data structures. They can be used to represent graphs, are an important part of many text indices in the form of the wavelet tree, and can be used to encode ordered sequences of…

Data Structures and Algorithms · Computer Science 2022-11-08 Florian Kurpicz

We consider succinct data structures for representing a set of $n$ horizontal line segments in the plane given in rank space to support segment access, segment selection, and segment rank queries. A segment access query…

Data Structures and Algorithms · Computer Science 2024-12-09 Philip Bille , Inge Li Gørtz , Simon R. Tarnow

Ranked document retrieval is a fundamental task in search engines. Such queries are solved with inverted indexes that require additional 45%-80% of the compressed text space, and take tens to hundreds of microseconds per query. In this…

Information Retrieval · Computer Science 2015-03-20 Nieves R. Brisaboa , Ana Cerdeira-Pena , Gonzalo Navarro , Oscar Pedreira

We show how to use a balanced wavelet tree as a data structure that stores a list of numbers and supports efficient range quantile queries. A range quantile query takes a rank and the endpoints of a sublist and returns the number with…

Data Structures and Algorithms · Computer Science 2015-05-13 Travis Gagie , Simon J. Puglisi , Andrew Turpin

An indexed sequence of strings is a data structure for storing a string sequence that supports random access, searching, range counting and analytics operations, both for exact matches and prefix search. String sequences lie at the core of…

Data Structures and Algorithms · Computer Science 2012-04-17 Roberto Grossi , Giuseppe Ottaviano

Rank and select data structures seek to preprocess a bit vector to quickly answer two kinds of queries: rank(i) gives the number of 1 bits in slots 0 through i, and select(j) gives the first slot s with rank(s) = j. A succinct data…

Data Structures and Algorithms · Computer Science 2024-05-09 Matthew D. Laws , Jocelyn Bliven , Kit Conklin , Elyes Laalai , Samuel McCauley , Zach S. Sturdevant

We present an improved wavelet tree construction algorithm and discuss its applications to a number of rank/select problems for integer keys and strings. Given a string of length n over an alphabet of size $\sigma\leq n$, our method builds…

Data Structures and Algorithms · Computer Science 2015-05-18 Maxim Babenko , Paweł Gawrychowski , Tomasz Kociumaka , Tatiana Starikovskaya

Suffix trees are a fundamental data structure in stringology, but their space usage, though linear, is an important problem for its applications. We design and implement a new compressed suffix tree targeted to highly repetitive texts, such…

Data Structures and Algorithms · Computer Science 2019-02-12 Manuel Cáceres , Gonzalo Navarro

It has been shown in the indexing literature that there is an essential difference between prefix/range searches on the one hand, and predecessor/rank searches on the other hand, in that the former provably allows faster query resolution.…

Data Structures and Algorithms · Computer Science 2018-04-16 Djamal Belazzougui , Paolo Boldi , Rasmus Pagh , Sebastiano Vigna

This paper studies the performances of BERT combined with tree structure in short sentence ranking task. In retrieval-based question answering system, we retrieve the most similar question of the query question by ranking all the questions…

Computation and Language · Computer Science 2019-09-09 Tong Guo , Huilin Gao

This work presents a discovery to advance the wisdom in a particular Succinct Data Structure: Wavelet Tree (Grossi, Gupta, and Vitter 2003). The discovery is first made by showing the feasibility of Reversed Indexes = Values: for integers…

Information Theory · Computer Science 2024-02-27 Xiangjun Peng

One of the central problems in the design of compressed data structures is the efficient support for rank and select queries on bitvectors. These two operations form the backbone of more complex data structures (such as wavelet trees) used…

Data Structures and Algorithms · Computer Science 2025-09-09 Eric Chiu , Dominik Kempa

The rank problem in succinct data structures asks to preprocess an array A[1..n] of bits into a data structure using as close to n bits as possible, and answer queries of the form $\mathrm{rank}(k) = \sum_{i=1}^{k} A[i]$. The problem has been intensely…

Data Structures and Algorithms · Computer Science 2009-07-08 Mihai Patrascu

Succinct data structures give space-efficient representations of large amounts of data without sacrificing performance. They rely on cleverly designed data representations and algorithms. We present here the formalization in Coq/SSReflect…

Programming Languages · Computer Science 2019-07-03 Reynald Affeldt , Jacques Garrigue , Xuanrui Qi , Kazunari Tanaka

In many modern applications, including analysis of gene expression and text documents, the data are noisy, high-dimensional, and unordered--with no particular meaning to the given order of the variables. Yet, successful learning is often…

Methodology · Statistics 2008-07-25 Ann B. Lee , Boaz Nadler , Larry Wasserman

A large fraction of an XML document typically consists of text data. The XPath query language allows text search via the equal, contains, and starts-with predicates. Such predicates can efficiently be implemented using a compressed…

Databases · Computer Science 2011-10-06 A. Arroyuelo , F. Claude , S. Maneth , V. Mäkinen , G. Navarro , K. Nguyen , J. Siren , N. Välimäki