English
Related papers

Related papers: Relative Select

200 papers

We describe a data structure that stores a string $S$ in space similar to that of its Lempel-Ziv encoding and efficiently supports access, rank and select queries. These queries are fundamental for implementing succinct and compressed data…

Data Structures and Algorithms · Computer Science 2014-12-03 Djamal Belazzougui , Travis Gagie , Paweł Gawrychowski , Juha Kärkkäinen , Alberto Ordóñez , Simon J. Puglisi , Yasuo Tabei

In this article, we show how to transform a colored de Bruijn graph (dBG) into a practical index for processing massive sets of sequencing reads. Similar to previous works, we encode an instance of a colored dBG of the set using BOSS and a…

Data Structures and Algorithms · Computer Science 2019-12-02 Diego Diaz-Domínguez

We introduce the notion of de Bruijn entropy of an Eulerian quiver and show how the corresponding relative entropy can be applied to practical string similarity problems. This approach explicitly links the combinatorial and…

Discrete Mathematics · Computer Science 2022-01-24 Steve Huntsman , Arman Rezaee

The problem of storing a set of strings --- a string dictionary --- in compact form appears naturally in many cases. While classically it has represented a small part of the whole data to be processed (e.g., for Natural Language processing…

Data Structures and Algorithms · Computer Science 2011-01-31 Nieves R. Brisaboa , Rodrigo Cánovas , Miguel A. Martínez-Prieto , Gonzalo Navarro

Most of the fastest-growing string collections today are repetitive, that is, most of the constituent documents are similar to many others. As these collections keep growing, a key approach to handling them is to exploit their…

Information Retrieval · Computer Science 2017-05-22 Travis Gagie , Aleksi Hartikainen , Kalle Karhu , Juha Kärkkäinen , Gonzalo Navarro , Simon J. Puglisi , Jouni Sirén

Let G be a graph with a list assignment L. Suppose a preferred color is given for some of the vertices; how many of these preferences can be respected when L-coloring G? We explore several natural questions arising in this context, and…

Combinatorics · Mathematics 2018-11-20 Zdeněk Dvořák , Sergey Norin , Luke Postle

A 'degenerate string' is a sequence of subsets of some alphabet; it represents any string obtainable by selecting one character from each set from left to right. Recently, Alanko et al. generalized the rank-select problem to degenerate…

Data Structures and Algorithms · Computer Science 2023-12-05 Philip Bille , Inge Li Gørtz , Tord Stordalen

We present a space- and time-efficient fully dynamic implementation de Bruijn graphs, which can also support fixed-length jumbled pattern matching.

Data Structures and Algorithms · Computer Science 2016-07-21 Djamal Belazzougui , Travis Gagie , Veli Mäkinen , Marco Previtali

Pangenomes serve as a framework for joint analysis of genomes of related organisms. Several pangenome models were proposed, offering different functionalities, applications provided by available tools, their efficiency etc. Among them, two…

Genomics · Quantitative Biology 2025-03-20 Adam Cicherski , Norbert Dojer

Let P be a set of n points and each of the points is colored with one of the k possible colors. We present efficient algorithms to pre-process P such that for a given query point q, we can quickly identify the smallest color spanning object…

Computational Geometry · Computer Science 2019-05-14 Ankush Acharyya , Anil Maheshwari , Subhas C. Nandy

In this paper, we consider the problem of identifying patterns of interest in colored strings. A colored string is a string where each position is assigned one of a finite set of colors. Our task is to find substrings of the colored string…

Data Structures and Algorithms · Computer Science 2024-04-16 Zsuzsanna Lipták , Simon J. Puglisi , Massimiliano Rossi

Relevance Models are well-known retrieval models and capable of producing competitive results. However, because they use query expansion they can be very slow. We address this slowness by incorporating two variants of locality sensitive…

Information Retrieval · Computer Science 2016-07-12 Dominik Wurzer , Miles Osborne , Victor Lavrenko

Rank and select queries on bitmaps are essential building bricks of many compressed data structures, including text indexes, membership and range supporting spatial data structures, compressed graphs, and more. Theoretically considered yet…

Data Structures and Algorithms · Computer Science 2016-05-13 Szymon Grabowski , Marcin Raniszewski

In modern ranking problems, different and disparate representations of the items to be ranked are often available. It is sensible, then, to try to combine these representations to improve ranking. Indeed, learning to rank via combining…

We consider compact representations of collections of similar strings that support random access queries. The collection of strings is given by a rooted tree where edges are labeled by an edit operation (inserting, deleting, or replacing a…

Data Structures and Algorithms · Computer Science 2021-02-12 Philip Bille , Inge Li Gørtz

A graph is $\ell$-choosable if, for any choice of lists of $\ell$ colors for each vertex, there is a list coloring, which is a coloring where each vertex receives a color from its list. We study complexity issues of choosability of graphs…

Discrete Mathematics · Computer Science 2017-08-14 Marc Demange , Dominique de Werra

We prove that sparse string graphs in a fixed surface have linear expansion. We extend this result to the more general setting of sparse region intersection graphs over any proper minor-closed class. The proofs are combinatorial and…

Combinatorics · Mathematics 2026-04-10 Nikolai Karol , David R. Wood

De novo DNA assembly is a fundamental task in Bioinformatics, and finding Eulerian paths on de Bruijn graphs is one of the dominant approaches to it. In most of the cases, there may be no one order for the de Bruijn graph that works well…

Data Structures and Algorithms · Computer Science 2018-05-15 Diego Díaz-Domínguez , Djamal Belazzougui , Travis Gagie , Veli Mäkinen , Gonzalo Navarro , Simon J. Puglisi

The main challenge in de novo assembly of NGS data is certainly to deal with repeats that are longer than the reads. This is particularly true for RNA- seq data, since coverage information cannot be used to flag repeated sequences, of which…

Data Structures and Algorithms · Computer Science 2014-06-05 Gustavo Sacomoto , Blerina Sinaimeri , Camille Marchet , Vincent Miele , Marie-France Sagot , Vincent Lacroix

Given a static reference string $R$ and a source string $S$, a relative compression of $S$ with respect to $R$ is an encoding of $S$ as a sequence of references to substrings of $R$. Relative compression schemes are a classic model of…

Data Structures and Algorithms · Computer Science 2016-09-19 Philip Bille , Patrick Hagge Cording , Inge Li Gørtz , Frederik Rye Skjoldjensen , Hjalte Wedel Vildhøj , Søren Vind
‹ Prev 1 2 3 10 Next ›