English
Related papers

Related papers: An External-Memory Algorithm for String Graph Cons…

200 papers

Indexing very large collections of strings, such as those produced by the widespread next generation sequencing technologies, heavily relies on multistring generalization of the Burrows-Wheeler Transform (BWT): large requirements of…

Data Structures and Algorithms · Computer Science 2020-12-07 Paola Bonizzoni , Gianluca Della Vedova , Yuri Pirola , Marco Previtali , Raffaella Rizzi

We present a new scalable, lightweight algorithm to incrementally construct the BWT and FM-index of large string sets such as those produced by Next Generation Sequencing. The algorithm is designed for massive parallelism and can…

Data Structures and Algorithms · Computer Science 2014-10-03 Jacopo Pantaleoni

Constructing the Burrows-Wheeler transform (BWT) for long strings poses significant challenges regarding construction time and memory usage. We use a prefix of the suffix array to partition a long string into shorter substrings, thereby…

Data Structures and Algorithms · Computer Science 2025-05-15 Enno Adler , Stefan Böttcher , Rita Hartel

Indexing of very large collections of strings such as those produced by the widespread sequencing technologies, heavily relies on multi-string generalizations of the Burrows-Wheeler Transform (BWT), and for this problem various in-memory…

Data Structures and Algorithms · Computer Science 2016-07-29 Paola Bonizzoni , Gianluca Della Vedova , Serena Nicosia , Marco Previtali , Raffaella Rizzi

The Burrows-Wheeler Transform (BWT) serves as the basis for many important sequence indexes. On very large datasets (e.g. genomic databases), classical BWT construction algorithms are often infeasible because they usually need to have the…

Data Structures and Algorithms · Computer Science 2025-09-24 Jannik Olbrich

We present an algorithm for building the extended BWT (eBWT) of a string collection from its grammar-compressed representation. Our technique exploits the string repetitions captured by the grammar to boost the computation of the eBWT.…

Genomics · Quantitative Biology 2021-02-10 Diego Diaz-Dominguez annd Gonzalo Navarro

The Burrows Wheeler transform has applications in data compression as well as full text indexing. Despite its important applications and various existing algorithmic approaches the construction of the transform for large data sets is still…

Data Structures and Algorithms · Computer Science 2016-04-25 German Tischler

The Burrows-Wheeler transform (BWT) is a well studied text transformation widely used in data compression and text indexing. The BWT of two strings can also provide similarity measures between them, based on the observation that the more…

Data Structures and Algorithms · Computer Science 2020-09-10 Felipe A. Louza , Guilherme P. Telles , Simon Gog , Liang Zhao

The Burrows-Wheeler transform (BWT) is integral to the FM-index, which is used extensively in text compression, indexing, pattern search, and bioinformatic problems as de novo assembly and read alignment. Thus, efficient construction of the…

Data Structures and Algorithms · Computer Science 2025-02-04 Enno Adler , Stefan Böttcher , Rita Hartel , Cederic Alexander Steininger

Due to the exponential growth of genomic data, constructing dedicated data structures has become the principal bottleneck in common bioinformatics applications. In particular, the Burrows-Wheeler Transform (BWT) is the basis of some of the…

Data Structures and Algorithms · Computer Science 2023-05-15 Francesco Masillo

We present a new semi-external algorithm that builds the Burrows--Wheeler transform variant of Bauer et al. (a.k.a., BCR BWT) in linear expected time. Our method uses compression techniques to reduce computational costs when the input is…

Data Structures and Algorithms · Computer Science 2023-08-15 Diego Díaz-Domínguez , Gonzalo Navarro

Introduced about thirty years ago in the field of Data Compression, the Burrows-Wheeler Transform (BWT) is a string transformation that, besides being a booster of the performance of memoryless compressors, plays a fundamental role in the…

Data Structures and Algorithms · Computer Science 2023-05-09 Raffaele Giancarlo , Giovanni Manzini , Antonio Restivo , Giovanna Rosone , Marinella Sciortino

In order to avoid the reference bias introduced by mapping reads to a reference genome, bioinformaticians are investigating reference-free methods for analyzing sequenced genomes. With large projects sequencing thousands of individuals,…

Data Structures and Algorithms · Computer Science 2016-01-15 Jouni Sirén

Burrows-Wheeler transform (BWT) is an invertible text transformation that, given a text $T$ of length $n$, permutes its symbols according to the lexicographic order of suffixes of $T$. BWT is one of the most heavily studied algorithms in…

Data Structures and Algorithms · Computer Science 2020-12-09 Dominik Kempa , Tomasz Kociumaka

Detecting all the strings that occur in a text more frequently or less frequently than expected according to an IID or a Markov model is a basic problem in string mining, yet current algorithms are based on data structures that are either…

Data Structures and Algorithms · Computer Science 2015-08-13 Djamal Belazzougui , Fabio Cunial

In this paper, we present a number of network-analysis algorithms in the external-memory model. We focus on methods for large naturally sparse graphs, that is, n-vertex graphs that have O(n) edges and are structured so that this sparsity…

Data Structures and Algorithms · Computer Science 2011-07-01 Michael T. Goodrich , Pawel Pszona

We propose an external memory algorithm for the computation of the BWT and LCP array for a collection of sequences. Our algorithm takes the amount of available memory as an input parameter, and tries to make the best use of it by splitting…

Data Structures and Algorithms · Computer Science 2020-09-10 Lavinia Egidi , Felipe A. Louza , Giovanni Manzini , Guilherme P. Telles

The boom of genomic sequencing makes compression of set of sequences inescapable. This underlies the need for multi-string indexing data structures that helps compressing the data. The most prominent example of such data structures is the…

Data Structures and Algorithms · Computer Science 2021-11-18 Bastien Cazaux , Eric Rivals

The string graph for a collection of next-generation reads is a lossless data representation that is fundamental for de novo assemblers based on the overlap-layout-consensus paradigm. In this paper, we explore a novel approach to compute…

Data Structures and Algorithms · Computer Science 2017-05-30 Paola Bonizzoni , Gianluca Della Vedova , Yuri Pirola , Marco Previtali , Raffaella Rizzi

This thesis presents a number of results related to path traversal in trees and graphs. In particular, we focus on data structures which allow such traversals to be performed efficiently in the external memory setting. In addition, for…

Data Structures and Algorithms · Computer Science 2013-08-22 Craig Dillabaugh
‹ Prev 1 2 3 10 Next ›