Related papers: XPath Whole Query Optimization

Fast In-Memory XPath Search over Compressed Text and Tree Indexes

A large fraction of an XML document typically consists of text data. The XPath query language allows text search via the equal, contains, and starts-with predicates. Such predicates can efficiently be implemented using a compressed…

Databases · Computer Science 2011-10-06 A. Arroyuelo , F. Claude , S. Maneth , V. Mäkinen , G. Navarro , K. Nguyen , J. Siren , N. Välimäki

XPath Node Selection over Grammar-Compressed Trees

XML document markup is highly repetitive and therefore well compressible using grammar-based compression. Downward, navigational XPath can be executed over grammar-compressed trees in PTIME: the query is translated into an automaton which…

Databases · Computer Science 2013-11-25 Sebastian Maneth , Tom Sebastian

Fast and Tiny Structural Self-Indexes for XML

XML document markup is highly repetitive and therefore well compressible using dictionary-based methods such as DAGs or grammars. In the context of selectivity estimation, grammar-compressed trees were used before as synopsis for structural…

Databases · Computer Science 2010-12-30 Sebastian Maneth , Tom Sebastian

Optimizing XML querying using type-based document projection

XML data projection (or pruning) is a natural optimization for main memory query engines: given a query Q over a document D, the subtrees of D that are not necessary to evaluate Q are pruned, thus producing a smaller document D'; the query…

Databases · Computer Science 2015-03-19 Véronique Benzaken , Giuseppe Castagna , Dario Colazzo , Kim Nguyen

An extension of data automata that captures XPath

We define a new kind of automata recognizing properties of data words or data trees and prove that the automata capture all queries definable in Regular XPath. We show that the automata-theoretic approach may be applied to answer…

Logic in Computer Science · Computer Science 2015-07-01 Mikołaj Bojańczyk , Sławomir Lasota

Bonsai: Compiling Queries to Pruned Tree Traversals

Trees can accelerate queries that search or aggregate values over large collections. They achieve this by storing metadata that enables quick pruning (or inclusion) of subtrees when predicates on that metadata can prove that none (or all)…

Programming Languages · Computer Science 2026-04-23 Alexander J Root , Christophe Gyurgyik , Purvi Goel , Kayvon Fatahalian , Jonathan Ragan-Kelley , Andrew Adams , Fredrik Kjolstad

Reduction of Nondeterministic Tree Automata

We present an efficient algorithm to reduce the size of nondeterministic tree automata, while retaining their language. It is based on new transition pruning techniques, and quotienting of the state space w.r.t. suitable equivalences. It…

Formal Languages and Automata Theory · Computer Science 2016-01-07 Ricardo Almeida , Lukáš Holík , Richard Mayr

Low-Latency Sliding Window Connectivity

Connectivity queries, which check whether vertices belong to the same connected component, are fundamental in graph computations. Sliding window connectivity processes these queries over sliding windows, facilitating real-time streaming…

Databases · Computer Science 2025-01-07 Chao Zhang , Angela Bonifati , Tamer Özsu

Fixpoint Node Selection Query Languages for Trees

The study of node selection query languages for (finite) trees has been a major topic in the recent research on query languages for Web documents. On one hand, there has been an extensive study of XPath and its various extensions. On the…

Databases · Computer Science 2018-11-15 Diego Calvanese , Giuseppe De Giacomo , Maurizio Lenzerini , Moshe Y. Vardi

Near Optimal Decision Trees in a SPLIT Second

Decision tree optimization is fundamental to interpretable machine learning. The most popular approach is to greedily search for the best feature at every decision point, which is fast but provably suboptimal. Recent approaches find the…

Machine Learning · Computer Science 2025-11-19 Varun Babbar , Hayden McTavish , Cynthia Rudin , Margo Seltzer

Alternating Automata on Data Trees and XPath Satisfiability

A data tree is an unranked ordered tree whose every node is labelled by a letter from a finite alphabet and an element ("datum") from an infinite set, where the latter can only be compared for equality. The article considers alternating…

Logic in Computer Science · Computer Science 2010-06-15 Marcin Jurdzinski , Ranko Lazic

XTreePath: A generalization of XPath to handle real world structural variation

We discuss a key problem in information extraction which deals with wrapper failures due to changing content templates. A good proportion of wrapper failures are due to HTML templates changing to cause wrappers to become incompatible after…

Information Retrieval · Computer Science 2017-12-29 Joseph Paul Cohen , Wei Ding , Abraham Bagherjeiran

Bottom-up automata on data trees and vertical XPath

A data tree is a finite tree whose every node carries a label from a finite alphabet and a datum from some infinite domain. We introduce a new model of automata over unranked data trees with a decidable emptiness problem. It is essentially…

Databases · Computer Science 2023-06-22 Diego Figueira , Luc Segoufin

XQuery Streaming by Forest Transducers

Streaming of XML transformations is a challenging task and only very few systems support streaming. Research approaches generally define custom fragments of XQuery and XPath that are amenable to streaming, and then design custom algorithms…

Databases · Computer Science 2016-11-17 Shizuya Hakuta , Sebastian Maneth , Keisuke Nakano , Hideya Iwasaki

XML Reconstruction View Selection in XML Databases: Complexity Analysis and Approximation Scheme

Query evaluation in an XML database requires reconstructing XML subtrees rooted at nodes found by an XML query. Since XML subtree reconstruction can be expensive, one approach to improve query response time is to use reconstruction views -…

Data Structures and Algorithms · Computer Science 2015-05-19 Artem Chebotko , Bin Fu

Efficient Indexing and Querying over Syntactically Annotated Trees

Natural language text corpora are often available as sets of syntactically parsed trees. A wide range of expressive tree queries are possible over such parsed trees that open a new avenue in searching over natural language text. They not…

Databases · Computer Science 2012-08-02 Pirooz Chubak , Davood Rafiei

XML Navigation and Transformation by Tree-Walking Automata and Transducers with Visible and Invisible Pebbles

The pebble tree automaton and the pebble tree transducer are enhanced by additionally allowing an unbounded number of "invisible" pebbles (as opposed to the usual "visible" ones). The resulting pebble tree automata recognize the regular…

Formal Languages and Automata Theory · Computer Science 2018-09-18 Joost Engelfriet , Hendrik Jan Hoogeboom , Bart Samwel

Efficient Incremental Breadth-Depth XML Event Mining

Many applications log a large amount of events continuously. Extracting interesting knowledge from logged events is an emerging active research area in data mining. In this context, we propose an approach for mining frequent events and…

Databases · Computer Science 2011-10-17 Rashed Salem , Jérôme Darmont , Omar Boussaïd

A Survey of XML Tree Patterns

With XML becoming an ubiquitous language for data interoperability purposes in various domains, efficiently querying XML data is a critical issue. This has lead to the design of algebraic frameworks based on tree-shaped patterns akin to the…

Databases · Computer Science 2017-01-18 Marouane Hachicha , Jérôme Darmont

Runtime Optimizations for Prediction with Tree-Based Models

Tree-based models have proven to be an effective solution for web ranking as well as other problems in diverse domains. This paper focuses on optimizing the runtime performance of applying such models to make predictions, given an…

Databases · Computer Science 2013-04-29 Nima Asadi , Jimmy Lin , Arjen P. de Vries