Related papers: Learning XML Twig Queries

Learning Schemas for Unordered XML

We consider unordered XML, where the relative order among siblings is ignored, and we investigate the problem of learning schemas from examples given by the user. We focus on the schema formalisms proposed in [10]: disjunctive multiplicity…

Databases · Computer Science 2013-07-26 Radu Ciucanu , Slawek Staworko

A Survey of XML Tree Patterns

With XML becoming an ubiquitous language for data interoperability purposes in various domains, efficiently querying XML data is a critical issue. This has lead to the design of algebraic frameworks based on tree-shaped patterns akin to the…

Databases · Computer Science 2017-01-18 Marouane Hachicha , Jérôme Darmont

A Tree Pattern Matching Algorithm for XML Queries with Structural Preferences

In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or…

Databases · Computer Science 2019-06-10 Maurice Tchoupé Tchendji , Lionel Tadonfouet , Thomas Tébougang Tchendji

A Grammatical Inference Approach to Language-Based Anomaly Detection in XML

False-positives are a problem in anomaly-based intrusion detection systems. To counter this issue, we discuss anomaly detection for the eXtensible Markup Language (XML) in a language-theoretic view. We argue that many XML-based attacks…

Cryptography and Security · Computer Science 2013-11-13 Harald Lampesberger

Learning definable hypotheses on trees

We study the problem of learning properties of nodes in tree structures. Those properties are specified by logical formulas, such as formulas from first-order or monadic second-order logic. We think of the tree as a database encoding a…

Logic in Computer Science · Computer Science 2019-09-25 Emilie Grienenberger , Martin Ritzert

XPath Node Selection over Grammar-Compressed Trees

XML document markup is highly repetitive and therefore well compressible using grammar-based compression. Downward, navigational XPath can be executed over grammar-compressed trees in PTIME: the query is translated into an automaton which…

Databases · Computer Science 2013-11-25 Sebastian Maneth , Tom Sebastian

Structured Learning from Partial Annotations

Structured learning is appropriate when predicting structured outputs such as trees, graphs, or sequences. Most prior work requires the training set to consist of complete trees, graphs or sequences. Specifying such detailed ground truth…

Machine Learning · Computer Science 2012-07-03 Xinghua Lou , Fred Hamprecht

Provably Learning Attention with Queries

We study the problem of learning Transformer-based sequence models with black-box access to their outputs. In this setting, a learner may adaptively query the oracle with any sequence of vectors and observe the output of the target…

Machine Learning · Computer Science 2026-05-05 Satwik Bhattamishra , Kulin Shah , Michael Hahn , Varun Kanade

Learning Latent Tree Graphical Models

We study the problem of learning a latent tree graphical model where samples are available only from a subset of variables. We propose two consistent and computationally efficient algorithms for learning minimal latent trees, that is, trees…

Machine Learning · Statistics 2010-09-15 Myung Jin Choi , Vincent Y. F. Tan , Animashree Anandkumar , Alan S. Willsky

Mining patterns in syntax trees to automate code reviews of student solutions for programming exercises

In programming education, providing manual feedback is essential but labour-intensive, posing challenges in consistency and timeliness. We introduce ECHO, a machine learning method to automate the reuse of feedback in educational code…

Software Engineering · Computer Science 2024-05-06 Charlotte Van Petegem , Kasper Demeyere , Rien Maertens , Niko Strijbol , Bram De Wever , Bart Mesuere , Peter Dawyndt

An Analysis of Active Learning Algorithms using Real-World Crowd-sourced Text Annotations

Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data and tremendously reduce human annotation effort in inducing a machine learning model. In a conventional active learning…

Machine Learning · Computer Science 2026-04-28 Varun Totakura , Ankita Singh , Yushun Dong , Shayok Chakraborty

A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

The usefulness of annotated corpora is greatly increased if there is an associated tool that can allow various kinds of operations to be performed in a simple way. Different kinds of annotation frameworks and many query languages for them…

Computation and Language · Computer Science 2011-08-10 Anil Kumar Singh

The complexity of unsupervised learning of lexicographic preferences

This paper considers the task of learning users' preferences on a combinatorial set of alternatives, as generally used by online configurators, for example. In many settings, only a set of selected alternatives during past interactions is…

Artificial Intelligence · Computer Science 2022-09-26 Hélène Fargier , Pierre-François Gimenez , Jérôme Mengin , Bao Ngoc Le Nguyen

Learning Fast Matching Models from Weak Annotations

This paper proposes a novel training scheme for fast matching models in Search Ads, which is motivated by the real challenges in model training. The first challenge stems from the pursuit of high throughput, which prohibits the deployment…

Information Retrieval · Computer Science 2019-04-23 Xue Li , Zhipeng Luo , Hao Sun , Jianjin Zhang , Weihao Han , Xianqi Chu , Liangjie Zhang , Qi Zhang

Adaptive Hierarchical Clustering Using Ordinal Queries

In many applications of clustering (for example, ontologies or clusterings of animal or plant species), hierarchical clusterings are more descriptive than a flat clustering. A hierarchical clustering over $n$ elements is represented by a…

Data Structures and Algorithms · Computer Science 2018-04-18 Ehsan Emamjomeh-Zadeh , David Kempe

Meta Decision Trees for Explainable Recommendation Systems

We tackle the problem of building explainable recommendation systems that are based on a per-user decision tree, with decision rules that are based on single attribute values. We build the trees by applying learned regression functions to…

Machine Learning · Computer Science 2019-12-20 Eyal Shulman , Lior Wolf

Invariance, encodings, and generalization: learning identity effects with neural networks

Often in language and other areas of cognition, whether two components of an object are identical or not determines if it is well formed. We call such constraints identity effects. When developing a system to learn well-formedness from…

Machine Learning · Computer Science 2022-03-03 S. Brugiapaglia , M. Liu , P. Tupper

Flattening a Hierarchical Clustering through Active Learning

We investigate active learning by pairwise similarity over the leaves of trees originating from hierarchical clustering procedures. In the realizable setting, we provide a full characterization of the number of queries needed to achieve…

Machine Learning · Computer Science 2019-10-15 Fabio Vitale , Anand Rajagopalan , Claudio Gentile

Online Learning of Optimally Diverse Rankings

Search engines answer users' queries by listing relevant items (e.g. documents, songs, products, web pages, ...). These engines rely on algorithms that learn to rank items so as to present an ordered list maximizing the probability that it…

Machine Learning · Computer Science 2021-09-14 Stefan Magureanu , Alexandre Proutiere , Marcus Isaksson , Boxun Zhang

Learning Partitions with Optimal Query and Round Complexities

We consider the basic problem of learning an unknown partition of $n$ elements into at most $k$ sets using simple queries that reveal information about a small subset of elements. Our starting point is the well-studied pairwise same-set…

Data Structures and Algorithms · Computer Science 2025-06-24 Hadley Black , Arya Mazumdar , Barna Saha