Related papers: Learning Schemas for Unordered XML

Simple Schemas for Unordered XML

We consider unordered XML, where the relative order among siblings is ignored, and propose two simple yet practical schema formalisms: disjunctive multiplicity schemas (DMS), and its restriction, disjunction-free multiplicity schemas (MS).…

Databases · Computer Science 2013-06-21 Iovka Boneva, Radu Ciucanu, Slawek Staworko

Learning XML Twig Queries

We investigate the problem of learning XML queries, path queries and tree pattern queries, from examples given by the user. A learning algorithm takes as input a set of XML documents with nodes annotated by the user and returns a query…

Databases · Computer Science 2012-04-24 Sławomir Staworko, Piotr Wieczorek

Discovering Restricted Regular Expressions with Interleaving

Discovering a concise schema from given XML documents is an important problem in XML applications. In this paper, we focus on the problem of learning an unordered schema from a given set of XML examples, which is actually a problem of…

Databases · Computer Science 2015-04-02 Feifei Peng, Haiming Chen

On Learning Sets of Symmetric Elements

Learning from unordered sets is a fundamental learning setup, recently attracting increasing attention. Research in this area has focused on the case where elements of the set are represented by feature vectors, and far less emphasis has…

Machine Learning · Computer Science 2020-12-01 Haggai Maron, Or Litany, Gal Chechik, Ethan Fetaya

Schemas for Unordered XML on a DIME

We investigate schema languages for unordered XML having no relative order among siblings. First, we propose unordered regular expressions (UREs), essentially regular expressions with unordered concatenation instead of standard…

Databases · Computer Science 2014-10-29 Iovka Boneva, Radu Ciucanu, Sławek Staworko

A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty

Driven by privacy protection laws and regulations, unlearning in Large Language Models (LLMs) is gaining increasing attention. However, current research often neglects the interpretability of the unlearning process, particularly concerning…

Machine Learning · Computer Science 2025-04-10 Xiaohua Feng, Yuyuan Li, Chengye Wang, Junlin Liu, Li Zhang, Chaochao Chen

Shaping the learning landscape in neural networks around wide flat minima

Learning in Deep Neural Networks (DNN) takes place by minimizing a non-convex high-dimensional loss function, typically by a stochastic gradient descent (SGD) strategy. The learning process is observed to be able to find good minimizers…

Machine Learning · Computer Science 2020-03-12 Carlo Baldassi, Fabrizio Pittorino, Riccardo Zecchina

When Do Curricula Work?

Inspired by human learning, researchers have proposed ordering examples during training based on their difficulty. Both curriculum learning, exposing a network to easier examples early in training, and anti-curriculum learning, showing the…

Machine Learning · Computer Science 2021-02-10 Xiaoxia Wu, Ethan Dyer, Behnam Neyshabur

Learning Languages in the Limit from Positive Information with Finitely Many Memory Changes

We investigate learning collections of languages from texts by an inductive inference machine with access to the current datum and a bounded memory in form of states. Such a bounded memory states (BMS) learner is considered successful in…

Formal Languages and Automata Theory · Computer Science 2021-06-18 Timo Kötzing, Karen Seidel

Memory-Sample Lower Bounds for Learning Parity with Noise

In this work, we show, for the well-studied problem of learning parity under noise, where a learner tries to learn $x=(x_1,\ldots,x_n) \in \{0,1\}^n$ from a stream of random linear equations over $\mathrm{F}_2$ that are correct with…

Machine Learning · Computer Science 2021-07-07 Sumegha Garg, Pravesh K. Kothari, Pengda Liu, Ran Raz

Learning from networked examples

Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may…

Artificial Intelligence · Computer Science 2017-06-06 Yuyi Wang, Jan Ramon, Zheng-Chu Guo

Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method

Machine unlearning aims to solve the problem of removing the influence of selected training examples from a learned model. Despite the increasing attention to this problem, it remains an open research question how to evaluate unlearning in…

Machine Learning · Computer Science 2024-11-08 Teodora Baluta, Pascal Lamblin, Daniel Tarlow, Fabian Pedregosa, Gintare Karolina Dziugaite

Towards Interpretable Deep Extreme Multi-label Learning

Many Machine Learning algorithms, such as deep neural networks, have long been criticized for being "black boxes": models that cannot show how they arrive at a decision without further interpretive effort. This problem has raised…

Machine Learning · Statistics 2019-07-04 Yihuang Kang, I-Ling Cheng, Wenjui Mao, Bowen Kuo, Pei-Ju Lee

Limit Learning Equivalence Structures

While most research in Gold-style learning focuses on learning formal languages, we consider the identification of computable structures, specifically equivalence structures. In our core model the learner gets more and more information…

Logic · Mathematics 2019-02-22 Ekaterina Fokina, Timo Kötzing, Luca San Mauro

Data Distribution-based Curriculum Learning

The order of training samples can have a significant impact on the performance of a classifier. Curriculum learning is a method of ordering training samples from easy to hard. This paper proposes the novel idea of a curriculum learning…

Machine Learning · Computer Science 2024-11-12 Shonal Chaudhry, Anuraganand Sharma

Remember What You Want to Forget: Algorithms for Machine Unlearning

We study the problem of unlearning datapoints from a learnt model. The learner first receives a dataset $S$ drawn i.i.d. from an unknown distribution, and outputs a model $\widehat{w}$ that performs well on unseen samples from the same…

Machine Learning · Computer Science 2021-07-23 Ayush Sekhari, Jayadev Acharya, Gautam Kamath, Ananda Theertha Suresh

Learning from Untrusted Data

The vast majority of theoretical results in machine learning and statistics assume that the available training data is a reasonably reliable reflection of the phenomena to be learned or estimated. Similarly, the majority of machine learning…

Machine Learning · Computer Science 2017-06-13 Moses Charikar, Jacob Steinhardt, Gregory Valiant

Learning to learn by gradient descent by gradient descent

The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimization algorithms are still designed by hand. In this paper we show how the design of an optimization algorithm…

Neural and Evolutionary Computing · Computer Science 2016-12-01 Marcin Andrychowicz, Misha Denil, Sergio Gomez, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas

Strategic Data Ordering: Enhancing Large Language Model Performance through Curriculum Learning

The rapid advancement of Large Language Models (LLMs) has improved text understanding and generation but poses challenges in computational resources. This study proposes a curriculum learning-inspired, data-centric training strategy that…

Computation and Language · Computer Science 2024-05-14 Jisu Kim, Juhwan Lee

Offset Unlearning for Large Language Models

Despite the strong capabilities of Large Language Models (LLMs) to acquire knowledge from their training corpora, the memorization of sensitive information in the corpora such as copyrighted, biased, and private content has led to ethical…

Computation and Language · Computer Science 2025-05-29 James Y. Huang, Wenxuan Zhou, Fei Wang, Fred Morstatter, Sheng Zhang, Hoifung Poon, Muhao Chen