English
Related papers

Related papers: High-arity Sample Compression

200 papers

We develop a theory of high-arity PAC learning, which is statistical learning in the presence of "structured correlation". In this theory, hypotheses are either graphs, hypergraphs or, more generally, structures in finite relational…

Machine Learning · Computer Science 2024-09-18 Leonardo N. Coregliano , Maryanthe Malliaris

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. Roughly speaking, a sample compression scheme of size $k$ means that given an arbitrary list…

Machine Learning · Computer Science 2015-04-15 Shay Moran , Amir Yehudayoff

Recently, the authors introduced the theory of high-arity PAC learning, which is well-suited for learning graphs, hypergraphs and relational structures. In the same initial work, the authors proved a high-arity analogue of the Fundamental…

Machine Learning · Computer Science 2025-05-22 Leonardo N. Coregliano , Maryanthe Malliaris

Learning distribution families over $\mathbb{R}^d$ is a fundamental problem in unsupervised learning and statistics. A central question in this setting is whether a given family of distributions possesses sufficient structure to be (at…

Machine Learning · Statistics 2025-06-10 Arefe Boushehrian , Amir Najafi

The aim of this note is to overview some of our work in Chernikov, Towsner'20 (arXiv:2010.00726) developing higher arity VC theory (VC$_n$ dimension), including a generalization of Haussler packing lemma, and an associated tame (slice-wise)…

Machine Learning · Statistics 2025-10-16 Artem Chernikov , Henry Towsner

In this paper, we consider the problem of replicable realizable PAC learning. We construct a particularly hard learning problem and show a sample complexity lower bound with a close to $(\log|H|)^{3/2}$ dependence on the size of the…

Machine Learning · Computer Science 2026-02-24 Kasper Green Larsen , Markus Engelund Mathiasen , Chirag Pabbaraju , Clement Svendsen

List learning is a variant of supervised classification where the learner outputs multiple plausible labels for each instance rather than just one. We investigate classical principles related to generalization within the context of list…

Machine Learning · Computer Science 2026-03-24 Steve Hanneke , Shay Moran , Tom Waknine

The standard definition of PAC learning (Valiant 1984) requires learners to succeed under all distributions -- even ones that are intractable to sample from. This stands in contrast to samplable PAC learning (Blum, Furst, Kearns, and Lipton…

Computational Complexity · Computer Science 2025-12-02 Guy Blanc , Caleb Koch , Jane Lange , Carmen Strassle , Li-Yang Tan

This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing…

Machine Learning · Computer Science 2017-01-02 Ofir David , Shay Moran , Amir Yehudayoff

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

We present a formal proof in Lean of probably approximately correct (PAC) learnability of the concept class of decision stumps. This classic result in machine learning theory derives a bound on error probabilities for a simple type of…

Machine Learning · Computer Science 2021-01-11 Joseph Tassarotti , Koundinya Vajjha , Anindya Banerjee , Jean-Baptiste Tristan

We present novel reductions from sample compression schemes in multiclass classification, regression, and adversarially robust learning settings to binary sample compression schemes. Assuming we have a compression scheme for binary classes…

Machine Learning · Computer Science 2025-04-09 Idan Attias , Steve Hanneke , Arvind Ramaswami

A hypothesis class admits a sample compression scheme, if for every sample labeled by a hypothesis from the class, it is possible to retain only a small subsample, using which the labels on the entire sample can be inferred. The size of the…

Machine Learning · Computer Science 2023-09-22 Chirag Pabbaraju

We study robustness to test-time adversarial attacks in the regression setting with $\ell_p$ losses and arbitrary perturbation sets. We address the question of which function classes are PAC learnable in this setting. We show that classes…

Machine Learning · Computer Science 2024-05-07 Idan Attias , Steve Hanneke

Probably Approximately Correct (i.e., PAC) learning is a core concept of sample complexity theory, and efficient PAC learnability is often seen as a natural counterpart to the class P in classical computational complexity. But while the…

Computational Complexity · Computer Science 2023-04-28 Cornelius Brand , Robert Ganian , Kirill Simonov

We introduce a new and improved characterization of the label complexity of disagreement-based active learning, in which the leading quantity is the version space compression set size. This quantity is defined as the size of the smallest…

Machine Learning · Computer Science 2014-04-08 Yair Wiener , Steve Hanneke , Ran El-Yaniv

Adversarially robust PAC learning has proved to be challenging, with the currently best known learners [Montasser et al., 2021a] relying on improper methods based on intricate compression schemes, resulting in sample complexity exponential…

Machine Learning · Computer Science 2025-02-12 Hassan Ashtiani , Vinayak Pathak , Ruth Urner

Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such…

Machine Learning · Computer Science 2022-03-16 Andrew J. Turner , Ata Kabán

This work establishes a new upper bound on the number of samples sufficient for PAC learning in the realizable case. The bound matches known lower bounds up to numerical constant factors. This solves a long-standing open problem on the…

Machine Learning · Computer Science 2016-09-13 Steve Hanneke

We analyze a family of supervised learning algorithms based on sample compression schemes that are stable, in the sense that removing points from the training set which were not selected for the compression set does not alter the resulting…

Machine Learning · Computer Science 2020-11-10 Steve Hanneke , Aryeh Kontorovich
‹ Prev 1 2 3 10 Next ›