Related papers: Sample compression schemes for VC classes

Optimally compressing VC classes

Resolving a conjecture of Littlestone and Warmuth, we show that any concept class of VC-dimension $d$ has a sample compression scheme of size $d$.

Machine Learning · Computer Science 2022-01-14 Zachary Chase

Sample compression schemes for balls in structurally sparse graphs

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. In a sample compression scheme, we are given a large sample of vertices of a fixed hypergraph…

Discrete Mathematics · Computer Science 2026-04-06 Romain Bourneuf , Jędrzej Hodor , Piotr Micek , Clément Rambaud

Sample Compression Scheme Reductions

We present novel reductions from sample compression schemes in multiclass classification, regression, and adversarially robust learning settings to binary sample compression schemes. Assuming we have a compression scheme for binary classes…

Machine Learning · Computer Science 2025-04-09 Idan Attias , Steve Hanneke , Arvind Ramaswami

Unlabelled Sample Compression Schemes for Intersection-Closed Classes and Extremal Classes

The sample compressibility of concept classes plays an important role in learning theory, as a sufficient condition for PAC learnability, and more recently as an avenue for robust generalisation in adaptive data analysis. Whether…

Machine Learning · Computer Science 2022-10-12 J. Hyam Rubinstein , Benjamin I. P. Rubinstein

Teaching and compressing for low VC-dimension

In this work we study the quantitative relation between VC-dimension and two other basic parameters related to learning and teaching. Namely, the quality of sample compression schemes and of teaching sets for classes of low VC-dimension.…

Machine Learning · Computer Science 2016-11-28 Shay Moran , Amir Shpilka , Avi Wigderson , Amir Yehudayoff

Multiclass Learnability Does Not Imply Sample Compression

A hypothesis class admits a sample compression scheme, if for every sample labeled by a hypothesis from the class, it is possible to retain only a small subsample, using which the labels on the entire sample can be inferred. The size of the…

Machine Learning · Computer Science 2023-09-22 Chirag Pabbaraju

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

This paper presents a construction of a proper and stable labelled sample compression scheme of size $O(\VCD^2)$ for any finite concept class, where $\VCD$ denotes the Vapnik-Chervonenkis Dimension. The construction is based on a well-known…

Machine Learning · Computer Science 2022-12-29 Farnam Mansouri , Sandra Zilles

Labeled sample compression schemes for complexes of oriented matroids

We show that the topes of a complex of oriented matroids (abbreviated COM) of VC-dimension $d$ admit a proper labeled sample compression scheme of size $d$. This considerably extends results of Moran and Warmuth on ample classes, of…

Combinatorics · Mathematics 2023-04-21 Victor Chepoi , Kolja Knauer , Manon Philibert

Measurability Aspects of the Compactness Theorem for Sample Compression Schemes

It was proved in 1998 by Ben-David and Litman that a concept space has a sample compression scheme of size d if and only if every finite subspace has a sample compression scheme of size d. In the compactness theorem, measurability of the…

Machine Learning · Statistics 2015-03-20 Damjan Kalajdzievski

Labeled compression schemes for extremal classes

It is a long-standing open problem whether there always exists a compression scheme whose size is of the order of the Vapnik-Chervonienkis (VC) dimension $d$. Recently compression schemes of size exponential in $d$ have been found for any…

Machine Learning · Computer Science 2016-07-25 Shay Moran , Manfred K. Warmuth

List Sample Compression and Uniform Convergence

List learning is a variant of supervised classification where the learner outputs multiple plausible labels for each instance rather than just one. We investigate classical principles related to generalization within the context of list…

Machine Learning · Computer Science 2026-03-24 Steve Hanneke , Shay Moran , Tom Waknine

Bounding Embeddings of VC Classes into Maximum Classes

One of the earliest conjectures in computational learning theory-the Sample Compression conjecture-asserts that concept classes (equivalently set systems) admit compression schemes of size linear in their VC dimension. To-date this…

Machine Learning · Computer Science 2014-02-04 J. Hyam Rubinstein , Benjamin I. P. Rubinstein , Peter L. Bartlett

High-arity Sample Compression

Recently, a series of works have started studying variations of concepts from learning theory for product spaces, which can be collected under the name high-arity learning theory. In this work, we consider a high-arity variant of sample…

Machine Learning · Computer Science 2026-05-15 Leonardo N. Coregliano , William Opich

Unlabeled Compression Schemes Exceeding the VC-dimension

In this note we disprove a conjecture of Kuzmin and Warmuth claiming that every family whose VC-dimension is at most d admits an unlabeled compression scheme to a sample of size at most d. We also study the unlabeled compression schemes of…

Combinatorics · Mathematics 2021-10-15 Dömötör Pálvölgyi , Gábor Tardos

On sample complexity for computational pattern recognition

In statistical setting of the pattern recognition problem the number of examples required to approximate an unknown labelling function is linear in the VC dimension of the target learning class. In this work we consider the question whether…

Machine Learning · Computer Science 2016-06-27 Daniil Ryabko

Dual VC Dimension Obstructs Sample Compression by Embeddings

This work studies embedding of arbitrary VC classes in well-behaved VC classes, focusing particularly on extremal classes. Our main result expresses an impossibility: such embeddings necessarily require a significant increase in dimension.…

Discrete Mathematics · Computer Science 2024-05-28 Zachary Chase , Bogdan Chornomaz , Steve Hanneke , Shay Moran , Amir Yehudayoff

Unlabeled sample compression schemes for oriented matroids

A long-standing sample compression conjecture asks to linearly bound the size of the optimal sample compression schemes by the Vapnik-Chervonenkis (VC) dimension of an arbitrary class. In this paper, we explore the rich metric and…

Combinatorics · Mathematics 2024-03-08 Tilen Marc

On statistical learning via the lens of compression

This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing…

Machine Learning · Computer Science 2017-01-02 Ofir David , Shay Moran , Amir Yehudayoff

"Compressed" Compressed Sensing

The field of compressed sensing has shown that a sparse but otherwise arbitrary vector can be recovered exactly from a small number of randomly constructed linear projections (or samples). The question addressed in this paper is whether an…

Information Theory · Computer Science 2010-01-26 Galen Reeves , Michael Gastpar

Sample compression schemes for balls in graphs

One of the open problems in machine learning is whether any set-family of VC-dimension $d$ admits a sample compression scheme of size $O(d)$. In this paper, we study this problem for balls in graphs. For a ball $B=B_r(x)$ of a graph…

Discrete Mathematics · Computer Science 2024-07-12 Jérémie Chalopin , Victor Chepoi , Fionn Mc Inerney , Sébastien Ratel , Yann Vaxès