Related papers: Sample Compression Scheme Reductions

Sample compression schemes for VC classes

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. Roughly speaking, a sample compression scheme of size $k$ means that given an arbitrary list…

Machine Learning · Computer Science 2015-04-15 Shay Moran, Amir Yehudayoff

Multiclass Learnability Does Not Imply Sample Compression

A hypothesis class admits a sample compression scheme, if for every sample labeled by a hypothesis from the class, it is possible to retain only a small subsample, using which the labels on the entire sample can be inferred. The size of the…

Machine Learning · Computer Science 2023-09-22 Chirag Pabbaraju

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

This paper presents a construction of a proper and stable labelled sample compression scheme of size $O(\mathrm{VCD}^2)$ for any finite concept class, where $\mathrm{VCD}$ denotes the Vapnik-Chervonenkis Dimension. The construction is based on a well-known…

Machine Learning · Computer Science 2022-12-29 Farnam Mansouri, Sandra Zilles

Sample compression schemes for balls in structurally sparse graphs

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. In a sample compression scheme, we are given a large sample of vertices of a fixed hypergraph…

Discrete Mathematics · Computer Science 2026-04-06 Romain Bourneuf, Jędrzej Hodor, Piotr Micek, Clément Rambaud

Unlabelled Sample Compression Schemes for Intersection-Closed Classes and Extremal Classes

The sample compressibility of concept classes plays an important role in learning theory, as a sufficient condition for PAC learnability, and more recently as an avenue for robust generalisation in adaptive data analysis. Whether…

Machine Learning · Computer Science 2022-10-12 J. Hyam Rubinstein, Benjamin I. P. Rubinstein

On statistical learning via the lens of compression

This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing…

Machine Learning · Computer Science 2017-01-02 Ofir David, Shay Moran, Amir Yehudayoff

Optimally compressing VC classes

Resolving a conjecture of Littlestone and Warmuth, we show that any concept class of VC-dimension $d$ has a sample compression scheme of size $d$.

Machine Learning · Computer Science 2022-01-14 Zachary Chase

Measurability Aspects of the Compactness Theorem for Sample Compression Schemes

It was proved in 1998 by Ben-David and Litman that a concept space has a sample compression scheme of size d if and only if every finite subspace has a sample compression scheme of size d. In the compactness theorem, measurability of the…

Machine Learning · Statistics 2015-03-20 Damjan Kalajdzievski

Teaching and compressing for low VC-dimension

In this work we study the quantitative relation between VC-dimension and two other basic parameters related to learning and teaching, namely the quality of sample compression schemes and of teaching sets for classes of low VC-dimension.…

Machine Learning · Computer Science 2016-11-28 Shay Moran, Amir Shpilka, Avi Wigderson, Amir Yehudayoff

Dual VC Dimension Obstructs Sample Compression by Embeddings

This work studies embedding of arbitrary VC classes in well-behaved VC classes, focusing particularly on extremal classes. Our main result expresses an impossibility: such embeddings necessarily require a significant increase in dimension.…

Discrete Mathematics · Computer Science 2024-05-28 Zachary Chase, Bogdan Chornomaz, Steve Hanneke, Shay Moran, Amir Yehudayoff

Labeled sample compression schemes for complexes of oriented matroids

We show that the topes of a complex of oriented matroids (abbreviated COM) of VC-dimension $d$ admit a proper labeled sample compression scheme of size $d$. This considerably extends results of Moran and Warmuth on ample classes, of…

Combinatorics · Mathematics 2023-04-21 Victor Chepoi, Kolja Knauer, Manon Philibert

Bounding Embeddings of VC Classes into Maximum Classes

One of the earliest conjectures in computational learning theory, the Sample Compression conjecture, asserts that concept classes (equivalently set systems) admit compression schemes of size linear in their VC dimension. To date this…

Machine Learning · Computer Science 2014-02-04 J. Hyam Rubinstein, Benjamin I. P. Rubinstein, Peter L. Bartlett

Categorical Feature Compression via Submodular Optimization

In the era of big data, learning from categorical features with very large vocabularies (e.g., 28 million for the Criteo click prediction dataset) has become a practical challenge for machine learning researchers and practitioners. We…

Machine Learning · Computer Science 2019-05-01 MohammadHossein Bateni, Lin Chen, Hossein Esfandiari, Thomas Fu, Vahab S. Mirrokni, Afshin Rostamizadeh

Sample Compression for Real-Valued Learners

We give an algorithmically efficient version of the learner-to-compression scheme conversion in Moran and Yehudayoff (2016). In extending this technique to real-valued hypotheses, we also obtain an efficient regression-to-bounded sample…

Machine Learning · Computer Science 2018-05-23 Steve Hanneke, Aryeh Kontorovich, Menachem Sadigurschi

Labeled compression schemes for extremal classes

It is a long-standing open problem whether there always exists a compression scheme whose size is of the order of the Vapnik-Chervonenkis (VC) dimension $d$. Recently compression schemes of size exponential in $d$ have been found for any…

Machine Learning · Computer Science 2016-07-25 Shay Moran, Manfred K. Warmuth

Sample compression schemes for balls in graphs

One of the open problems in machine learning is whether any set-family of VC-dimension $d$ admits a sample compression scheme of size $O(d)$. In this paper, we study this problem for balls in graphs. For a ball $B=B_r(x)$ of a graph…

Discrete Mathematics · Computer Science 2024-07-12 Jérémie Chalopin, Victor Chepoi, Fionn Mc Inerney, Sébastien Ratel, Yann Vaxès

Robust Model Compression Using Deep Hypotheses

Machine Learning models should ideally be compact and robust. Compactness provides efficiency and comprehensibility whereas robustness provides resilience. Both topics have been studied in recent years but in isolation. Here we present a…

Machine Learning · Computer Science 2021-03-16 Omri Armstrong, Ran Gilad-Bachrach

List Sample Compression and Uniform Convergence

List learning is a variant of supervised classification where the learner outputs multiple plausible labels for each instance rather than just one. We investigate classical principles related to generalization within the context of list…

Machine Learning · Computer Science 2026-03-24 Steve Hanneke, Shay Moran, Tom Waknine

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

Foundation models are strong data compressors, but when accounting for their parameter size, their compression ratios are inferior to standard compression algorithms. Naively reducing the parameter count does not necessarily help as it…

Machine Learning · Computer Science 2025-05-26 David Heurtel-Depeiges, Anian Ruoss, Joel Veness, Tim Genewein

Sample Compression Unleashed: New Generalization Bounds for Real Valued Losses

The sample compression theory provides generalization guarantees for predictors that can be fully defined using a subset of the training dataset and a (short) message string, generally defined as a binary sequence. Previous works provided…

Machine Learning · Computer Science 2025-03-12 Mathieu Bazinet, Valentina Zantedeschi, Pascal Germain