Related papers: On sample complexity for computational pattern rec…

Statistical Learning of Arbitrary Computable Classifiers

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

On the Complexity of Learning from Label Proportions

In the problem of learning with label proportions, which we call LLP learning, the training data is unlabeled, and only the proportions of examples receiving each label are given. The goal is to learn a hypothesis that predicts the…

Machine Learning · Computer Science 2020-04-08 Benjamin Fish, Lev Reyzin

Optimal Sample Complexity of Contrastive Learning

Contrastive learning is a highly successful technique for learning representations of data from labeled tuples, specifying the distance relations within the tuple. We study the sample complexity of contrastive learning, i.e. the minimum…

Machine Learning · Computer Science 2023-12-04 Noga Alon, Dmitrii Avdiukhin, Dor Elboim, Orr Fischer, Grigory Yaroslavtsev

Achievable Rates for Pattern Recognition

Biological and machine pattern recognition systems face a common challenge: Given sensory data about an unknown object, classify the object by comparing the sensory data with a library of internal representations stored in memory. In many…

Information Theory · Computer Science 2007-07-13 M. Brandon Westover, Joseph A. O'Sullivan

Representation Learning for Clustering: A Statistical Framework

We address the problem of communicating domain knowledge from a user to the designer of a clustering algorithm. We propose a protocol in which the user provides a clustering of a relatively small random sample of a data set. The algorithm…

Machine Learning · Statistics 2015-06-22 Hassan Ashtiani, Shai Ben-David

Super-sparse Learning in Similarity Spaces

In several applications, input samples are more naturally represented in terms of similarities between each other, rather than in terms of feature vectors. In these settings, machine-learning algorithms can become very computationally…

Computer Vision and Pattern Recognition · Computer Science 2017-12-19 Ambra Demontis, Marco Melis, Battista Biggio, Giorgio Fumera, Fabio Roli

Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks

We provide several new results on the sample complexity of vector-valued linear predictors (parameterized by a matrix), and more generally neural networks. Focusing on size-independent bounds, where only the Frobenius norm distance of the…

Machine Learning · Computer Science 2023-10-26 Roey Magen, Ohad Shamir

Deep Unsupervised Learning of Visual Similarities

Exemplar learning of visual similarities in an unsupervised manner is a problem of paramount importance to Computer Vision. In this context, however, the recent breakthrough in deep learning could not yet unfold its full potential. With…

Computer Vision and Pattern Recognition · Computer Science 2018-02-26 Artsiom Sanakoyeu, Miguel A. Bautista, Björn Ommer

Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination

Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether this observation can be extended beyond the conventional…

Computer Vision and Pattern Recognition · Computer Science 2018-05-08 Zhirong Wu, Yuanjun Xiong, Stella Yu, Dahua Lin

A Characterization of Semi-Supervised Adversarially-Robust PAC Learnability

We study the problem of learning an adversarially robust predictor to test time attacks in the semi-supervised PAC model. We address the question of how many labeled and unlabeled examples are required to ensure learning. We show that…

Machine Learning · Computer Science 2024-05-07 Idan Attias, Steve Hanneke, Yishay Mansour

Provable Lifelong Learning of Representations

In lifelong learning, tasks (or classes) to be learned arrive sequentially over time in arbitrary order. During training, knowledge from previous tasks can be captured and transferred to subsequent ones to improve sample efficiency. We…

Machine Learning · Computer Science 2022-03-02 Xinyuan Cao, Weiyang Liu, Santosh S. Vempala

More data speeds up training time in learning halfspaces over sparse vectors

The increased availability of data in recent years has led several authors to ask whether it is possible to use data as a computational resource. That is, if more data is available, beyond the sample complexity limit, is it possible…

Machine Learning · Computer Science 2013-11-12 Amit Daniely, Nati Linial, Shai Shalev-Shwartz

Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels

Due to the costliness of labelled data in real-world applications, semi-supervised learning, underpinned by pseudo labelling, is an appealing solution. However, handling confusing samples is nontrivial: discarding valuable confusing samples…

Computer Vision and Pattern Recognition · Computer Science 2024-02-13 Changrui Chen, Jungong Han, Kurt Debattista

Approximability and Generalisation

Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such…

Machine Learning · Computer Science 2022-03-16 Andrew J. Turner, Ata Kabán

Estimating Learnability in the Sublinear Data Regime

We consider the problem of estimating how well a model class is capable of fitting a distribution of labeled data. We show that it is often possible to accurately estimate this "learnability" even when given an amount of data that is too…

Machine Learning · Computer Science 2019-03-26 Weihao Kong, Gregory Valiant

Sample compression schemes for VC classes

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. Roughly speaking, a sample compression scheme of size $k$ means that given an arbitrary list…

Machine Learning · Computer Science 2015-04-15 Shay Moran, Amir Yehudayoff

On statistical learning via the lens of compression

This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing…

Machine Learning · Computer Science 2017-01-02 Ofir David, Shay Moran, Amir Yehudayoff

The Sample Complexity of Replicable Realizable PAC Learning

In this paper, we consider the problem of replicable realizable PAC learning. We construct a particularly hard learning problem and show a sample complexity lower bound with a close to $(\log|H|)^{3/2}$ dependence on the size of the…

Machine Learning · Computer Science 2026-02-24 Kasper Green Larsen, Markus Engelund Mathiasen, Chirag Pabbaraju, Clement Svendsen

Semi-supervised Object Detection via Virtual Category Learning

Due to the costliness of labelled data in real-world applications, semi-supervised object detectors, underpinned by pseudo labelling, are appealing. However, handling confusing samples is nontrivial: discarding valuable confusing samples…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Changrui Chen, Kurt Debattista, Jungong Han

Towards optimal sampling for learning sparse approximation in high dimensions

In this chapter, we discuss recent work on learning sparse approximations to high-dimensional functions on data, where the target functions may be scalar-, vector- or even Hilbert space-valued. Our main objective is to study how the…

Numerical Analysis · Mathematics 2022-02-08 Ben Adcock, Juan M. Cardenas, Nick Dexter, Sebastian Moraga