Related papers: Optimal Learners for Multiclass Problems

Multiclass learnability and the ERM principle

We study the sample complexity of multiclass prediction in several learning settings. For the PAC setting our analysis reveals a surprising phenomenon: In sharp contrast to binary classification, we show that there exist multiclass…

Machine Learning · Computer Science 2016-04-19 Amit Daniely, Sivan Sabato, Shai Ben-David, Shai Shalev-Shwartz

Understanding Aggregations of Proper Learners in Multiclass Classification

Multiclass learnability is known to exhibit a properness barrier: there are learnable classes which cannot be learned by any proper learner. Binary classification faces no such barrier for learnability, but a similar one for optimal…

Machine Learning · Computer Science 2025-08-13 Julian Asilis, Mikael Møller Høgsgaard, Grigoris Velegkas

Proper Learnability and the Role of Unlabeled Data

Proper learning refers to the setting in which learners must emit predictors in the underlying hypothesis class $H$, and often leads to learners with simple algorithmic forms (e.g. empirical risk minimization (ERM), structural risk…

Machine Learning · Computer Science 2025-12-10 Julian Asilis, Siddartha Devic, Shaddin Dughmi, Vatsal Sharan, Shang-Hua Teng

Regularization and Optimal Multiclass Learning

The quintessential learning algorithm of empirical risk minimization (ERM) is known to fail in various settings for which uniform convergence does not characterize learning. It is therefore unsurprising that the practice of machine learning…

Machine Learning · Computer Science 2024-06-26 Julian Asilis, Siddartha Devic, Shaddin Dughmi, Vatsal Sharan, Shang-Hua Teng

Learning Rules-First Classifiers

Complex classifiers may exhibit "embarrassing" failures in cases where humans can easily provide a justified classification. Avoiding such failures is obviously of key importance. In this work, we focus on one such setting, where a label is…

Machine Learning · Computer Science 2019-06-14 Deborah Cohen, Amit Daniely, Amir Globerson, Gal Elidan

Learning from Positive and Unlabeled Data under the Selected At Random Assumption

For many interesting tasks, such as medical diagnosis and web page classification, a learner only has access to some positively labeled examples and many unlabeled examples. Learning from this type of data requires making assumptions about…

Machine Learning · Computer Science 2018-08-28 Jessa Bekker, Jesse Davis

Efficient and Differentiable Conformal Prediction with General Function Classes

Quantifying the data uncertainty in learning tasks is often done by learning a prediction interval or prediction set of the label given the input. Two commonly desired properties for learned prediction sets are valid coverage and…

Machine Learning · Computer Science 2022-05-31 Yu Bai, Song Mei, Huan Wang, Yingbo Zhou, Caiming Xiong

Robust Empirical Risk Minimization with Tolerance

Developing simple, sample-efficient learning algorithms for robust classification is a pressing issue in today's tech-dominated world, and current theoretical techniques requiring exponential sample complexity and complicated improper…

Machine Learning · Computer Science 2023-02-07 Robi Bhattacharjee, Max Hopkins, Akash Kumar, Hantao Yu, Kamalika Chaudhuri

Risk-Consistent Multiclass Learning from Random Label-Subset Membership Queries

Obtaining accurate class labels is often costly or unreliable, and may also be limited by privacy or other practical conditions. Compared with asking an annotator to provide the exact class, it is often easier to ask whether the true label…

Machine Learning · Computer Science 2026-05-11 Jiaxu Su, Junpeng Li, Changchun Hua, Yana Yang

Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes

We investigate the learning dynamics of classifiers in scenarios where classes are separable or classifiers are over-parameterized. In both cases, Empirical Risk Minimization (ERM) results in zero training error. However, there are many…

Machine Learning · Computer Science 2024-10-23 Julius Martinetz, Christoph Linse, Thomas Martinetz

Quantifying the Prediction Uncertainty of Machine Learning Models for Individual Data

Machine learning models have exhibited exceptional results in various domains. The most prevalent approach for learning is the empirical risk minimizer (ERM), which adapts the model's weights to reduce the loss on a training set and…

Machine Learning · Computer Science 2024-12-11 Koby Bibas

Collaborative Learning with Different Labeling Functions

We study a variant of Collaborative PAC Learning, in which we aim to learn an accurate classifier for each of the $n$ data distributions, while minimizing the number of samples drawn from them in total. Unlike in the usual collaborative…

Machine Learning · Computer Science 2024-05-24 Yuyang Deng, Mingda Qiao

Consistency and Finite Sample Behavior of Binary Class Probability Estimation

In this work we investigate to what extent one can recover class probabilities within the empirical risk minimization (ERM) paradigm. The main aim of our paper is to extend existing results and emphasize the tight relations between…

Machine Learning · Computer Science 2020-07-22 Alexander Mey, Marco Loog

Majority-of-Three: The Simplest Optimal Learner?

Developing an optimal PAC learning algorithm in the realizable setting, where empirical risk minimization (ERM) is suboptimal, was a major open problem in learning theory for decades. The problem was finally resolved by Hanneke a few years…

Machine Learning · Statistics 2024-03-15 Ishaq Aden-Ali, Mikael Møller Høgsgaard, Kasper Green Larsen, Nikita Zhivotovskiy

On the ERM Principle in Meta-Learning

Classic supervised learning involves algorithms trained on $n$ labeled examples to produce a hypothesis $h \in \mathcal{H}$ aimed at performing well on unseen examples. Meta-learning extends this by training across $n$ tasks, with $m$…

Machine Learning · Statistics 2024-11-28 Yannay Alon, Steve Hanneke, Shay Moran, Uri Shalit

On the Efficiency of ERM in Feature Learning

Given a collection of feature maps indexed by a set $\mathcal{T}$, we study the performance of empirical risk minimization (ERM) on regression problems with square loss over the union of the linear classes induced by these feature maps.…

Machine Learning · Statistics 2024-11-20 Ayoub El Hanchi, Chris J. Maddison, Murat A. Erdogdu

Agnostic Multi-Robust Learning Using ERM

A fundamental problem in robust learning is asymmetry: a learner needs to correctly classify every one of exponentially-many perturbations that an adversary might make to a test-time natural example. In contrast, the attacker only needs to…

Machine Learning · Computer Science 2024-02-14 Saba Ahmadi, Avrim Blum, Omar Montasser, Kevin Stangl

How Does Independence Help Generalization? Sample Complexity of ERM on Product Distributions

While many classical notions of learnability (e.g., PAC learnability) are distribution-free, utilizing the specific structures of an input distribution may improve learning performance. For example, a product distribution on a…

Machine Learning · Computer Science 2022-12-14 Tao Lin

Do highly over-parameterized neural networks generalize since bad solutions are rare?

We study over-parameterized classifiers where Empirical Risk Minimization (ERM) for learning leads to zero training error. In these over-parameterized settings there are many global minima with zero training error, some of which generalize…

Machine Learning · Computer Science 2023-12-05 Julius Martinetz, Thomas Martinetz

Optimal Binary Classifier Aggregation for General Losses

We address the problem of aggregating an ensemble of predictors with known loss bounds in a semi-supervised binary classification setting, to minimize prediction loss incurred on the unlabeled data. We find the minimax optimal predictions…

Machine Learning · Computer Science 2016-11-08 Akshay Balsubramani, Yoav Freund