Related papers: Robust Learning with Optimal Error

Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness

We study the problem of learning in the presence of an adversary that can corrupt an $\eta$ fraction of the training examples with the goal of causing failure on a specific test point. In the realizable setting, prior work established that…

Machine Learning · Computer Science 2025-06-04 Bogdan Chornomaz , Yonatan Koren , Shay Moran , Tom Waknine

Is nasty noise actually harder than malicious noise?

We consider the relative abilities and limitations of computationally efficient algorithms for learning in the presence of noise, under two well-studied and challenging adversarial noise models for learning Boolean functions: malicious…

Machine Learning · Computer Science 2026-02-18 Guy Blanc , Yizhi Huang , Tal Malkin , Rocco A. Servedio

On Optimal Learning Under Targeted Data Poisoning

Consider the task of learning a hypothesis class $\mathcal{H}$ in the presence of an adversary that can replace up to an $\eta$ fraction of the examples in the training set with arbitrary adversarial examples. The adversary aims to fail the…

Machine Learning · Computer Science 2022-10-13 Steve Hanneke , Amin Karbasi , Mohammad Mahmoody , Idan Mehalel , Shay Moran

Learning to Learn from Mistakes: Robust Optimization for Adversarial Noise

Sensitivity to adversarial noise hinders deployment of machine learning algorithms in security-critical applications. Although many adversarial defenses have been proposed, robustness to adversarial noise remains an open problem. The most…

Machine Learning · Computer Science 2020-08-13 Alex Serban , Erik Poll , Joost Visser

The Optimal Noise in Noise-Contrastive Learning Is Not What You Think

Learning a parametric model of a data distribution is a well-known statistical problem that has seen renewed interest as it is brought to scale in deep learning. Framing the problem as a self-supervised task, where data samples are…

Machine Learning · Statistics 2022-07-27 Omar Chehab , Alexandre Gramfort , Aapo Hyvarinen

The Power of Localization for Efficiently Learning Linear Separators with Noise

We introduce a new approach for designing computationally efficient learning algorithms that are tolerant to noise, and demonstrate its effectiveness by designing algorithms with improved noise tolerance guarantees for learning linear…

Machine Learning · Computer Science 2018-06-05 Pranjal Awasthi , Maria Florina Balcan , Philip M. Long

Testing Noise Assumptions of Learning Algorithms

We pose a fundamental question in computational learning theory: can we efficiently test whether a training set satisfies the assumptions of a given noise model? This question has remained unaddressed despite decades of research on learning…

Machine Learning · Computer Science 2026-05-11 Surbhi Goel , Adam R. Klivans , Konstantinos Stavropoulos , Arsen Vasilyan

Attribute-Efficient Learning of Halfspaces with Malicious Noise: Near-Optimal Label Complexity and Noise Tolerance

This paper is concerned with computationally efficient learning of homogeneous sparse halfspaces in $\mathbb{R}^d$ under noise. Though recent works have established attribute-efficient learning algorithms under various types of label noise…

Machine Learning · Statistics 2021-03-03 Jie Shen , Chicheng Zhang

Optimal Learning

This paper studies the problem of learning an unknown function $f$ from given data about $f$. The learning problem is to give an approximation $\hat f$ to $f$ that predicts the values of $f$ away from the data. There are numerous settings…

Machine Learning · Computer Science 2023-06-27 Peter Binev , Andrea Bonito , Ronald DeVore , Guergana Petrova

Refined Error Bounds for Several Learning Algorithms

This article studies the achievable guarantees on the error rates of certain learning algorithms, with particular focus on refining logarithmic factors. Many of the results are based on a general technique for obtaining bounds on the error…

Machine Learning · Computer Science 2016-09-13 Steve Hanneke

Taking a Moment for Distributional Robustness

A rich line of recent work has studied distributionally robust learning approaches that seek to learn a hypothesis that performs well, in the worst-case, on many different distributions over a population. We argue that although the most…

Machine Learning · Computer Science 2024-05-10 Jabari Hastings , Christopher Jung , Charlotte Peale , Vasilis Syrgkanis

Optimizing the Noise in Self-Supervised Learning: from Importance Sampling to Noise-Contrastive Estimation

Self-supervised learning is an increasingly popular approach to unsupervised learning, achieving state-of-the-art results. A prevalent approach consists in contrasting data points and noise points within a classification task: this requires…

Machine Learning · Statistics 2023-01-25 Omar Chehab , Alexandre Gramfort , Aapo Hyvarinen

An Efficient Tester-Learner for Halfspaces

We give the first efficient algorithm for learning halfspaces in the testable learning model recently defined by Rubinfeld and Vasilyan (2023). In this model, a learner certifies that the accuracy of its output hypothesis is near optimal…

Machine Learning · Computer Science 2023-03-14 Aravind Gollakota , Adam R. Klivans , Konstantinos Stavropoulos , Arsen Vasilyan

Improve Noise Tolerance of Robust Loss via Noise-Awareness

Robust loss minimization is an important strategy for handling robust learning issue on noisy labels. Current approaches for designing robust losses involve the introduction of noise-robust factors, i.e., hyperparameters, to control the…

Machine Learning · Computer Science 2023-09-06 Kehui Ding , Jun Shu , Deyu Meng , Zongben Xu

Optimal Recovery Meets Minimax Estimation

A fundamental problem in statistics and machine learning is to estimate a function $f$ from possibly noisy observations of its point samples. The goal is to design a numerical algorithm to construct an approximation $\hat f$ to $f$ in a…

Statistics Theory · Mathematics 2025-05-30 Ronald DeVore , Robert D. Nowak , Rahul Parhi , Guergana Petrova , Jonathan W. Siegel

Optimal Decision Tree and Adaptive Submodular Ranking with Noisy Outcomes

In pool-based active learning, the learner is given an unlabeled data set and aims to efficiently learn the unknown hypothesis by querying the labels of the data points. This can be formulated as the classical Optimal Decision Tree (ODT)…

Machine Learning · Computer Science 2024-08-01 Su Jia , Fatemeh Navidi , Viswanath Nagarajan , R. Ravi

Towards Optimal Problem Dependent Generalization Error Bounds in Statistical Learning Theory

We study problem-dependent rates, i.e., generalization errors that scale near-optimally with the variance, the effective loss, or the gradient norms evaluated at the "best hypothesis." We introduce a principled framework dubbed "uniform…

Machine Learning · Statistics 2020-12-25 Yunbei Xu , Assaf Zeevi

Rates of convergence in active learning

We study the rates of convergence in generalization error achievable by active learning under various types of label noise. Additionally, we study the general problem of model selection for active learning with a nested hierarchy of…

Statistics Theory · Mathematics 2011-03-10 Steve Hanneke

On the lifting of deterministic convergence rates for inverse problems with stochastic noise

Both for the theoretical and practical treatment of Inverse Problems, the modeling of the noise is a crucial part. One either models the measurement via a deterministic worst-case error assumption or assumes a certain stochastic behavior of…

Probability · Mathematics 2016-04-26 Daniel Gerth , Andreas Hofinger , Ronny Ramlau

Self-Adaptive Training: beyond Empirical Risk Minimization

We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially…

Machine Learning · Computer Science 2020-10-01 Lang Huang , Chao Zhang , Hongyang Zhang