Related papers: Interpolation Learning With Minimum Description Le…

Provable Tempered Overfitting of Minimal Nets and Typical Nets

We study the overfitting behavior of fully connected deep Neural Networks (NNs) with binary weights fitted to perfectly classify a noisy training set. We consider interpolation using both the smallest NN (having the minimal number of…

Machine Learning · Computer Science 2024-10-28 Itamar Harel, William M. Hoza, Gal Vardi, Itay Evron, Nathan Srebro, Daniel Soudry

Information-Theoretic Probing with Minimum Description Length

To measure how well pretrained representations encode some linguistic property, it is common to use accuracy of a probe, i.e. a classifier trained to predict the property from the representations. Despite widespread adoption of probes,…

Computation and Language · Computer Science 2020-03-30 Elena Voita, Ivan Titov

Memorize to Generalize: on the Necessity of Interpolation in High Dimensional Linear Regression

We examine the necessity of interpolation in overparameterized models, that is, when achieving optimal predictive risk in machine learning problems requires (nearly) interpolating the training data. In particular, we consider simple…

Machine Learning · Statistics 2022-06-17 Chen Cheng, John Duchi, Rohith Kuditipudi

Tight bounds for minimum l1-norm interpolation of noisy data

We provide matching upper and lower bounds of order $\sigma^2/\log(d/n)$ for the prediction error of the minimum $\ell_1$-norm interpolator, a.k.a. basis pursuit. Our result is tight up to negligible terms when $d \gg n$, and is the first…

Statistics Theory · Mathematics 2022-03-09 Guillaume Wang, Konstantin Donhauser, Fanny Yang

Minimum Description Length and Generalization Guarantees for Representation Learning

A major challenge in designing efficient statistical supervised learning algorithms is finding representations that perform well not only on available training samples but also on unseen data. While the study of representation learning has…

Machine Learning · Statistics 2024-02-06 Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski

Noisy Interpolation Learning with Shallow Univariate ReLU Networks

Understanding how overparameterized neural networks generalize despite perfect interpolation of noisy training data is a fundamental question. Mallinar et al. 2022 noted that neural networks seem to often exhibit "tempered overfitting",…

Machine Learning · Computer Science 2024-03-25 Nirmit Joshi, Gal Vardi, Nathan Srebro

A law of adversarial risk, interpolation, and label noise

In supervised learning, it has been shown that label noise in the data can be interpolated without penalties on test accuracy. We show that interpolating label noise induces adversarial vulnerability, and prove the first theorem showing the…

Machine Learning · Statistics 2023-03-15 Daniel Paleka, Amartya Sanyal

Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate

Many modern machine learning models are trained to achieve zero or near-zero training error in order to obtain near-optimal (but non-zero) test error. This phenomenon of strong generalization performance for "overfitted" / interpolated…

Machine Learning · Statistics 2018-10-29 Mikhail Belkin, Daniel Hsu, Partha Mitra

Binary Classification with Instance and Label Dependent Label Noise

Learning with label dependent label noise has been extensively explored in both theory and practice; however, dealing with instance (i.e., feature) and label dependent label noise continues to be a challenging task. The difficulty arises…

Machine Learning · Statistics 2023-06-07 Hyungki Im, Paul Grigas

Minimum $\ell_{1}$-norm interpolators: Precise asymptotics and multiple descent

An evolving line of machine learning works observe empirical evidence that suggests interpolating estimators -- the ones that achieve zero training error -- may not necessarily be harmful. This paper pursues theoretical understanding for an…

Statistics Theory · Mathematics 2021-10-19 Yue Li, Yuting Wei

Theoretical Guarantees of Deep Embedding Losses Under Label Noise

Collecting labeled data to train deep neural networks is costly and even impractical for many tasks. Thus, research effort has been focused on automatically curated datasets or unsupervised and weakly supervised learning. The common problem…

Machine Learning · Computer Science 2019-01-03 Nam Le, Jean-Marc Odobez

Robust Learning Under Label Noise With Iterative Noise-Filtering

We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to…

Machine Learning · Computer Science 2019-06-04 Duc Tam Nguyen, Thi-Phuong-Nhung Ngo, Zhongyu Lou, Michael Klar, Laura Beggel, Thomas Brox

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

Learning with reduced labeling standards, such as noisy label, partial label, and multiple label candidates, which we generically refer to as imprecise labels, is a commonplace challenge in machine learning tasks. Previous methods…

Machine Learning · Computer Science 2024-10-31 Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

On semi-supervised learning

Semi-supervised learning deals with the problem of how, if possible, to take advantage of a huge amount of unclassified data, to perform a classification in situations when, typically, there is little labeled data. Even though this is not…

Machine Learning · Statistics 2020-12-11 Alejandro Cholaquidis, Ricardo Fraiman, Mariela Sued

The Effect of Label Noise on the Information Content of Neural Representations

In supervised classification tasks, models are trained to predict a label for each data point. In real-world datasets, these labels are often noisy due to annotation errors. While the impact of label noise on the performance of deep…

Machine Learning · Computer Science 2025-10-09 Ali Hussaini Umar, Franky Kevin Nando Tezoh, Jean Barbier, Santiago Acevedo, Alessandro Laio

Interpolation-based Contrastive Learning for Few-Label Semi-Supervised Learning

Semi-supervised learning (SSL) has long been proved to be an effective technique to construct powerful models with limited labels. In the existing literature, consistency regularization-based methods, which force the perturbed samples to…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Xihong Yang, Xiaochang Hu, Sihang Zhou, Xinwang Liu, En Zhu

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

We show that label noise exists in adversarial training. Such label noise is due to the mismatch between the true label distribution of adversarial examples and the label inherited from clean examples - the true label distribution is…

Machine Learning · Computer Science 2023-10-17 Chengyu Dong, Liyuan Liu, Jingbo Shang

Regression with Sensor Data Containing Incomplete Observations

This paper addresses a regression problem in which output label values are the results of sensing the magnitude of a phenomenon. A low value of such labels can mean either that the actual magnitude of the phenomenon was low or that the…

Machine Learning · Computer Science 2023-06-01 Takayuki Katsuki, Takayuki Osogami

The Minimum Description Length Principle and Model Selection in Spectropolarimetry

It is shown that the two-part Minimum Description Length Principle can be used to discriminate among different models that can explain a given observed dataset. The description length is chosen to be the sum of the lengths of the message…

Astrophysics · Physics 2008-11-26 A. Asensio Ramos

Classification with imperfect training labels

We study the effect of imperfect training data labels on the performance of classification methods. In a general setting, where the probability that an observation in the training dataset is mislabelled may depend on both the feature vector…

Statistics Theory · Mathematics 2019-05-07 Timothy I. Cannings, Yingying Fan, Richard J. Samworth