Related papers: Does data interpolation contradict statistical opt…

On Optimal Interpolation In Linear Regression

Understanding when and why interpolating methods generalize well has recently been a topic of interest in statistical learning theory. However, systematically connecting interpolating methods to achievable notions of optimality has only…

Machine Learning · Statistics 2021-10-22 Eduard Oravkin , Patrick Rebeschini

Memorize to Generalize: on the Necessity of Interpolation in High Dimensional Linear Regression

We examine the necessity of interpolation in overparameterized models, that is, when achieving optimal predictive risk in machine learning problems requires (nearly) interpolating the training data. In particular, we consider simple…

Machine Learning · Statistics 2022-06-17 Chen Cheng , John Duchi , Rohith Kuditipudi

Benefit of Interpolation in Nearest Neighbor Algorithms

In some studies \citep[e.g.,][]{zhang2016understanding} of deep learning, it is observed that over-parametrized deep neural networks achieve a small testing error even when the training error is almost zero. Despite numerous works towards…

Machine Learning · Statistics 2022-02-25 Yue Xing , Qifan Song , Guang Cheng

Surprises in adversarially-trained linear regression

State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max…

Machine Learning · Statistics 2022-10-21 Antônio H. Ribeiro , Dave Zachariah , Thomas B. Schön

Online Nonparametric Regression

We establish optimal rates for online regression for arbitrary classes of regression functions in terms of the sequential entropy introduced in (Rakhlin, Sridharan, Tewari, 2010). The optimal rates are shown to exhibit a phase transition…

Machine Learning · Statistics 2014-02-12 Alexander Rakhlin , Karthik Sridharan

The Interplay Between Interpolation and Aggregation in Regression: Optimal Sample Complexity

This work investigates theoretically the interplay between interpolation and aggregation in regression. We establish that the $\gamma$-graph dimension characterizes learnability for a broad class of natural aggregation procedures.…

Machine Learning · Computer Science 2026-05-29 Mikael Møller Høgsgaard , Kasper Green Larsen , Liang-Yu Zou

Double descent for least-squares interpolation on contaminated data: A simulation study

Overparametrized models can exhibit an excellent generalization performance, although they should be prone to overfitting according to classical statistical theory. The discovery of the "double descent", indicating that the generalization…

Machine Learning · Computer Science 2026-05-22 Tino Werner

To Each Optimizer a Norm, To Each Norm its Generalization

We study the implicit regularization of optimization methods for linear models interpolating the training data in the under-parametrized and over-parametrized regimes. Since it is difficult to determine whether an optimizer converges to…

Machine Learning · Computer Science 2022-07-12 Sharan Vaswani , Reza Babanezhad , Jose Gallego-Posada , Aaron Mishkin , Simon Lacoste-Julien , Nicolas Le Roux

On damage of interpolation to adversarial robustness in regression

Deep neural networks (DNNs) typically involve a large number of parameters and are trained to achieve zero or near-zero training error. Despite such interpolation, they often exhibit strong generalization performance on unseen data, a…

Machine Learning · Statistics 2026-01-23 Jingfu Peng , Yuhong Yang

Analysis of Interpolating Regression Models and the Double Descent Phenomenon

A regression model with more parameters than data points in the training data is overparametrized and has the capability to interpolate the training data. Based on the classical bias-variance tradeoff expressions, it is commonly assumed…

Machine Learning · Computer Science 2023-04-18 Tomas McKelvey

Empirical Risk Minimization in the Interpolating Regime with Application to Neural Network Learning

A common strategy to train deep neural networks (DNNs) is to use very large architectures and to train them until they (almost) achieve zero training error. Empirically observed good generalization performance on test data, even in the…

Machine Learning · Statistics 2021-07-26 Nicole Mücke , Ingo Steinwart

Interpolation and Learning with Scale Dependent Kernels

We study the learning properties of nonparametric ridge-less least squares. In particular, we consider the common case of estimators defined by scale dependent kernels, and focus on the role of the scale. These estimators interpolate the…

Machine Learning · Statistics 2021-11-11 Nicolò Pagliana , Alessandro Rudi , Ernesto De Vito , Lorenzo Rosasco

Interpolated Adversarial Training: Achieving Robust Neural Networks without Sacrificing Too Much Accuracy

Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance…

Machine Learning · Statistics 2022-10-20 Alex Lamb , Vikas Verma , Kenji Kawaguchi , Alexander Matyasko , Savya Khosla , Juho Kannala , Yoshua Bengio

A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality

We study the problem of transfer learning, observing that previous efforts to understand its information-theoretic limits do not fully exploit the geometric structure of the source and target domains. In contrast, our study first…

Machine Learning · Computer Science 2022-02-24 Xuhui Zhang , Jose Blanchet , Soumyadip Ghosh , Mark S. Squillante

Benefit of Interpolation in Nearest Neighbor Algorithms

The over-parameterized models attract much attention in the era of data science and deep learning. It is empirically observed that although these models, e.g. deep neural networks, over-fit the training data, they can still achieve small…

Machine Learning · Statistics 2019-09-27 Yue Xing , Qifan Song , Guang Cheng

Predictive Model Degrees of Freedom in Linear Regression

Overparametrized interpolating models have drawn increasing attention from machine learning. Some recent studies suggest that regularized interpolating models can generalize well. This phenomenon seemingly contradicts the conventional…

Statistics Theory · Mathematics 2022-07-04 Bo Luan , Yoonkyung Lee , Yunzhang Zhu

Regularization properties of adversarially-trained linear regression

State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it…

Machine Learning · Statistics 2023-10-18 Antônio H. Ribeiro , Dave Zachariah , Francis Bach , Thomas B. Schön

Adversarial Training in Low-Label Regimes with Margin-Based Interpolation

Adversarial training has emerged as an effective approach to train robust neural network models that are resistant to adversarial attacks, even in low-label regimes where labeled data is scarce. In this paper, we introduce a novel…

Machine Learning · Computer Science 2024-11-28 Tian Ye , Rajgopal Kannan , Viktor Prasanna

Least squares approximations in linear statistical inverse learning problems

Statistical inverse learning aims at recovering an unknown function $f$ from randomly scattered and possibly noisy point evaluations of another function $g$, connected to $f$ via an ill-posed mathematical model. In this paper we blend…

Statistics Theory · Mathematics 2024-01-22 Tapio Helin

Surprises in High-Dimensional Ridgeless Least Squares Interpolation

Interpolators -- estimators that achieve zero training error -- have attracted growing attention in machine learning, mainly because state-of-the art neural networks appear to be models of this type. In this paper, we study minimum $\ell_2$…

Statistics Theory · Mathematics 2022-09-12 Trevor Hastie , Andrea Montanari , Saharon Rosset , Ryan J. Tibshirani