English
Related papers

Related papers: Does data interpolation contradict statistical opt…

200 papers

Understanding when and why interpolating methods generalize well has recently been a topic of interest in statistical learning theory. However, systematically connecting interpolating methods to achievable notions of optimality has only…

Machine Learning · Statistics 2021-10-22 Eduard Oravkin , Patrick Rebeschini

We examine the necessity of interpolation in overparameterized models, that is, when achieving optimal predictive risk in machine learning problems requires (nearly) interpolating the training data. In particular, we consider simple…

Machine Learning · Statistics 2022-06-17 Chen Cheng , John Duchi , Rohith Kuditipudi

In some studies \citep[e.g.,][]{zhang2016understanding} of deep learning, it is observed that over-parametrized deep neural networks achieve a small testing error even when the training error is almost zero. Despite numerous works towards…

Machine Learning · Statistics 2022-02-25 Yue Xing , Qifan Song , Guang Cheng

State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max…

Machine Learning · Statistics 2022-10-21 Antônio H. Ribeiro , Dave Zachariah , Thomas B. Schön

We establish optimal rates for online regression for arbitrary classes of regression functions in terms of the sequential entropy introduced in (Rakhlin, Sridharan, Tewari, 2010). The optimal rates are shown to exhibit a phase transition…

Machine Learning · Statistics 2014-02-12 Alexander Rakhlin , Karthik Sridharan

This work investigates theoretically the interplay between interpolation and aggregation in regression. We establish that the $\gamma$-graph dimension characterizes learnability for a broad class of natural aggregation procedures.…

Machine Learning · Computer Science 2026-05-29 Mikael Møller Høgsgaard , Kasper Green Larsen , Liang-Yu Zou

Overparametrized models can exhibit an excellent generalization performance, although they should be prone to overfitting according to classical statistical theory. The discovery of the "double descent", indicating that the generalization…

Machine Learning · Computer Science 2026-05-22 Tino Werner

We study the implicit regularization of optimization methods for linear models interpolating the training data in the under-parametrized and over-parametrized regimes. Since it is difficult to determine whether an optimizer converges to…

Machine Learning · Computer Science 2022-07-12 Sharan Vaswani , Reza Babanezhad , Jose Gallego-Posada , Aaron Mishkin , Simon Lacoste-Julien , Nicolas Le Roux

Deep neural networks (DNNs) typically involve a large number of parameters and are trained to achieve zero or near-zero training error. Despite such interpolation, they often exhibit strong generalization performance on unseen data, a…

Machine Learning · Statistics 2026-01-23 Jingfu Peng , Yuhong Yang

A regression model with more parameters than data points in the training data is overparametrized and has the capability to interpolate the training data. Based on the classical bias-variance tradeoff expressions, it is commonly assumed…

Machine Learning · Computer Science 2023-04-18 Tomas McKelvey

A common strategy to train deep neural networks (DNNs) is to use very large architectures and to train them until they (almost) achieve zero training error. Empirically observed good generalization performance on test data, even in the…

Machine Learning · Statistics 2021-07-26 Nicole Mücke , Ingo Steinwart

We study the learning properties of nonparametric ridge-less least squares. In particular, we consider the common case of estimators defined by scale dependent kernels, and focus on the role of the scale. These estimators interpolate the…

Machine Learning · Statistics 2021-11-11 Nicolò Pagliana , Alessandro Rudi , Ernesto De Vito , Lorenzo Rosasco

Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance…

Machine Learning · Statistics 2022-10-20 Alex Lamb , Vikas Verma , Kenji Kawaguchi , Alexander Matyasko , Savya Khosla , Juho Kannala , Yoshua Bengio

We study the problem of transfer learning, observing that previous efforts to understand its information-theoretic limits do not fully exploit the geometric structure of the source and target domains. In contrast, our study first…

Machine Learning · Computer Science 2022-02-24 Xuhui Zhang , Jose Blanchet , Soumyadip Ghosh , Mark S. Squillante

The over-parameterized models attract much attention in the era of data science and deep learning. It is empirically observed that although these models, e.g. deep neural networks, over-fit the training data, they can still achieve small…

Machine Learning · Statistics 2019-09-27 Yue Xing , Qifan Song , Guang Cheng

Overparametrized interpolating models have drawn increasing attention from machine learning. Some recent studies suggest that regularized interpolating models can generalize well. This phenomenon seemingly contradicts the conventional…

Statistics Theory · Mathematics 2022-07-04 Bo Luan , Yoonkyung Lee , Yunzhang Zhu

State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it…

Machine Learning · Statistics 2023-10-18 Antônio H. Ribeiro , Dave Zachariah , Francis Bach , Thomas B. Schön

Adversarial training has emerged as an effective approach to train robust neural network models that are resistant to adversarial attacks, even in low-label regimes where labeled data is scarce. In this paper, we introduce a novel…

Machine Learning · Computer Science 2024-11-28 Tian Ye , Rajgopal Kannan , Viktor Prasanna

Statistical inverse learning aims at recovering an unknown function $f$ from randomly scattered and possibly noisy point evaluations of another function $g$, connected to $f$ via an ill-posed mathematical model. In this paper we blend…

Statistics Theory · Mathematics 2024-01-22 Tapio Helin

Interpolators -- estimators that achieve zero training error -- have attracted growing attention in machine learning, mainly because state-of-the art neural networks appear to be models of this type. In this paper, we study minimum $\ell_2$…

Statistics Theory · Mathematics 2022-09-12 Trevor Hastie , Andrea Montanari , Saharon Rosset , Ryan J. Tibshirani
‹ Prev 1 2 3 10 Next ›