Related papers: Kernel interpolation generalizes poorly

The phase diagram of kernel interpolation in large dimensions

The generalization ability of kernel interpolation in large dimensions (i.e., $n \asymp d^{\gamma}$ for some $\gamma>0$) might be one of the most interesting problems in the recent renaissance of kernel regression, since it may help us…

Machine Learning · Computer Science 2024-04-22 Haobo Zhang, Weihao Lu, Qian Lin

Generalization error of minimum weighted norm and kernel interpolation

We study the generalization error of functions that interpolate prescribed data points and are selected by minimizing a weighted norm. Under natural and general conditions, we prove that both the interpolants and their generalization errors…

Numerical Analysis · Mathematics 2021-02-11 Weilin Li

Spectral Bias and Task-Model Alignment Explain Generalization in Kernel Regression and Infinitely Wide Neural Networks

Generalization beyond a training dataset is a main goal of machine learning, but theoretical understanding of generalization remains an open problem for many models. The need for a new theory is exacerbated by recent observations in deep…

Machine Learning · Statistics 2022-02-08 Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

Just Interpolate: Kernel "Ridgeless" Regression Can Generalize

In the absence of explicit regularization, Kernel "Ridgeless" Regression with nonlinear kernels has the potential to fit the training data perfectly. It has been observed empirically, however, that such interpolated solutions can still…

Statistics Theory · Mathematics 2020-07-27 Tengyuan Liang, Alexander Rakhlin

Generalization in Kernel Regression Under Realistic Assumptions

It is by now well-established that modern over-parameterized models seem to elude the bias-variance tradeoff and generalize well despite overfitting noise. Many recent works attempt to analyze this phenomenon in the relatively tractable…

Machine Learning · Computer Science 2024-02-21 Daniel Barzilai, Ohad Shamir

On the Inconsistency of Kernel Ridgeless Regression in Fixed Dimensions

"Benign overfitting", the ability of certain algorithms to interpolate noisy training data and yet perform well out-of-sample, has been a topic of considerable recent interest. We show, using a fixed design setup, that an important class…

Machine Learning · Computer Science 2023-04-14 Daniel Beaglehole, Mikhail Belkin, Parthe Pandit

Strong inductive biases provably prevent harmless interpolation

Classical wisdom suggests that estimators should avoid fitting noise to achieve good generalization. In contrast, modern overparameterized models can yield small test error despite interpolating noise -- a phenomenon often called "benign…

Machine Learning · Statistics 2023-03-02 Michael Aerni, Marco Milanta, Konstantin Donhauser, Fanny Yang

Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate

Many modern machine learning models are trained to achieve zero or near-zero training error in order to obtain near-optimal (but non-zero) test error. This phenomenon of strong generalization performance for "overfitted" / interpolated…

Machine Learning · Statistics 2018-10-29 Mikhail Belkin, Daniel Hsu, Partha Mitra

To understand deep learning we need to understand kernel learning

Generalization performance of classifiers in deep learning has recently become a subject of intense study. Deep models, typically over-parametrized, tend to fit the training data exactly. Despite this "overfitting", they perform well on…

Machine Learning · Statistics 2018-06-18 Mikhail Belkin, Siyuan Ma, Soumik Mandal

General superconvergence for kernel-based approximation

Kernel interpolation is a fundamental technique for approximating functions from scattered data, with a well-understood convergence theory when interpolating elements of a reproducing kernel Hilbert space. Beyond this classical setting,…

Numerical Analysis · Mathematics 2025-05-19 Toni Karvonen, Gabriele Santin, Tizian Wenzel

On the convergence of generalized kernel-based interpolation by greedy data selection algorithms

We analyze the convergence of generalized kernel-based interpolation methods. This is done under minimalistic assumptions on both the kernel and the target function. On these grounds, we further prove convergence of popular greedy data…

Numerical Analysis · Mathematics 2024-11-26 Kristof Albrecht, Armin Iske

Sobolev norm inconsistency of kernel interpolation

We study the consistency of minimum-norm interpolation in reproducing kernel Hilbert spaces corresponding to bounded kernels. Our main result gives lower bounds for the generalization error of the kernel interpolation measured in a…

Machine Learning · Statistics 2025-09-30 Yunfei Yang

Deep Learning Generalization, Extrapolation, and Over-parameterization

We study the generalization of over-parameterized deep networks (for image classification) in relation to the convex hull of their training sets. Despite their great success, generalization of deep networks is considered a mystery. These…

Machine Learning · Computer Science 2022-03-22 Roozbeh Yousefzadeh

Consistent Interpolating Ensembles via the Manifold-Hilbert Kernel

Recent research in the theory of overparametrized learning has sought to establish generalization guarantees in the interpolating regime. Such results have been established for a few common classes of methods, but so far not for ensemble…

Machine Learning · Statistics 2022-05-20 Yutong Wang, Clayton D. Scott

Generalization Error Curves for Analytic Spectral Algorithms under Power-law Decay

The generalization error curve of a certain kernel regression method aims at determining the exact order of the generalization error under various source conditions, noise levels, and choices of the regularization parameter rather than the minimax…

Machine Learning · Computer Science 2024-11-26 Yicheng Li, Weiye Gan, Zuoqiang Shi, Qian Lin

The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization

Modern deep learning models employ considerably more parameters than required to fit the training data. Whereas conventional statistical wisdom suggests such models should drastically overfit, in practice these models generalize remarkably…

Machine Learning · Statistics 2020-08-18 Ben Adlam, Jeffrey Pennington

Out-of-Distribution Generalization in Kernel Regression

In real-world applications, the data-generating process for training a machine learning model often differs from what the model encounters at the test stage. Understanding how and whether machine learning models generalize under such…

Machine Learning · Statistics 2022-02-08 Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

Geometric Kernel Interpolation and Regression

Exploiting the variational interpretation of kernel interpolation we exhibit a direct connection between interpolation and regression, where interpolation appears as a limiting case of regression. By applying this framework to point clouds…

Numerical Analysis · Mathematics 2026-02-09 Patrick Guidotti

Interpolation, extrapolation, and local generalization in common neural networks

There has been a long history of works showing that neural networks have a hard time extrapolating beyond the training set. A recent study by Balestriero et al. (2021) challenges this view: defining interpolation as the state of belonging to…

Machine Learning · Computer Science 2022-07-19 Laurent Bonnasse-Gahot

Benign Overfitting without Linearity: Neural Network Classifiers Trained by Gradient Descent for Noisy Linear Data

Benign overfitting, the phenomenon where interpolating models generalize well in the presence of noisy data, was first observed in neural network models trained with gradient descent. To better understand this empirical observation, we…

Machine Learning · Computer Science 2025-07-04 Spencer Frei, Niladri S. Chatterji, Peter L. Bartlett