Related papers: Kernel Methods and Multi-layer Perceptrons Learn L…

The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization

Modern deep learning models employ considerably more parameters than required to fit the training data. Whereas conventional statistical wisdom suggests such models should drastically overfit, in practice these models generalize remarkably…

Machine Learning · Statistics 2020-08-18 Ben Adlam, Jeffrey Pennington

Double descent in quantum kernel methods

The double descent phenomenon challenges traditional statistical learning theory by revealing scenarios where larger models do not necessarily lead to reduced performance on unseen data. While this counterintuitive behavior has been…

Quantum Physics · Physics 2026-01-19 Marie Kempkes, Aroosa Ijaz, Elies Gil-Fuster, Carlos Bravo-Prieto, Jakob Spiegelberg, Evert van Nieuwenburg, Vedran Dunjko

How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets

The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel…

Machine Learning · Computer Science 2019-01-25 Zhiyun Lu, Avner May, Kuan Liu, Alireza Bagheri Garakani, Dong Guo, Aurélien Bellet, Linxi Fan, Michael Collins, Brian Kingsbury, Michael Picheny, Fei Sha

Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance…

Machine Learning · Computer Science 2021-06-11 Maria Refinetti, Sebastian Goldt, Florent Krzakala, Lenka Zdeborová

High-Dimensional Non-Linear Variable Selection through Hierarchical Kernel Learning

We consider the problem of high-dimensional non-linear variable selection for supervised learning. Our approach is based on performing linear selection among exponentially many appropriately defined positive definite kernels that…

Machine Learning · Computer Science 2009-09-08 Francis Bach

Kernel Methods and their derivatives: Concept and perspectives for the Earth system sciences

Kernel methods are powerful machine learning techniques which implement generic non-linear functions to solve complex tasks in a simple way. They have a solid mathematical background and exhibit excellent performance in practice. However,…

Machine Learning · Computer Science 2021-01-27 J. Emmanuel Johnson, Valero Laparra, Adrián Pérez-Suay, Miguel D. Mahecha, Gustau Camps-Valls

Scalable Algorithms for Learning High-Dimensional Linear Mixed Models

Linear mixed models (LMMs) are used extensively to model dependencies of observations in linear regression and are used extensively in many application areas. Parameter estimation for LMMs can be computationally prohibitive on big data.…

Machine Learning · Statistics 2019-03-08 Zilong Tan, Kimberly Roche, Xiang Zhou, Sayan Mukherjee

Three Mechanisms of Feature Learning in a Linear Network

Understanding the dynamics of neural networks in different width regimes is crucial for improving their training and performance. We present an exact solution for the learning dynamics of a one-hidden-layer linear network, with…

Machine Learning · Computer Science 2025-02-24 Yizhou Xu, Liu Ziyin

Graph Enhanced High Dimensional Kernel Regression

In this paper, the flexibility, versatility and predictive power of kernel regression are combined with now lavishly available network data to create regression models with even greater predictive performances. Building from previous work…

Machine Learning · Statistics 2020-11-05 E. Pei, E. Fokoué

Deep Multiple Kernel Learning

Deep learning methods have predominantly been applied to large artificial neural networks. Despite their state-of-the-art performance, these large networks typically do not generalize well to datasets with limited sample sizes. In this…

Machine Learning · Statistics 2016-11-17 Eric Strobl, Shyam Visweswaran

Metric and Kernel Learning using a Linear Transformation

Metric and kernel learning are important in several machine learning applications. However, most existing metric learning algorithms are limited to learning metrics over low-dimensional data, while existing kernel learning algorithms are…

Machine Learning · Computer Science 2009-11-02 Prateek Jain, Brian Kulis, Jason V. Davis, Inderjit S. Dhillon

Weak Correlations as the Underlying Principle for Linearization of Gradient-Based Learning Systems

Deep learning models, such as wide neural networks, can be conceptualized as nonlinear dynamical physical systems characterized by a multitude of interacting degrees of freedom. Such systems, in the infinite limit, tend to exhibit simplified…

Machine Learning · Computer Science 2024-01-09 Ori Shem-Ur, Yaron Oz

What Can Be Learnt With Wide Convolutional Neural Networks?

Understanding how convolutional neural networks (CNNs) can efficiently learn high-dimensional functions remains a fundamental challenge. A popular belief is that these models harness the local and hierarchical structure of natural data such…

Machine Learning · Statistics 2023-06-02 Francesco Cagnetta, Alessandro Favero, Matthieu Wyart

Kernel Learning for Robust Dynamic Mode Decomposition: Linear and Nonlinear Disambiguation Optimization (LANDO)

Research in modern data-driven dynamical systems is typically focused on the three key challenges of high dimensionality, unknown dynamics, and nonlinearity. The dynamic mode decomposition (DMD) has emerged as a cornerstone for modeling…

Fluid Dynamics · Physics 2022-04-27 Peter J. Baddoo, Benjamin Herrmann, Beverley J. McKeon, Steven L. Brunton

Neural Networks can Learn Representations with Gradient Descent

Significant theoretical work has established that in specific regimes, neural networks trained by gradient descent behave like kernel methods. However, in practice, it is known that neural networks strongly outperform their associated…

Machine Learning · Computer Science 2022-07-01 Alex Damian, Jason D. Lee, Mahdi Soltanolkotabi

Interpretable Kernels

The use of kernels for nonlinear prediction is widespread in machine learning. They have been popularized in support vector machines and used in kernel ridge regression, amongst others. Kernel methods share three aspects. First, instead of…

Machine Learning · Statistics 2025-08-25 Patrick J. F. Groenen, Michael Greenacre

Kernel Methods for the Approximation of Nonlinear Systems

We introduce a data-driven order reduction method for nonlinear control systems, drawing on recent progress in machine learning and statistical dimensionality reduction. The method rests on the assumption that the nonlinear system behaves…

Optimization and Control · Mathematics 2016-04-04 Jake Bouvrie, Boumediene Hamzi

A Comparative Analysis of the Optimization and Generalization Property of Two-layer Neural Network and Random Feature Models Under Gradient Descent Dynamics

A fairly comprehensive analysis is presented for the gradient descent dynamics for training two-layer neural network models in the situation when the parameters in both layers are updated. General initialization schemes as well as general…

Machine Learning · Computer Science 2020-02-27 Weinan E, Chao Ma, Lei Wu

Mean-field Analysis on Two-layer Neural Networks from a Kernel Perspective

In this paper, we study the feature learning ability of two-layer neural networks in the mean-field regime through the lens of kernel methods. To focus on the dynamics of the kernel induced by the first layer, we utilize a two-timescale…

Machine Learning · Computer Science 2024-04-09 Shokichi Takakura, Taiji Suzuki

High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization

This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a…

Machine Learning · Statistics 2024-06-06 Yihang Chen, Fanghui Liu, Taiji Suzuki, Volkan Cevher