English
Related papers

Related papers: Kernel Methods and Multi-layer Perceptrons Learn L…

200 papers

Modern deep learning models employ considerably more parameters than required to fit the training data. Whereas conventional statistical wisdom suggests such models should drastically overfit, in practice these models generalize remarkably…

Machine Learning · Statistics 2020-08-18 Ben Adlam , Jeffrey Pennington

The double descent phenomenon challenges traditional statistical learning theory by revealing scenarios where larger models do not necessarily lead to reduced performance on unseen data. While this counterintuitive behavior has been…

The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel…

A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance…

Machine Learning · Computer Science 2021-06-11 Maria Refinetti , Sebastian Goldt , Florent Krzakala , Lenka Zdeborová

We consider the problem of high-dimensional non-linear variable selection for supervised learning. Our approach is based on performing linear selection among exponentially many appropriately defined positive definite kernels that…

Machine Learning · Computer Science 2009-09-08 Francis Bach

Kernel methods are powerful machine learning techniques which implement generic non-linear functions to solve complex tasks in a simple way. They Have a solid mathematical background and exhibit excellent performance in practice. However,…

Machine Learning · Computer Science 2021-01-27 J. Emmanuel Johnson , Valero Laparra , Adrián Pérez-Suay , Miguel D. Mahecha , Gustau Camps-Valls

Linear mixed models (LMMs) are used extensively to model dependecies of observations in linear regression and are used extensively in many application areas. Parameter estimation for LMMs can be computationally prohibitive on big data.…

Machine Learning · Statistics 2019-03-08 Zilong Tan , Kimberly Roche , Xiang Zhou , Sayan Mukherjee

Understanding the dynamics of neural networks in different width regimes is crucial for improving their training and performance. We present an exact solution for the learning dynamics of a one-hidden-layer linear network, with…

Machine Learning · Computer Science 2025-02-24 Yizhou Xu , Liu Ziyin

In this paper, the flexibility, versatility and predictive power of kernel regression are combined with now lavishly available network data to create regression models with even greater predictive performances. Building from previous work…

Machine Learning · Statistics 2020-11-05 E. Pei , E. Fokoué

Deep learning methods have predominantly been applied to large artificial neural networks. Despite their state-of-the-art performance, these large networks typically do not generalize well to datasets with limited sample sizes. In this…

Machine Learning · Statistics 2016-11-17 Eric Strobl , Shyam Visweswaran

Metric and kernel learning are important in several machine learning applications. However, most existing metric learning algorithms are limited to learning metrics over low-dimensional data, while existing kernel learning algorithms are…

Machine Learning · Computer Science 2009-11-02 Prateek Jain , Brian Kulis , Jason V. Davis , Inderjit S. Dhillon

Deep learning models, such as wide neural networks, can be conceptualized as nonlinear dynamical physical systems characterized by a multitude of interacting degrees of freedom. Such systems in the infinite limit, tend to exhibit simplified…

Machine Learning · Computer Science 2024-01-09 Ori Shem-Ur , Yaron Oz

Understanding how convolutional neural networks (CNNs) can efficiently learn high-dimensional functions remains a fundamental challenge. A popular belief is that these models harness the local and hierarchical structure of natural data such…

Machine Learning · Statistics 2023-06-02 Francesco Cagnetta , Alessandro Favero , Matthieu Wyart

Research in modern data-driven dynamical systems is typically focused on the three key challenges of high dimensionality, unknown dynamics, and nonlinearity. The dynamic mode decomposition (DMD) has emerged as a cornerstone for modeling…

Fluid Dynamics · Physics 2022-04-27 Peter J. Baddoo , Benjamin Herrmann , Beverley J. McKeon , Steven L. Brunton

Significant theoretical work has established that in specific regimes, neural networks trained by gradient descent behave like kernel methods. However, in practice, it is known that neural networks strongly outperform their associated…

Machine Learning · Computer Science 2022-07-01 Alex Damian , Jason D. Lee , Mahdi Soltanolkotabi

The use of kernels for nonlinear prediction is widespread in machine learning. They have been popularized in support vector machines and used in kernel ridge regression, amongst others. Kernel methods share three aspects. First, instead of…

Machine Learning · Statistics 2025-08-25 Patrick J. F. Groenen , Michael Greenacre

We introduce a data-driven order reduction method for nonlinear control systems, drawing on recent progress in machine learning and statistical dimensionality reduction. The method rests on the assumption that the nonlinear system behaves…

Optimization and Control · Mathematics 2016-04-04 Jake Bouvrie , Boumediene Hamzi

A fairly comprehensive analysis is presented for the gradient descent dynamics for training two-layer neural network models in the situation when the parameters in both layers are updated. General initialization schemes as well as general…

Machine Learning · Computer Science 2020-02-27 Weinan E , Chao Ma , Lei Wu

In this paper, we study the feature learning ability of two-layer neural networks in the mean-field regime through the lens of kernel methods. To focus on the dynamics of the kernel induced by the first layer, we utilize a two-timescale…

Machine Learning · Computer Science 2024-04-09 Shokichi Takakura , Taiji Suzuki

This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a…

Machine Learning · Statistics 2024-06-06 Yihang Chen , Fanghui Liu , Taiji Suzuki , Volkan Cevher
‹ Prev 1 2 3 10 Next ›