Related papers: When Do Neural Networks Outperform Kernel Methods?

Stacked Kernel Network

Kernel methods are powerful tools to capture nonlinear patterns behind data. They implicitly learn high (even infinite) dimensional nonlinear features in the Reproducing Kernel Hilbert Space (RKHS) while making the computation tractable by…

Machine Learning · Statistics 2017-11-28 Shuai Zhang , Jianxin Li , Pengtao Xie , Yingchun Zhang , Minglai Shao , Haoyi Zhou , Mengyi Yan

Stochastic Gradient Descent for Two-layer Neural Networks

This paper presents a comprehensive study on the convergence rates of the stochastic gradient descent (SGD) algorithm when applied to overparameterized two-layer neural networks. Our approach combines the Neural Tangent Kernel (NTK)…

Machine Learning · Statistics 2024-07-11 Dinghao Cao , Zheng-Chu Guo , Lei Shi

A Kernel Perspective for Regularizing Deep Neural Networks

We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various…

Machine Learning · Statistics 2019-05-15 Alberto Bietti , Grégoire Mialon , Dexiong Chen , Julien Mairal

Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance…

Machine Learning · Computer Science 2021-06-11 Maria Refinetti , Sebastian Goldt , Florent Krzakala , Lenka Zdeborová

Gradient Descent in Neural Networks as Sequential Learning in RKBS

The study of Neural Tangent Kernels (NTKs) has provided much needed insight into convergence and generalization properties of neural networks in the over-parametrized (wide) limit by approximating the network using a first-order Taylor…

Machine Learning · Statistics 2023-02-02 Alistair Shilton , Sunil Gupta , Santu Rana , Svetha Venkatesh

Kernel Stochastic Configuration Networks for Nonlinear Regression

Stochastic configuration networks (SCNs), as a class of randomized learner models, are featured by its way of random parameters assignment in the light of a supervisory mechanism, resulting in the universal approximation property at…

Machine Learning · Computer Science 2024-12-17 Yongxuan Chen , Dianhui Wang

Optimal Convergence for Distributed Learning with Stochastic Gradient Methods and Spectral Algorithms

We study generalization properties of distributed algorithms in the setting of nonparametric regression over a reproducing kernel Hilbert space (RKHS). We first investigate distributed stochastic gradient methods (SGM), with mini-batches…

Machine Learning · Statistics 2018-11-06 Junhong Lin , Volkan Cevher

Efficient kernel surrogates for neural network-based regression

Despite their immense promise in performing a variety of learning tasks, a theoretical understanding of the limitations of Deep Neural Networks (DNNs) has so far eluded practitioners. This is partly due to the inability to determine the…

Machine Learning · Computer Science 2024-01-25 Saad Qadeer , Andrew Engel , Amanda Howard , Adam Tsou , Max Vargas , Panos Stinis , Tony Chiang

Generalization Properties of hyper-RKHS and its Applications

This paper generalizes regularized regression problems in a hyper-reproducing kernel Hilbert space (hyper-RKHS), illustrates its utility for kernel learning and out-of-sample extensions, and proves asymptotic convergence results for the…

Machine Learning · Computer Science 2022-10-20 Fanghui Liu , Lei Shi , Xiaolin Huang , Jie Yang , Johan A. K. Suykens

Approximation of RKHS Functionals by Neural Networks

Motivated by the abundance of functional data such as time series and images, there has been a growing interest in integrating such data into neural networks and learning maps from function spaces to R (i.e., functionals). In this paper, we…

Machine Learning · Statistics 2024-03-20 Tian-Yi Zhou , Namjoon Suh , Guang Cheng , Xiaoming Huo

Analysis of the Gradient Descent Algorithm for a Deep Neural Network Model with Skip-connections

The behavior of the gradient descent (GD) algorithm is analyzed for a deep neural network model with skip-connections. It is proved that in the over-parametrized regime, for a suitable initialization, with high probability GD can find a…

Machine Learning · Computer Science 2019-04-16 Weinan E , Chao Ma , Qingcan Wang , Lei Wu

Linearly Convergent Mixup Learning

Learning in the reproducing kernel Hilbert space (RKHS) such as the support vector machine has been recognized as a promising technique. It continues to be highly effective and competitive in numerous prediction tasks, particularly in…

Machine Learning · Computer Science 2025-01-15 Gakuto Obi , Ayato Saito , Yuto Sasaki , Tsuyoshi Kato

Training Neural Networks as Learning Data-adaptive Kernels: Provable Representation and Approximation Benefits

Consider the problem: given the data pair $(\mathbf{x}, \mathbf{y})$ drawn from a population with $f_*(x) = \mathbf{E}[\mathbf{y} | \mathbf{x} = x]$, specify a neural network model and run gradient flow on the weights over time until…

Machine Learning · Statistics 2020-07-27 Xialiang Dou , Tengyuan Liang

Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

We derive a new Rademacher complexity bound for deep neural networks using Koopman operators, group representations, and reproducing kernel Hilbert spaces (RKHSs). The proposed bound describes why the models with high-rank weight matrices…

Machine Learning · Computer Science 2026-02-25 Yuka Hashimoto , Sho Sonoda , Isao Ishikawa , Masahiro Ikeda

Optimization on Submanifolds of Convolution Kernels in CNNs

Kernel normalization methods have been employed to improve robustness of optimization methods to reparametrization of convolution kernels, covariate shift, and to accelerate training of Convolutional Neural Networks (CNNs). However, our…

Computer Vision and Pattern Recognition · Computer Science 2016-10-25 Mete Ozay , Takayuki Okatani

Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space

To characterize the function space explored by neural networks (NNs) is an important aspect of learning theory. In this work, noticing that a multi-layer NN generates implicitly a hierarchy of reproducing kernel Hilbert spaces (RKHSs) -…

Machine Learning · Computer Science 2024-04-12 Zhengdao Chen

Recurrent Kernel Networks

Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links…

Machine Learning · Statistics 2019-10-18 Dexiong Chen , Laurent Jacob , Julien Mairal

Learning with Norm Constrained, Over-parameterized, Two-layer Neural Networks

Recent studies show that a reproducing kernel Hilbert space (RKHS) is not a suitable space to model functions by neural networks as the curse of dimensionality (CoD) cannot be evaded when trying to approximate even a single ReLU neuron…

Machine Learning · Statistics 2024-06-27 Fanghui Liu , Leello Dadi , Volkan Cevher

Scattering Networks for Hybrid Representation Learning

Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modelling images. In particular, by working in scattering space, we achieve…

Machine Learning · Computer Science 2018-09-19 Edouard Oyallon , Sergey Zagoruyko , Gabriel Huang , Nikos Komodakis , Simon Lacoste-Julien , Matthew Blaschko , Eugene Belilovsky

Scalable Kernel Methods via Doubly Stochastic Gradients

The general perception is that kernel methods are not scalable, and neural nets are the methods of choice for nonlinear learning problems. Or have we simply not tried hard enough for kernel methods? Here we propose an approach that scales…

Machine Learning · Computer Science 2015-09-11 Bo Dai , Bo Xie , Niao He , Yingyu Liang , Anant Raj , Maria-Florina Balcan , Le Song