Related papers: When Do Neural Networks Outperform Kernel Methods?

200 papers

Kernel methods are powerful tools to capture nonlinear patterns behind data. They implicitly learn high (even infinite) dimensional nonlinear features in the Reproducing Kernel Hilbert Space (RKHS) while making the computation tractable by…

Machine Learning · Statistics 2017-11-28 Shuai Zhang, Jianxin Li, Pengtao Xie, Yingchun Zhang, Minglai Shao, Haoyi Zhou, Mengyi Yan

This paper presents a comprehensive study on the convergence rates of the stochastic gradient descent (SGD) algorithm when applied to overparameterized two-layer neural networks. Our approach combines the Neural Tangent Kernel (NTK)…

Machine Learning · Statistics 2024-07-11 Dinghao Cao, Zheng-Chu Guo, Lei Shi

We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various…

Machine Learning · Statistics 2019-05-15 Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance…

Machine Learning · Computer Science 2021-06-11 Maria Refinetti, Sebastian Goldt, Florent Krzakala, Lenka Zdeborová

The study of Neural Tangent Kernels (NTKs) has provided much needed insight into convergence and generalization properties of neural networks in the over-parametrized (wide) limit by approximating the network using a first-order Taylor…

Machine Learning · Statistics 2023-02-02 Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

Stochastic configuration networks (SCNs), a class of randomized learner models, are characterized by the way they assign random parameters under a supervisory mechanism, resulting in the universal approximation property at…

Machine Learning · Computer Science 2024-12-17 Yongxuan Chen, Dianhui Wang

We study generalization properties of distributed algorithms in the setting of nonparametric regression over a reproducing kernel Hilbert space (RKHS). We first investigate distributed stochastic gradient methods (SGM), with mini-batches…

Machine Learning · Statistics 2018-11-06 Junhong Lin, Volkan Cevher

Despite their immense promise in performing a variety of learning tasks, a theoretical understanding of the limitations of Deep Neural Networks (DNNs) has so far eluded practitioners. This is partly due to the inability to determine the…

Machine Learning · Computer Science 2024-01-25 Saad Qadeer, Andrew Engel, Amanda Howard, Adam Tsou, Max Vargas, Panos Stinis, Tony Chiang

This paper generalizes regularized regression problems in a hyper-reproducing kernel Hilbert space (hyper-RKHS), illustrates its utility for kernel learning and out-of-sample extensions, and proves asymptotic convergence results for the…

Machine Learning · Computer Science 2022-10-20 Fanghui Liu, Lei Shi, Xiaolin Huang, Jie Yang, Johan A. K. Suykens

Motivated by the abundance of functional data such as time series and images, there has been a growing interest in integrating such data into neural networks and learning maps from function spaces to $\mathbb{R}$ (i.e., functionals). In this paper, we…

Machine Learning · Statistics 2024-03-20 Tian-Yi Zhou, Namjoon Suh, Guang Cheng, Xiaoming Huo

The behavior of the gradient descent (GD) algorithm is analyzed for a deep neural network model with skip-connections. It is proved that in the over-parametrized regime, for a suitable initialization, with high probability GD can find a…

Machine Learning · Computer Science 2019-04-16 Weinan E, Chao Ma, Qingcan Wang, Lei Wu

Learning in the reproducing kernel Hilbert space (RKHS) such as the support vector machine has been recognized as a promising technique. It continues to be highly effective and competitive in numerous prediction tasks, particularly in…

Machine Learning · Computer Science 2025-01-15 Gakuto Obi, Ayato Saito, Yuto Sasaki, Tsuyoshi Kato

Consider the problem: given the data pair $(\mathbf{x}, \mathbf{y})$ drawn from a population with $f_*(x) = \mathbf{E}[\mathbf{y} | \mathbf{x} = x]$, specify a neural network model and run gradient flow on the weights over time until…

Machine Learning · Statistics 2020-07-27 Xialiang Dou, Tengyuan Liang

We derive a new Rademacher complexity bound for deep neural networks using Koopman operators, group representations, and reproducing kernel Hilbert spaces (RKHSs). The proposed bound describes why the models with high-rank weight matrices…

Machine Learning · Computer Science 2026-02-25 Yuka Hashimoto, Sho Sonoda, Isao Ishikawa, Masahiro Ikeda

Kernel normalization methods have been employed to improve robustness of optimization methods to reparametrization of convolution kernels, covariate shift, and to accelerate training of Convolutional Neural Networks (CNNs). However, our…

Computer Vision and Pattern Recognition · Computer Science 2016-10-25 Mete Ozay, Takayuki Okatani

To characterize the function space explored by neural networks (NNs) is an important aspect of learning theory. In this work, noticing that a multi-layer NN generates implicitly a hierarchy of reproducing kernel Hilbert spaces (RKHSs) -…

Machine Learning · Computer Science 2024-04-12 Zhengdao Chen

Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links…

Machine Learning · Statistics 2019-10-18 Dexiong Chen, Laurent Jacob, Julien Mairal

Recent studies show that a reproducing kernel Hilbert space (RKHS) is not a suitable space to model functions by neural networks as the curse of dimensionality (CoD) cannot be evaded when trying to approximate even a single ReLU neuron…

Machine Learning · Statistics 2024-06-27 Fanghui Liu, Leello Dadi, Volkan Cevher

Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modelling images. In particular, by working in scattering space, we achieve…

The general perception is that kernel methods are not scalable, and neural nets are the methods of choice for nonlinear learning problems. Or have we simply not tried hard enough for kernel methods? Here we propose an approach that scales…

Machine Learning · Computer Science 2015-09-11 Bo Dai, Bo Xie, Niao He, Yingyu Liang, Anant Raj, Maria-Florina Balcan, Le Song