English
Related papers

Related papers: Efficient kernel surrogates for neural network-bas…

200 papers

Recent research shows that the following two models are equivalent: (a) infinitely wide neural networks (NNs) trained under l2 loss by gradient descent with infinitesimally small learning rate (b) kernel regression with respect to so-called…

Machine Learning · Computer Science 2019-10-29 Sanjeev Arora , Simon S. Du , Zhiyuan Li , Ruslan Salakhutdinov , Ruosong Wang , Dingli Yu

The study of deep neural networks (DNNs) in the infinite-width limit, via the so-called neural tangent kernel (NTK) approach, has provided new insights into the dynamics of learning, generalization, and the impact of initialization. One key…

Machine Learning · Computer Science 2021-06-16 Sina Alemohammad , Zichao Wang , Randall Balestriero , Richard Baraniuk

A recent trend in explainable AI research has focused on surrogate modeling, where neural networks are approximated as simpler ML algorithms such as kernel machines. A second trend has been to utilize kernel functions in various…

Machine Learning · Computer Science 2024-03-13 Andrew Engel , Zhichao Wang , Natalie S. Frank , Ioana Dumitriu , Sutanay Choudhury , Anand Sarwate , Tony Chiang

Graph Convolutional Networks (GCNs) have emerged as powerful tools for learning on network structured data. Although empirically successful, GCNs exhibit certain behaviour that has no rigorous explanation -- for instance, the performance of…

Machine Learning · Computer Science 2023-11-07 Mahalakshmi Sabanayagam , Pascal Esser , Debarghya Ghoshdastidar

Expressiveness and generalization of deep models was recently addressed via the connection between neural networks (NNs) and kernel learning, where first-order dynamics of NN during a gradient-descent (GD) optimization were related to…

Machine Learning · Computer Science 2020-04-21 Dmitry Kopitkov , Vadim Indelman

The Neural Tangent Kernel (NTK) has recently attracted intense study, as it describes the evolution of an over-parameterized Neural Network (NN) trained by gradient descent. However, it is now well-known that gradient descent is not always…

Machine Learning · Computer Science 2021-03-23 Lei Tan , Shutong Wu , Xiaolin Huang

While deep learning has achieved remarkable success across a wide range of applications, its theoretical understanding of representation learning remains limited. Deep neural kernels provide a principled framework to interpret…

Machine Learning · Computer Science 2025-11-11 Yong-Ming Tian , Shuang Liang , Shao-Qun Zhang , Feng-Lei Fan

The evolution of a deep neural network trained by the gradient descent can be described by its neural tangent kernel (NTK) as introduced in [20], where it was proven that in the infinite width limit the NTK converges to an explicit limiting…

Machine Learning · Computer Science 2019-09-19 Jiaoyang Huang , Horng-Tzer Yau

The Neural Tangent Kernel (NTK) characterizes the behavior of infinitely-wide neural networks trained under least squares loss by gradient descent. Recent works also report that NTK regression can outperform finitely-wide neural networks…

Machine Learning · Computer Science 2021-12-09 Amir Zandieh , Insu Han , Haim Avron , Neta Shoham , Chaewon Kim , Jinwoo Shin

Recent theoretical works based on the neural tangent kernel (NTK) have shed light on the optimization and generalization of over-parameterized networks, and partially bridge the gap between their practical success and classical learning…

Machine Learning · Computer Science 2020-08-10 Kyung-Su Kim , Aurélie C. Lozano , Eunho Yang

State-of-the-art neural network training methods depend on the gradient of the network function. Therefore, they cannot be applied to networks whose activation functions do not have useful derivatives, such as binary and discrete-time…

Machine Learning · Statistics 2025-06-24 Luke Eilers , Raoul-Martin Memmesheimer , Sven Goedeke

This paper demonstrates that in classification problems, fully connected neural networks (FCNs) and residual neural networks (ResNets) cannot be approximated by kernel logistic regression based on the Neural Tangent Kernel (NTK) under…

Machine Learning · Computer Science 2025-07-15 Zixiong Yu , Songtao Tian , Guhan Chen

A rising trend in theoretical deep learning is to understand why deep learning works through Neural Tangent Kernel (NTK) [jgh18], a kernel method that is equivalent to using gradient descent to train a multi-layer infinitely-wide neural…

Machine Learning · Computer Science 2023-09-15 Lianke Qin , Zhao Song , Baocheng Sun

The Neural Tangent Kernel (NTK) characterizes the behavior of infinitely wide neural nets trained under least squares loss by gradient descent. However, despite its importance, the super-quadratic runtime of kernel methods limits the use of…

Machine Learning · Computer Science 2021-07-28 Amir Zandieh

At initialization, artificial neural networks (ANNs) are equivalent to Gaussian processes in the infinite-width limit, thus connecting them to kernel methods. We prove that the evolution of an ANN during training can also be described by a…

Machine Learning · Computer Science 2020-02-11 Arthur Jacot , Franck Gabriel , Clément Hongler

The performance of the data-dependent neural tangent kernel (NTK; Jacot et al. (2018)) associated with a trained deep neural network (DNN) often matches or exceeds that of the full network. This implies that DNN training via gradient…

Machine Learning · Computer Science 2025-05-22 Johannes Schwab , Bryan Kelly , Semyon Malamud , Teng Andrea Xu

For certain infinitely-wide neural networks, the neural tangent kernel (NTK) theory fully characterizes generalization, but for the networks used in practice, the empirical NTK only provides a rough first-order approximation. Still, a…

Machine Learning · Computer Science 2021-10-14 Guillermo Ortiz-Jiménez , Seyed-Mohsen Moosavi-Dezfooli , Pascal Frossard

How well does a classic deep net architecture like AlexNet or VGG19 classify on a standard dataset such as CIFAR-10 when its width --- namely, number of channels in convolutional layers, and number of nodes in fully-connected internal…

Machine Learning · Computer Science 2019-11-05 Sanjeev Arora , Simon S. Du , Wei Hu , Zhiyuan Li , Ruslan Salakhutdinov , Ruosong Wang

Neural Tangent Kernel (NTK) theory is widely used to study the dynamics of infinitely-wide deep neural networks (DNNs) under gradient descent. But do the results for infinitely-wide networks give us hints about the behavior of real…

Machine Learning · Computer Science 2022-02-02 Mariia Seleznova , Gitta Kutyniok

Neural tangent kernels (NTKs) have been proposed to study the behavior of trained neural networks from the perspective of Gaussian processes. An important result in this body of work is the theorem of equivalence between a trained neural…

Machine Learning · Statistics 2025-01-22 Haoran Liu , Anthony Tai , David J. Crandall , Chunfeng Huang
‹ Prev 1 2 3 10 Next ›