English
Related papers

Related papers: Reverse Engineering the Neural Tangent Kernel

200 papers

Deep neural networks have become essential for numerous applications due to their strong empirical performance such as vision, RL, and classification. Unfortunately, these networks are quite difficult to interpret, and this limits their…

Machine Learning · Computer Science 2021-10-12 Sina Alemohammad , Hossein Babaei , CJ Barberan , Naiming Liu , Lorenzo Luzi , Blake Mason , Richard G. Baraniuk

While deep learning has achieved remarkable success across a wide range of applications, its theoretical understanding of representation learning remains limited. Deep neural kernels provide a principled framework to interpret…

Machine Learning · Computer Science 2025-11-11 Yong-Ming Tian , Shuang Liang , Shao-Qun Zhang , Feng-Lei Fan

In this article a surprising result is demonstrated using the neural tangent kernel. This kernel is defined as the inner product of the vector of the gradient of an underlying model evaluated at training points. This kernel is used to…

Artificial Intelligence · Computer Science 2021-04-14 Matt Calder

The design of neural architectures for structured objects is typically guided by experimental insights rather than a formal process. In this work, we appeal to kernels over combinatorial structures, such as sequences and graphs, to derive…

Neural and Evolutionary Computing · Computer Science 2017-10-31 Tao Lei , Wengong Jin , Regina Barzilay , Tommi Jaakkola

The study of deep neural networks (DNNs) in the infinite-width limit, via the so-called neural tangent kernel (NTK) approach, has provided new insights into the dynamics of learning, generalization, and the impact of initialization. One key…

Machine Learning · Computer Science 2021-06-16 Sina Alemohammad , Zichao Wang , Randall Balestriero , Richard Baraniuk

The Neural Tangent Kernel (NTK) is an important milestone in the ongoing effort to build a theory for deep learning. Its prediction that sufficiently wide neural networks behave as kernel methods, or equivalently as random feature models,…

Machine Learning · Computer Science 2020-06-25 Maxim Samarin , Volker Roth , David Belius

We investigate the connections between neural networks and simple building blocks in kernel space. In particular, using well established feature space tools such as direct sum, averaging, and moment lifting, we present an algebra for…

Deep kernel learning aims at designing nonlinear combinations of multiple standard elementary kernels by training deep networks. This scheme has proven to be effective, but intractable when handling large-scale datasets especially when the…

Computer Vision and Pattern Recognition · Computer Science 2018-05-01 Mingyuan Jiu , Hichem Sahbi

How well does a classic deep net architecture like AlexNet or VGG19 classify on a standard dataset such as CIFAR-10 when its width --- namely, number of channels in convolutional layers, and number of nodes in fully-connected internal…

Machine Learning · Computer Science 2019-11-05 Sanjeev Arora , Simon S. Du , Wei Hu , Zhiyuan Li , Ruslan Salakhutdinov , Ruosong Wang

Recent theoretical works based on the neural tangent kernel (NTK) have shed light on the optimization and generalization of over-parameterized networks, and partially bridge the gap between their practical success and classical learning…

Machine Learning · Computer Science 2020-08-10 Kyung-Su Kim , Aurélie C. Lozano , Eunho Yang

Yang (2020a) recently showed that the Neural Tangent Kernel (NTK) at initialization has an infinite-width limit for a large class of architectures including modern staples such as ResNet and Transformers. However, their analysis does not…

Machine Learning · Computer Science 2021-05-11 Greg Yang , Etai Littwin

We prove that a randomly initialized neural network of *any architecture* has its Tangent Kernel (NTK) converge to a deterministic limit, as the network widths tend to infinity. We demonstrate how to calculate this limit. In prior…

Machine Learning · Statistics 2020-12-01 Greg Yang

In computational design and fabrication, neural networks are becoming important surrogates for bulky forward simulations. A long-standing, intertwined question is that of inverse design: how to compute a design that satisfies a desired…

Graphics · Computer Science 2022-08-30 Navid Ansari , Hans-Peter Seidel , Vahid Babaei

Advanced deep neural networks (DNNs), designed by either human or AutoML algorithms, are growing increasingly complex. Diverse operations are connected by complicated connectivity patterns, e.g., various types of skip connections. Those…

Machine Learning · Computer Science 2022-10-13 Wuyang Chen , Wei Huang , Xinyu Gong , Boris Hanin , Zhangyang Wang

Past decades have witnessed a great interest in the distinction and connection between neural network learning and kernel learning. Recent advancements have made theoretical progress in connecting infinite-wide neural networks and Gaussian…

Machine Learning · Computer Science 2025-11-07 Shao-Qun Zhang , Zong-Yi Chen , Yong-Ming Tian , Xun Lu

The neural tangent kernel (NTK) has garnered significant attention as a theoretical framework for describing the behavior of large-scale neural networks. Kernel methods are theoretically well-understood and as a result enjoy algorithmic…

Machine Learning · Computer Science 2024-05-30 Jonathan Wenger , Felix Dangel , Agustinus Kristiadi

Deep learning's successes are often attributed to its ability to automatically discover new representations of the data, rather than relying on handcrafted features like other learning methods. We show, however, that deep networks learned…

Machine Learning · Computer Science 2020-12-02 Pedro Domingos

The Neural Tangent Kernel (NTK), defined as $\Theta_\theta^f(x_1, x_2) = \left[\partial f(\theta, x_1)\big/\partial \theta\right] \left[\partial f(\theta, x_2)\big/\partial \theta\right]^T$ where $\left[\partial f(\theta,…

Machine Learning · Computer Science 2022-06-20 Roman Novak , Jascha Sohl-Dickstein , Samuel S. Schoenholz

We explore the equivalence between neural networks and kernel methods by deriving the first exact representation of any finite-size parametric classification model trained with gradient descent as a kernel machine. We compare our exact…

Machine Learning · Computer Science 2023-08-10 Brian Bell , Michael Geyer , David Glickenstein , Amanda Fernandez , Juston Moore

We introduce neural networks for end-to-end differentiable proving of queries to knowledge bases by operating on dense vector representations of symbols. These neural networks are constructed recursively by taking inspiration from the…

Neural and Evolutionary Computing · Computer Science 2017-12-05 Tim Rocktäschel , Sebastian Riedel
‹ Prev 1 2 3 10 Next ›