Related papers: Notes on Deep Learning Theory

A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks

A recent breakthrough in deep learning theory shows that the training of over-parameterized deep neural networks can be characterized by a kernel function called \textit{neural tangent kernel} (NTK). However, it is known that this type of…

Machine Learning · Computer Science 2020-10-07 Zixiang Chen , Yuan Cao , Quanquan Gu , Tong Zhang

Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning

These notes are based on a lecture delivered by NC on March 2021, as part of an advanced course in Princeton University on the mathematical understanding of deep learning. They present a theory (developed by NC, NR and collaborators) of…

Machine Learning · Computer Science 2024-11-07 Nadav Cohen , Noam Razin

Machine Learning: a Lecture Note

This lecture note is intended to prepare early-year master's and PhD students in data science or a related discipline with foundational ideas in machine learning. It starts with basic ideas in modern machine learning with classification as…

Machine Learning · Computer Science 2025-05-08 Kyunghyun Cho

Generalized Lyapunov exponents and aspects of the theory of deep learning

We discuss certain recent metric space methods and some of the possibilities these methods provide, with special focus on various generalizations of Lyapunov exponents originally appearing in the theory of dynamical systems and differential…

Dynamical Systems · Mathematics 2022-12-27 Anders Karlsson

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models

In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics and generative models. In the first part, results on excess risks for neural networks are…

Machine Learning · Statistics 2024-09-17 Namjoon Suh , Guang Cheng

Deep Learning and Computational Physics (Lecture Notes)

These notes were compiled as lecture notes for a course developed and taught at the University of the Southern California. They should be accessible to a typical engineering graduate student with a strong background in Applied Mathematics.…

Machine Learning · Computer Science 2023-01-04 Deep Ray , Orazio Pinti , Assad A. Oberai

Recent advances in deep learning theory

Deep learning is usually described as an experiment-driven field under continuous criticizes of lacking theoretical foundations. This problem has been partially fixed by a large volume of literature which has so far not been well organized.…

Machine Learning · Computer Science 2021-03-12 Fengxiang He , Dacheng Tao

Neural Tangent Kernel Analysis of Deep Narrow Neural Networks

The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we…

Machine Learning · Computer Science 2022-06-29 Jongmin Lee , Joo Young Choi , Ernest K. Ryu , Albert No

TASI Lectures on Physics for Machine Learning

These notes are based on lectures I gave at TASI 2024 on Physics for Machine Learning. The focus is on neural network theory, organized according to network expressivity, statistics, and dynamics. I present classic results such as the…

High Energy Physics - Theory · Physics 2024-08-02 Jim Halverson

Learning Curves for Continual Learning in Neural Networks: Self-Knowledge Transfer and Forgetting

Sequential training from task to task is becoming one of the major objects in deep learning applications such as continual learning and transfer learning. Nevertheless, it remains unclear under what conditions the trained model's…

Machine Learning · Statistics 2022-03-21 Ryo Karakida , Shotaro Akaho

Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel

In suitably initialized wide networks, small learning rates transform deep neural networks (DNNs) into neural tangent kernel (NTK) machines, whose training dynamics is well-approximated by a linear weight expansion of the network at…

Machine Learning · Computer Science 2020-10-29 Stanislav Fort , Gintare Karolina Dziugaite , Mansheej Paul , Sepideh Kharaghani , Daniel M. Roy , Surya Ganguli

Machine Learning and Quantum Devices

These brief lecture notes cover the basics of neural networks and deep learning as well as their applications in the quantum domain, for physicists without prior knowledge. In the first part, we describe training using backpropagation,…

Quantum Physics · Physics 2021-06-02 Florian Marquardt

Lecture notes: From Gaussian processes to feature learning

These lecture notes develop the theory of learning in deep and recurrent neuronal networks from the point of view of Bayesian inference. The aim is to enable the reader to understand typical computations found in the literature in this…

Disordered Systems and Neural Networks · Physics 2026-02-16 Moritz Helias , Javed Lindner , Lars Schutzeichel , Zohar Ringel

Mathematics of Neural Networks (Lecture Notes Graduate Course)

These are the lecture notes that accompanied the course of the same name that I taught at the Eindhoven University of Technology from 2021 to 2023. The course is intended as an introduction to neural networks for mathematics students at the…

Machine Learning · Computer Science 2024-03-11 Bart M. N. Smets

A Theory of Generalization in Deep Learning

We present a non-asymptotic theory of generalization in deep learning where the empirical neural tangent kernel partitions the output space. In directions corresponding to signal, error dissipates rapidly; in the vast orthogonal dimensions…

Machine Learning · Computer Science 2026-05-05 Elon Litman , Gabe Guo

Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications

An in-depth exploration of object detection and semantic segmentation is provided, combining theoretical foundations with practical applications. State-of-the-art advancements in machine learning and deep learning are reviewed, focusing on…

Computer Vision and Pattern Recognition · Computer Science 2025-11-19 Jintao Ren , Ziqian Bi , Qian Niu , Xinyuan Song , Zekun Jiang , Junyu Liu , Benji Peng , Sen Zhang , Xuanhe Pan , Jinlang Wang , Keyu Chen , Caitlyn Heqi Yin , Pohsun Feng , Yizhu Wen , Tianyang Wang , Silin Chen , Ming Li , Jiawei Xu , Ming Liu

The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization

Modern deep learning models employ considerably more parameters than required to fit the training data. Whereas conventional statistical wisdom suggests such models should drastically overfit, in practice these models generalize remarkably…

Machine Learning · Statistics 2020-08-18 Ben Adlam , Jeffrey Pennington

A Differential Topological View of Challenges in Learning with Feedforward Neural Networks

Among many unsolved puzzles in theories of Deep Neural Networks (DNNs), there are three most fundamental challenges that highly demand solutions, namely, expressibility, optimisability, and generalisability. Although there have been…

Machine Learning · Computer Science 2018-11-27 Hao Shen

Mean Field Theory in Deep Metric Learning

In this paper, we explore the application of mean field theory, a technique from statistical physics, to deep metric learning and address the high training complexity commonly associated with conventional metric learning loss functions. By…

Machine Learning · Computer Science 2023-06-28 Takuya Furusawa

How Does Gradient Descent Learn Features -- A Local Analysis for Regularized Two-Layer Neural Networks

The ability of learning useful features is one of the major advantages of neural networks. Although recent works show that neural network can operate in a neural tangent kernel (NTK) regime that does not allow feature learning, many works…

Machine Learning · Computer Science 2024-11-06 Mo Zhou , Rong Ge