English
Related papers

Related papers: Notes on Deep Learning Theory

200 papers

A recent breakthrough in deep learning theory shows that the training of over-parameterized deep neural networks can be characterized by a kernel function called \textit{neural tangent kernel} (NTK). However, it is known that this type of…

Machine Learning · Computer Science 2020-10-07 Zixiang Chen , Yuan Cao , Quanquan Gu , Tong Zhang

These notes are based on a lecture delivered by NC on March 2021, as part of an advanced course in Princeton University on the mathematical understanding of deep learning. They present a theory (developed by NC, NR and collaborators) of…

Machine Learning · Computer Science 2024-11-07 Nadav Cohen , Noam Razin

This lecture note is intended to prepare early-year master's and PhD students in data science or a related discipline with foundational ideas in machine learning. It starts with basic ideas in modern machine learning with classification as…

Machine Learning · Computer Science 2025-05-08 Kyunghyun Cho

We discuss certain recent metric space methods and some of the possibilities these methods provide, with special focus on various generalizations of Lyapunov exponents originally appearing in the theory of dynamical systems and differential…

Dynamical Systems · Mathematics 2022-12-27 Anders Karlsson

In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics and generative models. In the first part, results on excess risks for neural networks are…

Machine Learning · Statistics 2024-09-17 Namjoon Suh , Guang Cheng

These notes were compiled as lecture notes for a course developed and taught at the University of the Southern California. They should be accessible to a typical engineering graduate student with a strong background in Applied Mathematics.…

Machine Learning · Computer Science 2023-01-04 Deep Ray , Orazio Pinti , Assad A. Oberai

Deep learning is usually described as an experiment-driven field under continuous criticizes of lacking theoretical foundations. This problem has been partially fixed by a large volume of literature which has so far not been well organized.…

Machine Learning · Computer Science 2021-03-12 Fengxiang He , Dacheng Tao

The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we…

Machine Learning · Computer Science 2022-06-29 Jongmin Lee , Joo Young Choi , Ernest K. Ryu , Albert No

These notes are based on lectures I gave at TASI 2024 on Physics for Machine Learning. The focus is on neural network theory, organized according to network expressivity, statistics, and dynamics. I present classic results such as the…

High Energy Physics - Theory · Physics 2024-08-02 Jim Halverson

Sequential training from task to task is becoming one of the major objects in deep learning applications such as continual learning and transfer learning. Nevertheless, it remains unclear under what conditions the trained model's…

Machine Learning · Statistics 2022-03-21 Ryo Karakida , Shotaro Akaho

In suitably initialized wide networks, small learning rates transform deep neural networks (DNNs) into neural tangent kernel (NTK) machines, whose training dynamics is well-approximated by a linear weight expansion of the network at…

Machine Learning · Computer Science 2020-10-29 Stanislav Fort , Gintare Karolina Dziugaite , Mansheej Paul , Sepideh Kharaghani , Daniel M. Roy , Surya Ganguli

These brief lecture notes cover the basics of neural networks and deep learning as well as their applications in the quantum domain, for physicists without prior knowledge. In the first part, we describe training using backpropagation,…

Quantum Physics · Physics 2021-06-02 Florian Marquardt

These lecture notes develop the theory of learning in deep and recurrent neuronal networks from the point of view of Bayesian inference. The aim is to enable the reader to understand typical computations found in the literature in this…

Disordered Systems and Neural Networks · Physics 2026-02-16 Moritz Helias , Javed Lindner , Lars Schutzeichel , Zohar Ringel

These are the lecture notes that accompanied the course of the same name that I taught at the Eindhoven University of Technology from 2021 to 2023. The course is intended as an introduction to neural networks for mathematics students at the…

Machine Learning · Computer Science 2024-03-11 Bart M. N. Smets

We present a non-asymptotic theory of generalization in deep learning where the empirical neural tangent kernel partitions the output space. In directions corresponding to signal, error dissipates rapidly; in the vast orthogonal dimensions…

Machine Learning · Computer Science 2026-05-05 Elon Litman , Gabe Guo

An in-depth exploration of object detection and semantic segmentation is provided, combining theoretical foundations with practical applications. State-of-the-art advancements in machine learning and deep learning are reviewed, focusing on…

Modern deep learning models employ considerably more parameters than required to fit the training data. Whereas conventional statistical wisdom suggests such models should drastically overfit, in practice these models generalize remarkably…

Machine Learning · Statistics 2020-08-18 Ben Adlam , Jeffrey Pennington

Among many unsolved puzzles in theories of Deep Neural Networks (DNNs), there are three most fundamental challenges that highly demand solutions, namely, expressibility, optimisability, and generalisability. Although there have been…

Machine Learning · Computer Science 2018-11-27 Hao Shen

In this paper, we explore the application of mean field theory, a technique from statistical physics, to deep metric learning and address the high training complexity commonly associated with conventional metric learning loss functions. By…

Machine Learning · Computer Science 2023-06-28 Takuya Furusawa

The ability of learning useful features is one of the major advantages of neural networks. Although recent works show that neural network can operate in a neural tangent kernel (NTK) regime that does not allow feature learning, many works…

Machine Learning · Computer Science 2024-11-06 Mo Zhou , Rong Ge
‹ Prev 1 2 3 10 Next ›