English

Notes on Deep Learning Theory

Machine Learning 2020-12-11 v1 Artificial Intelligence

Abstract

These are the notes for the lectures that I was giving during Fall 2020 at the Moscow Institute of Physics and Technology (MIPT) and at the Yandex School of Data Analysis (YSDA). The notes cover some aspects of initialization, loss landscape, generalization, and a neural tangent kernel theory. While many other topics (e.g. expressivity, a mean-field theory, a double descent phenomenon) are missing in the current version, we plan to add them in future revisions.

Cite

@article{arxiv.2012.05760,
  title  = {Notes on Deep Learning Theory},
  author = {Eugene A. Golikov},
  journal= {arXiv preprint arXiv:2012.05760},
  year   = {2020}
}

Comments

68 pages

R2 v1 2026-06-23T20:52:38.350Z