English
Related papers

Related papers: Exploring Generalization in Deep Learning

200 papers

In an attempt to better understand generalization in deep learning, we study several possible explanations. We show that implicit regularization induced by the optimization method is playing a key role in generalization and success of deep…

Machine Learning · Computer Science 2017-09-11 Behnam Neyshabur

This paper provides theoretical insights into why and how deep learning can generalize well, despite its large capacity, complexity, possible algorithmic instability, nonrobustness, and sharp minima, responding to an open question in the…

Machine Learning · Statistics 2023-08-29 Kenji Kawaguchi , Leslie Pack Kaelbling , Yoshua Bengio

Understanding the generalization behaviour of deep neural networks is a topic of recent interest that has driven the production of many studies, notably the development and evaluation of generalization "explainability" measures that…

Machine Learning · Computer Science 2021-10-26 Jonathan Jaegerman , Khalil Damouni , Mahdi S. Hosseini , Konstantinos N. Plataniotis

The generalization performance of deep learning models for medical image analysis often decreases on images collected with different devices for data acquisition, device settings, or patient population. A better understanding of the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Aleksandar Vakanski , Min Xian

One of the principal scientific challenges in deep learning is explaining generalization, i.e., why the particular way the community now trains networks to achieve small training error also leads to small error on held-out data from the…

Along with the rapid development of deep learning in practice, the theoretical explanations for its success become urgent. Generalization and expressivity are two widely used measurements to quantify theoretical behaviors of deep learning.…

Machine Learning · Computer Science 2018-03-26 Shao-Bo Lin

Deep neural networks generalize well on unseen data though the number of parameters often far exceeds the number of training examples. Recently proposed complexity measures have provided insights to understanding the generalizability in…

Machine Learning · Computer Science 2020-05-12 Jingling Li , Yanchao Sun , Jiahao Su , Taiji Suzuki , Furong Huang

Understanding generalization in deep learning is arguably one of the most important questions in deep learning. Deep learning has been successfully adopted to a large number of problems ranging from pattern recognition to complex decision…

Deep Neural Networks can generalize despite being significantly overparametrized. Recent research has tried to examine this phenomenon from various view points and to provide bounds on the generalization error or measures predictive of the…

Machine Learning · Computer Science 2020-12-07 Parth Natekar , Manik Sharma

When several models have similar training scores, classical model selection heuristics follow Occam's razor and advise choosing the ones with least capacity. Yet, modern practice with large neural networks has often led to situations where…

Machine Learning · Computer Science 2022-11-29 Luis Sa-Couto , Jose Miguel Ramos , Andreas Wichert

A widely believed explanation for the remarkable generalization capacities of overparameterized neural networks is that the optimization algorithms used for training induce an implicit bias towards benign solutions. To grasp this…

Machine Learning · Computer Science 2025-12-19 Maria Matveev , Vit Fojtik , Hung-Hsu Chou , Gitta Kutyniok , Johannes Maly

Why do large neural network generalize so well on complex tasks such as image classification or speech recognition? What exactly is the role regularization for them? These are arguably among the most important open questions in machine…

Machine Learning · Statistics 2017-04-10 Pirmin Lemberger

Deep nets generalize well despite having more parameters than the number of training samples. Recent works try to give an explanation using PAC-Bayes and Margin-based analyses, but do not as yet result in sample complexity bounds better…

Machine Learning · Computer Science 2018-11-28 Sanjeev Arora , Rong Ge , Behnam Neyshabur , Yi Zhang

While there has been progress in developing non-vacuous generalization bounds for deep neural networks, these bounds tend to be uninformative about why deep learning works. In this paper, we develop a compression approach based on…

Machine Learning · Computer Science 2022-11-28 Sanae Lotfi , Marc Finzi , Sanyam Kapoor , Andres Potapczynski , Micah Goldblum , Andrew Gordon Wilson

This dissertation studies a fundamental open challenge in deep learning theory: why do deep networks generalize well even while being overparameterized, unregularized and fitting the training data to zero error? In the first part of the…

Machine Learning · Computer Science 2021-10-19 Vaishnavh Nagarajan

Despite the extreme popularity of deep learning in science and industry, its formal understanding is limited. This thesis puts forth notions of rank as key for developing a theory of deep learning, focusing on the fundamental aspects of…

Machine Learning · Computer Science 2024-12-31 Noam Razin

Sharpness of minima is a promising quantity that can correlate with generalization in deep networks and, when optimized during training, can improve generalization. However, standard sharpness is not invariant under reparametrizations of…

Machine Learning · Computer Science 2023-06-08 Maksym Andriushchenko , Francesco Croce , Maximilian Müller , Matthias Hein , Nicolas Flammarion

Generalization of deep networks has been of great interest in recent years, resulting in a number of theoretically and empirically motivated complexity measures. However, most papers proposing such measures study only a small set of models,…

Machine Learning · Computer Science 2019-12-05 Yiding Jiang , Behnam Neyshabur , Hossein Mobahi , Dilip Krishnan , Samy Bengio

Generalization remains a critical challenge in speech deepfake detection (SDD). While various approaches aim to improve robustness, generalization is typically assessed through performance metrics like equal error rate without a theoretical…

Audio and Speech Processing · Electrical Eng. & Systems 2025-06-16 Wen Huang , Xuechen Liu , Xin Wang , Junichi Yamagishi , Yanmin Qian

Despite the popularity and success of deep learning, there is limited understanding of when, how, and why neural networks generalize to unseen examples. Since learning can be seen as extracting information from data, we formally study…

Machine Learning · Computer Science 2023-06-29 Hrayr Harutyunyan
‹ Prev 1 2 3 10 Next ›