Related papers: Representation Based Complexity Measures for Predi…

NeurIPS 2020 Competition: Predicting Generalization in Deep Learning

Understanding generalization in deep learning is arguably one of the most important questions in deep learning. Deep learning has been successfully adopted to a large number of problems ranging from pattern recognition to complex decision…

Machine Learning · Computer Science 2020-12-16 Yiding Jiang, Pierre Foret, Scott Yak, Daniel M. Roy, Hossein Mobahi, Gintare Karolina Dziugaite, Samy Bengio, Suriya Gunasekar, Isabelle Guyon, Behnam Neyshabur

On margin-based generalization prediction in deep neural networks

Understanding generalization in deep neural networks is an active area of research. A promising avenue of exploration has been that of margin measurements: the shortest distance to the decision boundary for a given sample or that sample's…

Machine Learning · Computer Science 2024-05-29 Coenraad Mouton

Neural Complexity Measures

While various complexity measures for deep neural networks exist, specifying an appropriate measure capable of predicting and explaining generalization in deep networks has proven challenging. We propose Neural Complexity (NC), a…

Machine Learning · Computer Science 2020-10-26 Yoonho Lee, Juho Lee, Sung Ju Hwang, Eunho Yang, Seungjin Choi

Predicting Generalization in Deep Learning via Local Measures of Distortion

We study generalization in deep learning by appealing to complexity measures originally developed in approximation and information theory. While these concepts are challenged by the high-dimensional and data-defined nature of deep learning,…

Machine Learning · Statistics 2020-12-17 Abhejit Rajagopal, Vamshi C. Madala, Shivkumar Chandrasekaran, Peder E. Z. Larson

Generalization Analysis for Deep Contrastive Representation Learning

In this paper, we present generalization bounds for the unsupervised risk in the Deep Contrastive Representation Learning framework, which employs deep neural networks as representation functions. We approach this problem from two angles.…

Machine Learning · Statistics 2024-12-20 Nong Minh Hieu, Antoine Ledent, Yunwen Lei, Cheng Yeaw Ku

Understanding Generalization in Deep Learning via Tensor Methods

Deep neural networks generalize well on unseen data though the number of parameters often far exceeds the number of training examples. Recently proposed complexity measures have provided insights to understanding the generalizability in…

Machine Learning · Computer Science 2020-05-12 Jingling Li, Yanchao Sun, Jiahao Su, Taiji Suzuki, Furong Huang

Using noise resilience for ranking generalization of deep neural networks

Recent papers have shown that sufficiently overparameterized neural networks can perfectly fit even random labels. Thus, it is crucial to understand the underlying reason behind the generalization performance of a network on real-world…

Machine Learning · Computer Science 2020-12-17 Depen Morwani, Rahul Vashisht, Harish G. Ramaswamy

Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability

The primary objective of learning methods is generalization. Classic uniform generalization bounds, which rely on VC-dimension or Rademacher complexity, fail to explain the significant attribute that over-parameterized models in deep…

Machine Learning · Computer Science 2025-03-07 Lijia Yu, Yibo Miao, Yifan Zhu, Xiao-Shan Gao, Lijun Zhang

Generalization Error in Deep Learning

Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, it is still…

Machine Learning · Computer Science 2019-04-09 Daniel Jakubovitz, Raja Giryes, Miguel R. D. Rodrigues

Can Biases in ImageNet Models Explain Generalization?

The robust generalization of models to rare, in-distribution (ID) samples drawn from the long tail of the training distribution and to out-of-training-distribution (OOD) samples is one of the major challenges of current deep learning…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Paul Gavrikov, Janis Keuper

Estimating the Generalization in Deep Neural Networks via Sparsity

Generalization is the key capability for deep neural networks (DNNs). However, it is challenging to give a reliable measure of the generalization ability of a DNN via only its nature. In this paper, we propose a novel method for estimating…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Yang Zhao, Hao Zhang

Input margins can predict generalization too

Understanding generalization in deep neural networks is an active area of research. A promising avenue of exploration has been that of margin measurements: the shortest distance to the decision boundary for a given sample or its…

Machine Learning · Computer Science 2023-08-30 Coenraad Mouton, Marthinus W. Theunissen, Marelie H. Davel

On Predicting Generalization using GANs

Research on generalization bounds for deep networks seeks to give ways to predict test error using just the training dataset and the network parameters. While generalization bounds can give many insights about architecture design, training…

Machine Learning · Computer Science 2022-03-21 Yi Zhang, Arushi Gupta, Nikunj Saunshi, Sanjeev Arora

Implicit Regularization in Deep Learning

In an attempt to better understand generalization in deep learning, we study several possible explanations. We show that implicit regularization induced by the optimization method is playing a key role in generalization and success of deep…

Machine Learning · Computer Science 2017-09-11 Behnam Neyshabur

Quantifying the generalization error in deep learning in terms of data distribution and neural network smoothness

The accuracy of deep learning, i.e., deep neural networks, can be characterized by dividing the total error into three main types: approximation error, optimization error, and generalization error. Whereas there are some satisfactory…

Machine Learning · Statistics 2021-11-03 Pengzhan Jin, Lu Lu, Yifa Tang, George Em Karniadakis

Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

Characterizing precisely the asymptotic generalization error of neural networks using parameters that can be estimated efficiently is a crucial problem in machine learning, which relies heavily on heuristics and practitioners' intuition to…

Machine Learning · Computer Science 2026-05-22 David Perera, Victor Moura, Lais Isabelle Alves dos Santos, Michel F. C. Haddad, Flavio Figueiredo

Predicting the Generalization Gap in Deep Networks with Margin Distributions

As shown in recent research, deep neural networks can perfectly fit randomly labeled data, but with very poor accuracy on held out data. This phenomenon indicates that loss functions such as cross-entropy are not a reliable indicator of…

Machine Learning · Statistics 2019-06-13 Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

Stronger generalization bounds for deep nets via a compression approach

Deep nets generalize well despite having more parameters than the number of training samples. Recent works try to give an explanation using PAC-Bayes and Margin-based analyses, but do not as yet result in sample complexity bounds better…

Machine Learning · Computer Science 2018-11-28 Sanjeev Arora, Rong Ge, Behnam Neyshabur, Yi Zhang

Generalization and Expressivity for Deep Nets

Along with the rapid development of deep learning in practice, the theoretical explanations for its success become urgent. Generalization and expressivity are two widely used measurements to quantify theoretical behaviors of deep learning.…

Machine Learning · Computer Science 2018-03-26 Shao-Bo Lin

In Search of Robust Measures of Generalization

One of the principal scientific challenges in deep learning is explaining generalization, i.e., why the particular way the community now trains networks to achieve small training error also leads to small error on held-out data from the…

Machine Learning · Computer Science 2021-01-22 Gintare Karolina Dziugaite, Alexandre Drouin, Brady Neal, Nitarshan Rajkumar, Ethan Caballero, Linbo Wang, Ioannis Mitliagkas, Daniel M. Roy