Related papers: Understanding Generalization through Decision Patt…

A New Perspective for Understanding Generalization Gap of Deep Neural Networks Trained with Large Batch Sizes

Deep neural networks (DNNs) are typically optimized using various forms of mini-batch gradient descent algorithm. A major motivation for mini-batch gradient descent is that with a suitably chosen batch size, available computing resources…

Machine Learning · Computer Science 2022-10-25 Oyebade K. Oyedotun , Konstantinos Papadopoulos , Djamila Aouada

Understanding Why Neural Networks Generalize Well Through GSNR of Parameters

As deep neural networks (DNNs) achieve tremendous success across many application domains, researchers tried to explore in many aspects on why they generalize well. In this paper, we provide a novel perspective on these issues using the…

Machine Learning · Computer Science 2020-02-25 Jinlong Liu , Guoqing Jiang , Yunzhi Bai , Ting Chen , Huayan Wang

A Survey on Assessing the Generalization Envelope of Deep Neural Networks: Predictive Uncertainty, Out-of-distribution and Adversarial Samples

Deep Neural Networks (DNNs) achieve state-of-the-art performance on numerous applications. However, it is difficult to tell beforehand if a DNN receiving an input will deliver the correct output since their decision criteria are usually…

Machine Learning · Computer Science 2021-09-07 Julia Lust , Alexandru Paul Condurache

Learning Trajectories are Generalization Indicators

This paper explores the connection between learning trajectories of Deep Neural Networks (DNNs) and their generalization capabilities when optimized using (stochastic) gradient descent algorithms. Instead of concentrating solely on the…

Machine Learning · Computer Science 2023-11-01 Jingwen Fu , Zhizheng Zhang , Dacheng Yin , Yan Lu , Nanning Zheng

Finding Diverse and Predictable Subgraphs for Graph Domain Generalization

This paper focuses on out-of-distribution generalization on graphs where performance drops due to the unseen distribution shift. Previous graph domain generalization works always resort to learning an invariant predictor among different…

Machine Learning · Computer Science 2022-06-22 Junchi Yu , Jian Liang , Ran He

Estimating the Generalization in Deep Neural Networks via Sparsity

Generalization is the key capability for deep neural networks (DNNs). However, it is challenging to give a reliable measure of the generalization ability of a DNN via only its nature. In this paper, we propose a novel method for estimating…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Yang Zhao , Hao Zhang

Deep learning generalizes because the parameter-function map is biased towards simple functions

Deep neural networks (DNNs) generalize remarkably well without explicit regularization even in the strongly over-parametrized regime where classical learning theory would instead predict that they would severely overfit. While many…

Machine Learning · Statistics 2019-04-23 Guillermo Valle-Pérez , Chico Q. Camargo , Ard A. Louis

Generalizing Neural Networks by Reflecting Deviating Data in Production

Trained with a sufficiently large training and testing dataset, Deep Neural Networks (DNNs) are expected to generalize. However, inputs may deviate from the training dataset distribution in real deployments. This is a fundamental issue with…

Machine Learning · Computer Science 2021-10-07 Yan Xiao , Yun Lin , Ivan Beschastnikh , Changsheng Sun , David S. Rosenblum , Jin Song Dong

Modeling Uncertain Feature Representation for Domain Generalization

Though deep neural networks have achieved impressive success on various vision tasks, obvious performance degradation still exists when models are tested in out-of-distribution scenarios. In addressing this limitation, we ponder that the…

Computer Vision and Pattern Recognition · Computer Science 2023-01-18 Xiaotong Li , Zixuan Hu , Jun Liu , Yixiao Ge , Yongxing Dai , Ling-Yu Duan

Deep Neural Networks as Iterated Function Systems and a Generalization Bound

Deep neural networks (DNNs) achieve remarkable performance on a wide range of tasks, yet their mathematical analysis remains fragmented: stability and generalization are typically studied in disparate frameworks and on a case-by-case basis.…

Machine Learning · Statistics 2026-01-29 Jonathan Vacher

Verifying the Generalization of Deep Learning to Out-of-Distribution Domains

Deep neural networks (DNNs) play a crucial role in the field of machine learning, demonstrating state-of-the-art performance across various application domains. However, despite their success, DNN-based models may occasionally exhibit…

Machine Learning · Computer Science 2024-07-02 Guy Amir , Osher Maayan , Tom Zelazny , Guy Katz , Michael Schapira

Deep Probabilistic Supervision for Image Classification

Supervised training of deep neural networks for classification typically relies on hard targets, which promote overconfidence and can limit calibration, generalization, and robustness. Self-distillation methods aim to mitigate this by…

Computer Vision and Pattern Recognition · Computer Science 2026-02-06 Anton Adelöw , Matteo Gamba , Atsuto Maki

Deep P-Spline: Theory, Fast Tuning, and Application

Deep neural networks (DNNs) have been widely applied to solve real-world regression problems. However, selecting optimal network structures remains a significant challenge. This study addresses this issue by linking neuron selection in DNNs…

Computation · Statistics 2025-09-30 Noah Yi-Ting Hung , Li-Hsiang Lin , Vince D. Calhoun

A Theoretical-Empirical Approach to Estimating Sample Complexity of DNNs

This paper focuses on understanding how the generalization error scales with the amount of the training data for deep neural networks (DNNs). Existing techniques in statistical learning require computation of capacity measures, such as VC…

Machine Learning · Computer Science 2021-05-06 Devansh Bisla , Apoorva Nandini Saridena , Anna Choromanska

Uncertainty Modeling for Out-of-Distribution Generalization

Though remarkable progress has been achieved in various vision tasks, deep neural networks still suffer obvious performance degradation when tested in out-of-distribution scenarios. We argue that the feature statistics (mean and standard…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Xiaotong Li , Yongxing Dai , Yixiao Ge , Jun Liu , Ying Shan , Ling-Yu Duan

Domain Generalization through Meta-Learning: A Survey

Deep neural networks (DNNs) have revolutionized artificial intelligence but often lack performance when faced with out-of-distribution (OOD) data, a common scenario due to the inevitable domain shifts in real-world applications. This…

Machine Learning · Computer Science 2024-08-23 Arsham Gholamzadeh Khoee , Yinan Yu , Robert Feldt

Biased Generalization in Diffusion Models

Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce novel samples, with evaluation largely driven by held-out performance and perceived sample quality. In…

Machine Learning · Computer Science 2026-03-05 Jerome Garnier-Brun , Luca Biggio , Davide Beltrame , Marc Mézard , Luca Saglietti

Statistical Physics of Deep Neural Networks: Generalization Capability, Beyond the Infinite Width, and Feature Learning

Deep Neural Networks (DNNs) excel at many tasks, often rivaling or surpassing human performance. Yet their internal processes remain elusive, frequently described as "black boxes." While performance can be refined experimentally, achieving…

Disordered Systems and Neural Networks · Physics 2025-02-03 Sebastiano Ariosto

On the Generalization Behavior of Deep Residual Networks From a Dynamical System Perspective

Deep neural networks (DNNs) have significantly advanced machine learning, with model depth playing a central role in their successes. The dynamical system modeling approach has recently emerged as a powerful framework, offering new…

Machine Learning · Computer Science 2026-02-25 Jinshu Huang , Mingfei Sun , Chunlin Wu

Dynamic Continual Learning: Harnessing Parameter Uncertainty for Improved Network Adaptation

When fine-tuning Deep Neural Networks (DNNs) to new data, DNNs are prone to overwriting network parameters required for task-specific functionality on previously learned tasks, resulting in a loss of performance on those tasks. We propose…

Machine Learning · Computer Science 2025-01-22 Christopher Angelini , Nidhal Bouaynaya