Related papers: Sample-Conditioned Hypothesis Stability Sharpens I…

Tighter Information-Theoretic Generalization Bounds from Supersamples

In this work, we present a variety of novel information-theoretic generalization bounds for learning algorithms, from the supersample setting of Steinke & Zakynthinou (2020)-the setting of the "conditional mutual information" framework. Our…

Machine Learning · Statistics 2023-06-16 Ziqiao Wang , Yongyi Mao

Information Theoretic Lower Bounds for Information Theoretic Upper Bounds

We examine the relationship between the mutual information between the output model and the empirical sample and the generalization of the algorithm in the context of stochastic convex optimization. Despite increasing interest in…

Machine Learning · Computer Science 2024-01-17 Roi Livni

Stability, Complexity and Data-Dependent Worst-Case Generalization Bounds

Providing generalization guarantees for stochastic optimization algorithms remains a key challenge in learning theory. Recently, numerous works demonstrated the impact of the geometric properties of optimization trajectories on…

Machine Learning · Computer Science 2026-01-23 Mario Tuci , Lennart Bastian , Benjamin Dupuis , Nassir Navab , Tolga Birdal , Umut Şimşekli

Boosting the Confidence of Generalization for $L_2$-Stable Randomized Learning Algorithms

Exponential generalization bounds with near-tight rates have recently been established for uniformly stable learning algorithms. The notion of uniform stability, however, is stringent in the sense that it is invariant to the data-generating…

Machine Learning · Statistics 2022-06-09 Xiao-Tong Yuan , Ping Li

Robustness Implies Generalization via Data-Dependent Generalization Bounds

This paper proves that robustness implies generalization via data-dependent generalization bounds. As a result, robustness and generalization are shown to be connected closely in a data-dependent manner. Our bounds improve previous bounds…

Machine Learning · Computer Science 2022-08-04 Kenji Kawaguchi , Zhun Deng , Kyle Luh , Jiaoyang Huang

Hypothesis Set Stability and Generalization

We present a study of generalization for data-dependent hypothesis sets. We give a general learning guarantee for data-dependent hypothesis sets based on a notion of transductive Rademacher complexity. Our main result is a generalization…

Machine Learning · Computer Science 2020-10-06 Dylan J. Foster , Spencer Greenberg , Satyen Kale , Haipeng Luo , Mehryar Mohri , Karthik Sridharan

Information-Theoretic Generalization Bounds for Stochastic Gradient Descent with Predictable Virtual Noise

Information-theoretic generalization bounds analyze stochastic optimization by relating expected generalization error to the mutual information between learned parameters and training data. Virtual perturbation analyses of SGD add auxiliary…

Machine Learning · Computer Science 2026-05-04 Mohammad Partohaghighi

Fine-Grained Analysis of Stability and Generalization for Stochastic Gradient Descent

Recently there are a considerable amount of work devoted to the study of the algorithmic stability and generalization for stochastic gradient descent (SGD). However, the existing stability analysis requires to impose restrictive assumptions…

Machine Learning · Computer Science 2020-06-16 Yunwen Lei , Yiming Ying

Time-Independent Information-Theoretic Generalization Bounds for SGLD

We provide novel information-theoretic generalization bounds for stochastic gradient Langevin dynamics (SGLD) under the assumptions of smoothness and dissipativity, which are widely used in sampling and non-convex optimization studies. Our…

Machine Learning · Computer Science 2023-11-03 Futoshi Futami , Masahiro Fujisawa

Algorithmic stability and hypothesis complexity

We introduce a notion of algorithmic stability of learning algorithms---that we term \emph{argument stability}---that captures stability of the hypothesis output by the learning algorithm in the normed space of functions from which…

Machine Learning · Statistics 2017-08-04 Tongliang Liu , Gábor Lugosi , Gergely Neu , Dacheng Tao

Optimizing Information-theoretical Generalization Bounds via Anisotropic Noise in SGLD

Recently, the information-theoretical framework has been proven to be able to obtain non-vacuous generalization bounds for large models trained by Stochastic Gradient Langevin Dynamics (SGLD) with isotropic noise. In this paper, we optimize…

Machine Learning · Computer Science 2021-11-04 Bohan Wang , Huishuai Zhang , Jieyu Zhang , Qi Meng , Wei Chen , Tie-Yan Liu

Formal limitations of sample-wise information-theoretic generalization bounds

Some of the tightest information-theoretic generalization bounds depend on the average information between the learned hypothesis and a single training example. However, these sample-wise bounds were derived only for expected generalization…

Machine Learning · Computer Science 2022-12-14 Hrayr Harutyunyan , Greg Ver Steeg , Aram Galstyan

Jensen-Shannon Information Based Characterization of the Generalization Error of Learning Algorithms

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, we propose a new information-theoretic based generalization error upper bound applicable to supervised learning scenarios.…

Information Theory · Computer Science 2021-01-11 Gholamali Aminian , Laura Toni , Miguel R. D. Rodrigues

Stability Analysis and Learning Bounds for Transductive Regression Algorithms

This paper uses the notion of algorithmic stability to derive novel generalization bounds for several families of transductive regression algorithms, both by using convexity and closed-form solutions. Our analysis helps compare the…

Machine Learning · Computer Science 2009-04-07 Corinna Cortes , Mehryar Mohri , Dmitry Pechyony , Ashish Rastogi

Generalization Bounds for Stochastic Saddle Point Problems

This paper studies the generalization bounds for the empirical saddle point (ESP) solution to stochastic saddle point (SSP) problems. For SSP with Lipschitz continuous and strongly convex-strongly concave objective functions, we establish…

Optimization and Control · Mathematics 2020-06-04 Junyu Zhang , Mingyi Hong , Mengdi Wang , Shuzhong Zhang

Data-Dependent Stability of Stochastic Gradient Descent

We establish a data-dependent notion of algorithmic stability for Stochastic Gradient Descent (SGD), and employ it to develop novel generalization bounds. This is in contrast to previous distribution-free algorithmic stability results for…

Machine Learning · Computer Science 2018-02-19 Ilja Kuzborskij , Christoph H. Lampert

Stability of SGD: Tightness Analysis and Improved Bounds

Stochastic Gradient Descent (SGD) based methods have been widely used for training large-scale machine learning models that also generalize well in practice. Several explanations have been offered for this generalization performance, a…

Machine Learning · Computer Science 2021-02-11 Yikai Zhang , Wenjia Zhang , Sammy Bald , Vamsi Pingali , Chao Chen , Mayank Goswami

Information-theoretic generalization bounds for black-box learning algorithms

We derive information-theoretic generalization bounds for supervised learning algorithms based on the information contained in predictions rather than in the output of the training algorithm. These bounds improve over the existing…

Machine Learning · Computer Science 2021-10-06 Hrayr Harutyunyan , Maxim Raginsky , Greg Ver Steeg , Aram Galstyan

On Unified and Sharpened CMI Bounds for Generalization Errors

We present a new family of information-theoretic generalization bounds within the framework of conditional mutual information (CMI). Most of our results are established based on the leave-$m$-out (L$m$O) cross-validation error, with $m$…

Information Theory · Computer Science 2026-05-21 Yang Lu , Matthias Frey , Margreta Kuijper , Jingge Zhu

Sharpened Generalization Bounds based on Conditional Mutual Information and an Application to Noisy, Iterative Algorithms

The information-theoretic framework of Russo and J. Zou (2016) and Xu and Raginsky (2017) provides bounds on the generalization error of a learning algorithm in terms of the mutual information between the algorithm's output and the training…

Machine Learning · Statistics 2020-10-26 Mahdi Haghifam , Jeffrey Negrea , Ashish Khisti , Daniel M. Roy , Gintare Karolina Dziugaite