Related papers: Information Complexity and Generalization Bounds

Generalization Bounds via Information Density and Conditional Information Density

We present a general approach, based on exponential inequalities, to derive bounds on the generalization error of randomized learning algorithms. Using this approach, we provide bounds on the average generalization error as well as bounds…

Machine Learning · Computer Science 2023-03-10 Fredrik Hellström , Giuseppe Durisi

Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

A fundamental question in theoretical machine learning is generalization. Over the past decades, the PAC-Bayesian approach has been established as a flexible framework to address the generalization capabilities of machine learning…

Machine Learning · Computer Science 2024-03-28 Fredrik Hellström , Giuseppe Durisi , Benjamin Guedj , Maxim Raginsky

Generalization Error Bounds via $m$th Central Moments of the Information Density

We present a general approach to deriving bounds on the generalization error of randomized learning algorithms. Our approach can be used to obtain bounds on the average generalization error as well as bounds on its tail probabilities, both…

Information Theory · Computer Science 2020-09-10 Fredrik Hellström , Giuseppe Durisi

Information-Theoretic Generalization Bounds for Transductive Learning and its Applications

In this paper, we establish generalization bounds for transductive learning algorithms in the context of information theory and PAC-Bayes, covering both the random sampling and the random splitting setting. First, we show that the…

Machine Learning · Computer Science 2025-01-22 Huayi Tang , Yong Liu

PAC-Bayesian Learning of Optimization Algorithms

We apply the PAC-Bayes theory to the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-bounds) and explicit trade-off…

Machine Learning · Computer Science 2023-02-16 Michael Sucker , Peter Ochs

An Information-Theoretic Approach to Generalization Theory

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez , Ragnar Thobaben , Mikael Skoglund

A New Family of Generalization Bounds Using Samplewise Evaluated CMI

We present a new family of information-theoretic generalization bounds, in which the training loss and the population loss are compared through a jointly convex function. This function is upper-bounded in terms of the disintegrated,…

Machine Learning · Computer Science 2023-03-28 Fredrik Hellström , Giuseppe Durisi

Tightening Mutual Information Based Bounds on Generalization Error

An information-theoretic upper bound on the generalization error of supervised learning algorithms is derived. The bound is constructed in terms of the mutual information between each individual training sample and the output of the…

Machine Learning · Computer Science 2020-08-06 Yuheng Bu , Shaofeng Zou , Venugopal V. Veeravalli

A DPI-PAC-Bayesian Framework for Generalization Bounds

We develop a unified Data Processing Inequality PAC-Bayesian framework -- abbreviated DPI-PAC-Bayesian -- for deriving generalization error bounds in the supervised learning setting. By embedding the Data Processing Inequality (DPI) into…

Information Theory · Computer Science 2025-08-26 Muhan Guan , Farhad Farokhi , Jingge Zhu

Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures

In statistical learning theory, a generalization bound usually involves a complexity measure imposed by the considered theoretical framework. This limits the scope of such bounds, as other forms of capacity measures or regularizations are…

Machine Learning · Statistics 2024-02-22 Paul Viallard , Rémi Emonet , Amaury Habrard , Emilie Morvant , Valentina Zantedeschi

Chained Generalisation Bounds

This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By developing a general theoretical framework, we establish a duality between…

Machine Learning · Statistics 2022-07-01 Eugenio Clerico , Amitis Shidani , George Deligiannidis , Arnaud Doucet

PAC-Bayes Bounds for Meta-learning with Data-Dependent Prior

By leveraging experience from previous tasks, meta-learning algorithms can achieve effective fast adaptation ability when encountering new tasks. However it is unclear how the generalization property applies to new tasks. Probably…

Machine Learning · Computer Science 2021-02-09 Tianyu Liu , Jie Lu , Zheng Yan , Guangquan Zhang

Generalization bounds for deep learning

Generalization in deep learning has been the topic of much recent theoretical and empirical research. Here we introduce desiderata for techniques that predict generalization errors for deep learning models in supervised learning. Such…

Machine Learning · Statistics 2020-12-10 Guillermo Valle-Pérez , Ard A. Louis

A General framework for PAC-Bayes Bounds for Meta-Learning

Meta learning automatically infers an inductive bias, that includes the hyperparameter of the base-learning algorithm, by observing data from a finite number of related tasks. This paper studies PAC-Bayes bounds on meta generalization gap.…

Machine Learning · Computer Science 2022-06-14 Arezou Rezazadeh

PAC-Bayes Un-Expected Bernstein Inequality

We present a new PAC-Bayesian generalization bound. Standard bounds contain a $\sqrt{L_n \cdot \KL/n}$ complexity term which dominates unless $L_n$, the empirical error of the learning algorithm's randomized predictions, vanishes. We manage…

Machine Learning · Computer Science 2021-12-16 Zakaria Mhammedi , Peter D. Grunwald , Benjamin Guedj

Information Theoretic Lower Bounds for Information Theoretic Upper Bounds

We examine the relationship between the mutual information between the output model and the empirical sample and the generalization of the algorithm in the context of stochastic convex optimization. Despite increasing interest in…

Machine Learning · Computer Science 2024-01-17 Roi Livni

User-friendly introduction to PAC-Bayes bounds

Aggregated predictors are obtained by making a set of basic predictors vote according to some weights, that is, to some probability distribution. Randomized predictors are obtained by sampling in a set of basic predictors, according to some…

Machine Learning · Statistics 2025-03-03 Pierre Alquier

Fast Rate Information-theoretic Bounds on Generalization Errors

The generalization error of a learning algorithm refers to the discrepancy between the loss of a learning algorithm on training data and that on unseen testing data. Various information-theoretic bounds on the generalization error have been…

Information Theory · Computer Science 2025-06-24 Xuetong Wu , Jonathan H. Manton , Uwe Aickelin , Jingge Zhu

Information-theoretic generalization bounds for black-box learning algorithms

We derive information-theoretic generalization bounds for supervised learning algorithms based on the information contained in predictions rather than in the output of the training algorithm. These bounds improve over the existing…

Machine Learning · Computer Science 2021-10-06 Hrayr Harutyunyan , Maxim Raginsky , Greg Ver Steeg , Aram Galstyan

Demystify Optimization and Generalization of Over-parameterized PAC-Bayesian Learning

PAC-Bayesian is an analysis framework where the training error can be expressed as the weighted average of the hypotheses in the posterior distribution whilst incorporating the prior knowledge. In addition to being a pure generalization…

Machine Learning · Computer Science 2022-02-07 Wei Huang , Chunrui Liu , Yilan Chen , Tianyu Liu , Richard Yi Da Xu