English
Related papers

Related papers: Conditional Mutual Information-Based Generalizatio…

200 papers

Meta-learning, or "learning to learn", refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key…

Machine Learning · Computer Science 2021-02-24 Sharu Theresa Jose , Osvaldo Simeone

We provide an information-theoretic framework for studying the generalization properties of machine learning algorithms. Our framework ties together existing approaches, including uniform convergence bounds and recent methods for adaptive…

Machine Learning · Computer Science 2020-06-22 Thomas Steinke , Lydia Zakynthinou

Recent work has established that the conditional mutual information (CMI) framework of Steinke and Zakynthinou (2020) is expressive enough to capture generalization guarantees in terms of algorithmic stability, VC dimension, and related…

Machine Learning · Computer Science 2023-03-28 Fredrik Hellström , Giuseppe Durisi

Meta-learning automatically infers an inductive bias by observing data from a number of related tasks. The inductive bias is encoded by hyperparameters that determine aspects of the model class or training algorithm, such as initialization…

Machine Learning · Computer Science 2020-11-10 Sharu Theresa Jose , Osvaldo Simeone , Giuseppe Durisi

We study the mutual information between (certain summaries of) the output of a learning algorithm and its $n$ training data, conditional on a supersample of $n+1$ i.i.d. data from which the training data is chosen at random without…

Machine Learning · Computer Science 2022-06-30 Mahdi Haghifam , Shay Moran , Daniel M. Roy , Gintare Karolina Dziugaite

We propose a new information-theoretic bound on generalization error based on a combination of the error decomposition technique of Bu et al. and the conditional mutual information (CMI) construction of Steinke and Zakynthinou. In a…

Information Theory · Computer Science 2021-01-01 Ruida Zhou , Chao Tian , Tie Liu

In this work, we investigate the expressiveness of the "conditional mutual information" (CMI) framework of Steinke and Zakynthinou (2020) and the prospect of using it to provide a unified framework for proving generalization bounds in the…

Information Theory · Computer Science 2021-11-18 Mahdi Haghifam , Gintare Karolina Dziugaite , Shay Moran , Daniel M. Roy

In recent years, information-theoretic generalization bounds have gained increasing attention for analyzing the generalization capabilities of meta-learning algorithms. However, existing results are confined to two-step bounds, failing to…

Machine Learning · Statistics 2025-10-14 Wen Wen , Tieliang Gong , Yuxin Dong , Zeyu Gao , Yong-Jin Liu

We present a new family of information-theoretic generalization bounds within the framework of conditional mutual information (CMI). Most of our results are established based on the leave-$m$-out (L$m$O) cross-validation error, with $m$…

Information Theory · Computer Science 2026-05-21 Yang Lu , Matthias Frey , Margreta Kuijper , Jingge Zhu

Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information…

Machine Learning · Statistics 2026-05-13 Futoshi Futami , Masahiro Fujisawa

In this paper, we leverage stochastic projection and lossy compression to establish new conditional mutual information (CMI) bounds on the generalization error of statistical learning algorithms. It is shown that these bounds are generally…

Machine Learning · Statistics 2025-10-28 Milad Sefidgaran , Kimia Nadjahi , Abdellatif Zaidi

We derive a novel information-theoretic analysis of the generalization property of meta-learning algorithms. Concretely, our analysis proposes a generic understanding of both the conventional learning-to-learn framework and the modern…

Machine Learning · Computer Science 2021-12-13 Qi Chen , Changjian Shui , Mario Marchand

In this work, we present a variety of novel information-theoretic generalization bounds for learning algorithms, from the supersample setting of Steinke & Zakynthinou (2020)-the setting of the "conditional mutual information" framework. Our…

Machine Learning · Statistics 2023-06-16 Ziqiao Wang , Yongyi Mao

We derive information theoretic generalization bounds for supervised learning algorithms based on a new measure of leave-one-out conditional mutual information (loo-CMI). Contrary to other CMI bounds, which are black-box bounds that do not…

Machine Learning · Computer Science 2022-07-04 Mohamad Rida Rammal , Alessandro Achille , Aditya Golatkar , Suhas Diggavi , Stefano Soatto

The concepts of conditional mutual information (CMI) and normalized conditional mutual information (NCMI) are introduced to measure the concentration and separation performance of a classification deep neural network (DNN) in the output…

Machine Learning · Computer Science 2023-09-19 En-Hui Yang , Shayan Mohajer Hamidi , Linfeng Ye , Renhao Tan , Beverly Yang

In this work, we introduce novel information-theoretic generalization bounds using the conditional $f$-information framework, an extension of the traditional conditional mutual information (MI) framework. We provide a generic approach to…

Machine Learning · Statistics 2024-10-31 Ziqiao Wang , Yongyi Mao

Meta learning automatically infers an inductive bias, that includes the hyperparameter of the base-learning algorithm, by observing data from a finite number of related tasks. This paper studies PAC-Bayes bounds on meta generalization gap.…

Machine Learning · Computer Science 2022-06-14 Arezou Rezazadeh

Standard meta-learning for representation learning aims to find a common representation to be shared across multiple tasks. The effectiveness of these methods is often limited when the nuances of the tasks' distribution cannot be captured…

Machine Learning · Computer Science 2021-03-31 Giulia Denevi , Massimiliano Pontil , Carlo Ciliberto

Existing generalization theories of supervised learning typically take a holistic approach and provide bounds for the expected generalization over the whole data distribution, which implicitly assumes that the model generalizes similarly…

Machine Learning · Computer Science 2024-01-08 Firas Laakom , Yuheng Bu , Moncef Gabbouj

The information-theoretic framework of Russo and J. Zou (2016) and Xu and Raginsky (2017) provides bounds on the generalization error of a learning algorithm in terms of the mutual information between the algorithm's output and the training…

Machine Learning · Statistics 2020-10-26 Mahdi Haghifam , Jeffrey Negrea , Ashish Khisti , Daniel M. Roy , Gintare Karolina Dziugaite
‹ Prev 1 2 3 10 Next ›