English
Related papers

Related papers: Generalization Error Bounds for Noisy, Iterative A…

200 papers

We adopt an information-theoretic framework to analyze the generalization behavior of the class of iterative, noisy learning algorithms. This class is particularly suitable for study under information-theoretic metrics as the algorithms are…

Machine Learning · Computer Science 2023-07-20 Ibrahim Issa , Amedeo Roberto Esposito , Michael Gastpar

Machine learning models trained by different optimization algorithms under different data distributions can exhibit distinct generalization behaviors. In this paper, we analyze the generalization of models trained by noisy iterative…

Machine Learning · Statistics 2022-12-29 Hao Wang , Rui Gao , Flavio P. Calmon

Generalization error (also known as the out-of-sample error) measures how well the hypothesis learned from training data generalizes to previously unseen data. Proving tight generalization error bounds is a central question in statistical…

Machine Learning · Computer Science 2020-03-03 Jian Li , Xuanyuan Luo , Mingda Qiao

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning…

Information Theory · Computer Science 2021-05-07 Gholamali Aminian , Laura Toni , Miguel R. D. Rodrigues

An information-theoretic upper bound on the generalization error of supervised learning algorithms is derived. The bound is constructed in terms of the mutual information between each individual training sample and the output of the…

Machine Learning · Computer Science 2020-08-06 Yuheng Bu , Shaofeng Zou , Venugopal V. Veeravalli

This paper explores the generalization characteristics of iterative learning algorithms with bounded updates for non-convex loss functions, employing information-theoretic techniques. Our key contribution is a novel bound for the…

Machine Learning · Computer Science 2023-10-17 Jingwen Fu , Nanning Zheng

Many machine learning tasks can be formulated as Regularized Empirical Risk Minimization (R-ERM), and solved by optimization algorithms such as gradient descent (GD), stochastic gradient descent (SGD), and stochastic variance reduction…

Machine Learning · Statistics 2016-09-28 Qi Meng , Yue Wang , Wei Chen , Taifeng Wang , Zhi-Ming Ma , Tie-Yan Liu

We study the generalization error of stochastic learning algorithms from an information-theoretic perspective, with a particular emphasis on deriving sharper bounds for differentially private algorithms. It is well known that the…

Information Theory · Computer Science 2026-04-20 Yanxiao Liu , Chun Hei Michael Shiu , Lele Wang , Deniz Gündüz

The information-theoretic framework of Russo and J. Zou (2016) and Xu and Raginsky (2017) provides bounds on the generalization error of a learning algorithm in terms of the mutual information between the algorithm's output and the training…

Machine Learning · Statistics 2020-10-26 Mahdi Haghifam , Jeffrey Negrea , Ashish Khisti , Daniel M. Roy , Gintare Karolina Dziugaite

We derive upper bounds on the generalization error of a learning algorithm in terms of the mutual information between its input and output. The bounds provide an information-theoretic understanding of generalization in learning problems,…

Machine Learning · Computer Science 2017-11-07 Aolin Xu , Maxim Raginsky

Generalization error bounds for deep neural networks trained by stochastic gradient descent (SGD) are derived by combining a dynamical control of an appropriate parameter norm and the Rademacher complexity estimate based on parameter norms.…

Machine Learning · Computer Science 2023-05-30 Mingze Wang , Chao Ma

Many learning algorithms can be represented as Markov processes, and understanding their generalization error is a central topic in learning theory. For specific continuous-time noisy algorithms, a prominent analysis technique relies on…

Machine Learning · Statistics 2026-03-06 Benjamin Dupuis , Maxime Haddouche , George Deligiannidis , Umut Simsekli

We present a general approach to deriving bounds on the generalization error of randomized learning algorithms. Our approach can be used to obtain bounds on the average generalization error as well as bounds on its tail probabilities, both…

Information Theory · Computer Science 2020-09-10 Fredrik Hellström , Giuseppe Durisi

The generalization error of a learning algorithm refers to the discrepancy between the loss of a learning algorithm on training data and that on unseen testing data. Various information-theoretic bounds on the generalization error have been…

Information Theory · Computer Science 2025-06-24 Xuetong Wu , Jonathan H. Manton , Uwe Aickelin , Jingge Zhu

The generalization error (risk) of a supervised statistical learning algorithm quantifies its prediction ability on previously unseen data. Inspired by exponential tilting, \citet{li2020tilted} proposed the {\it tilted empirical risk} (TER)…

Machine Learning · Statistics 2025-06-10 Gholamali Aminian , Amir R. Asadi , Tian Li , Ahmad Beirami , Gesine Reinert , Samuel N. Cohen

Algorithm-dependent generalization error bounds are central to statistical learning theory. A learning algorithm may use a large hypothesis space, but the limited number of iterations controls its model capacity and generalization error.…

Machine Learning · Computer Science 2017-07-20 Wenlong Mou , Liwei Wang , Xiyu Zhai , Kai Zheng

In this work, we unify several expected generalization error bounds based on random subsets using the framework developed by Hellstr\"om and Durisi [1]. First, we recover the bounds based on the individual sample mutual information from Bu…

Information Theory · Computer Science 2021-07-27 Borja Rodríguez-Gálvez , Germán Bassi , Ragnar Thobaben , Mikael Skoglund

In this work, we study the generalization capability of algorithms from an information-theoretic perspective. It has been shown that the expected generalization error of an algorithm is bounded from above by a function of the relative…

Information Theory · Computer Science 2021-10-27 Borja Rodríguez-Gálvez , Germán Bassi , Mikael Skoglund

Meta-learning, or "learning to learn", refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key…

Machine Learning · Computer Science 2021-02-24 Sharu Theresa Jose , Osvaldo Simeone

We study the generalization properties of the popular stochastic optimization method known as stochastic gradient descent (SGD) for optimizing general non-convex loss functions. Our main contribution is providing upper bounds on the…

Machine Learning · Computer Science 2021-08-17 Gergely Neu , Gintare Karolina Dziugaite , Mahdi Haghifam , Daniel M. Roy
‹ Prev 1 2 3 10 Next ›