Related papers: The Generalization Error of Supervised Machine Lea…

200 papers

In this paper, the worst-case probability measure over the data is introduced as a tool for characterizing the generalization capabilities of machine learning algorithms. More specifically, the worst-case probability measure is a Gibbs…

Machine Learning · Computer Science 2023-12-20 Xinying Zou, Samir M. Perlaza, Iñaki Esnaola, Eitan Altman

In this work, we study the generalization capability of algorithms from an information-theoretic perspective. It has been shown that the expected generalization error of an algorithm is bounded from above by a function of the relative…

Information Theory · Computer Science 2021-10-27 Borja Rodríguez-Gálvez, Germán Bassi, Mikael Skoglund

Various approaches have been developed to upper bound the generalization error of a supervised learning algorithm. However, existing bounds are often loose and even vacuous when evaluated in practice. As a result, they may fail to…

Information Theory · Computer Science 2022-10-19 Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory W. Wornell

Bounding the generalization error of a supervised learning algorithm is one of the most important problems in learning theory, and various approaches have been developed. However, existing bounds are often loose and lack guarantees. As a…

Machine Learning · Computer Science 2021-07-30 Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory Wornell

Generalization error (also known as the out-of-sample error) measures how well the hypothesis learned from training data generalizes to previously unseen data. Proving tight generalization error bounds is a central question in statistical…

Machine Learning · Computer Science 2020-03-03 Jian Li, Xuanyuan Luo, Mingda Qiao

This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By developing a general theoretical framework, we establish a duality between…

Machine Learning · Statistics 2022-07-01 Eugenio Clerico, Amitis Shidani, George Deligiannidis, Arnaud Doucet

The dependence on training data of the Gibbs algorithm (GA) is analytically characterized. By adopting the expected empirical risk as the performance metric, the sensitivity of the GA is obtained in closed form. In this case, sensitivity is…

Machine Learning · Computer Science 2023-06-22 Samir M. Perlaza, Iñaki Esnaola, Gaetan Bisson, H. Vincent Poor

The goal of machine learning is to find models that minimize prediction error on data that has not yet been seen. Its operational paradigm assumes access to a dataset $S$ and articulates a scheme for evaluating how well a given model…

Machine Learning · Computer Science 2026-04-22 Maxim Raginsky, Benjamin Recht

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning…

Information Theory · Computer Science 2021-05-07 Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

In statistical learning theory, generalization error is used to quantify the degree to which a supervised machine learning algorithm may overfit to training data. Recent work [Xu and Raginsky (2017)] has established a bound on the…

Machine Learning · Computer Science 2018-01-16 Ankit Pensia, Varun Jog, Po-Ling Loh

We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $\alpha$-weighted-ERM and two-stage-ERM. Our key result is an…

Machine Learning · Computer Science 2021-11-03 Yuheng Bu, Gholamali Aminian, Laura Toni, Miguel Rodrigues, Gregory Wornell

We present a general approach to deriving bounds on the generalization error of randomized learning algorithms. Our approach can be used to obtain bounds on the average generalization error as well as bounds on its tail probabilities, both…

Information Theory · Computer Science 2020-09-10 Fredrik Hellström, Giuseppe Durisi

We provide an exact characterization of the expected generalization error (gen-error) for semi-supervised learning (SSL) with pseudo-labeling via the Gibbs algorithm. The gen-error is expressed in terms of the symmetrized KL information…

Information Theory · Computer Science 2023-06-16 Haiyun He, Gholamali Aminian, Yuheng Bu, Miguel Rodrigues, Vincent Y. F. Tan

Generalization error bounds are essential to understanding machine learning algorithms. This paper presents novel expected generalization error upper bounds based on the average joint distribution between the output hypothesis and each…

Information Theory · Computer Science 2022-02-25 Gholamali Aminian, Yuheng Bu, Gregory Wornell, Miguel Rodrigues

An information-theoretic upper bound on the generalization error of supervised learning algorithms is derived. The bound is constructed in terms of the mutual information between each individual training sample and the output of the…

Machine Learning · Computer Science 2020-08-06 Yuheng Bu, Shaofeng Zou, Venugopal V. Veeravalli

The ability of likelihood-based probabilistic models to generalize to unseen data is central to many machine learning applications such as lossless compression. In this work, we study the generalization of a popular class of probabilistic…

Machine Learning · Statistics 2022-10-18 Mingtian Zhang, Peter Hayes, David Barber

Data augmentation is one of the most widely used techniques to improve generalization in modern machine learning, often justified by its ability to promote invariance to label-irrelevant transformations. However, its theoretical role…

Machine Learning · Computer Science 2026-02-17 Abdelali Bouyahia, Frédéric LeBlanc, Mario Marchand

In this work, the probability of an event under some joint distribution is bounded by measuring it with the product of the marginals instead (which is typically easier to analyze) together with a measure of the dependence between the two…

Information Theory · Computer Science 2020-10-22 Amedeo Roberto Esposito, Michael Gastpar, Ibrahim Issa

We establish in-expectation and tail bounds on the generalization error of representation learning type algorithms. The bounds are in terms of the relative entropy between the distribution of the representations extracted from the training…

Machine Learning · Statistics 2025-03-21 Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski