English
Related papers

Related papers: Data-dependent Generalization Bounds via Variable-…

200 papers

We propose data-dependent uniform generalization bounds by approaching the problem from a PAC-Bayesian perspective. We first apply the PAC-Bayesian framework on "random sets" in a rigorous way, where the training algorithm is assumed to…

Machine Learning · Statistics 2025-02-11 Benjamin Dupuis , Paul Viallard , George Deligiannidis , Umut Simsekli

Modern neural networks are highly overparameterized, with capacity to substantially overfit to training data. Nevertheless, these networks often generalize well in practice. It has also been observed that trained networks can often be…

Machine Learning · Statistics 2019-02-26 Wenda Zhou , Victor Veitch , Morgane Austern , Ryan P. Adams , Peter Orbanz

Understanding generalization in modern machine learning settings has been one of the major challenges in statistical learning theory. In this context, recent years have witnessed the development of various generalization bounds suggesting…

Machine Learning · Statistics 2022-07-01 Milad Sefidgaran , Amin Gohari , Gaël Richard , Umut Şimşekli

We develop a unified Data Processing Inequality PAC-Bayesian framework -- abbreviated DPI-PAC-Bayesian -- for deriving generalization error bounds in the supervised learning setting. By embedding the Data Processing Inequality (DPI) into…

Information Theory · Computer Science 2025-08-26 Muhan Guan , Farhad Farokhi , Jingge Zhu

Generalization error (also known as the out-of-sample error) measures how well the hypothesis learned from training data generalizes to previously unseen data. Proving tight generalization error bounds is a central question in statistical…

Machine Learning · Computer Science 2020-03-03 Jian Li , Xuanyuan Luo , Mingda Qiao

We present a general approach to deriving bounds on the generalization error of randomized learning algorithms. Our approach can be used to obtain bounds on the average generalization error as well as bounds on its tail probabilities, both…

Information Theory · Computer Science 2020-09-10 Fredrik Hellström , Giuseppe Durisi

Deep neural networks generalize well despite being heavily overparameterized, in apparent contradiction with classical learning theory based on uniform convergence over fixed hypothesis spaces. Uniform bounds over the entire parameter space…

Machine Learning · Statistics 2026-05-15 Hubert Leroux , Jean Marcus , Julien Roger

Algorithm- and data-dependent generalization bounds are required to explain the generalization behavior of modern machine learning algorithms. In this context, there exists information theoretic generalization bounds that involve (various…

Machine Learning · Statistics 2023-07-07 Sarah Sachs , Tim van Erven , Liam Hodgkinson , Rajiv Khanna , Umut Simsekli

This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By developing a general theoretical framework, we establish a duality between…

Machine Learning · Statistics 2022-07-01 Eugenio Clerico , Amitis Shidani , George Deligiannidis , Arnaud Doucet

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez , Ragnar Thobaben , Mikael Skoglund

Generalization error bounds are essential to understanding machine learning algorithms. This paper presents novel expected generalization error upper bounds based on the average joint distribution between the output hypothesis and each…

Information Theory · Computer Science 2022-02-25 Gholamali Aminian , Yuheng Bu , Gregory Wornell , Miguel Rodrigues

Understanding the generalization behavior of deep neural networks remains a fundamental challenge in modern statistical learning theory. Among existing approaches, PAC-Bayesian norm-based bounds have demonstrated particular promise due to…

Machine Learning · Statistics 2026-01-14 Xinping Yi , Gaojie Jin , Xiaowei Huang , Shi Jin

Generalization in deep learning has been the topic of much recent theoretical and empirical research. Here we introduce desiderata for techniques that predict generalization errors for deep learning models in supervised learning. Such…

Machine Learning · Statistics 2020-12-10 Guillermo Valle-Pérez , Ard A. Louis

Deep neural networks generalize well on unseen data though the number of parameters often far exceeds the number of training examples. Recently proposed complexity measures have provided insights to understanding the generalizability in…

Machine Learning · Computer Science 2020-05-12 Jingling Li , Yanchao Sun , Jiahao Su , Taiji Suzuki , Furong Huang

One of the biggest issues in deep learning theory is the generalization ability of networks with huge model size. The classical learning theory suggests that overparameterized models cause overfitting. However, practically used large deep…

Machine Learning · Computer Science 2020-06-23 Taiji Suzuki , Hiroshi Abe , Tomoaki Nishimura

We present a general approach, based on exponential inequalities, to derive bounds on the generalization error of randomized learning algorithms. Using this approach, we provide bounds on the average generalization error as well as bounds…

Machine Learning · Computer Science 2023-03-10 Fredrik Hellström , Giuseppe Durisi

The ability of overparameterized deep networks to generalize well has been linked to the fact that stochastic gradient descent (SGD) finds solutions that lie in flat, wide minima in the training loss -- minima where the output of the…

Machine Learning · Computer Science 2019-06-03 Vaishnavh Nagarajan , J. Zico Kolter

In this work, we study the generalization capability of algorithms from an information-theoretic perspective. It has been shown that the expected generalization error of an algorithm is bounded from above by a function of the relative…

Information Theory · Computer Science 2021-10-27 Borja Rodríguez-Gálvez , Germán Bassi , Mikael Skoglund

Data-driven algorithms can adapt their internal structure or parameters to inputs from unknown application-specific distributions, by learning from a training sample of inputs. Several recent works have applied this approach to problems in…

Machine Learning · Computer Science 2022-06-17 Peter Bartlett , Piotr Indyk , Tal Wagner

Deep nets generalize well despite having more parameters than the number of training samples. Recent works try to give an explanation using PAC-Bayes and Margin-based analyses, but do not as yet result in sample complexity bounds better…

Machine Learning · Computer Science 2018-11-28 Sanjeev Arora , Rong Ge , Behnam Neyshabur , Yi Zhang
‹ Prev 1 2 3 10 Next ›