Related papers: Misclassification bounds for PAC-Bayesian sparse d…

Layer Adaptive Node Selection in Bayesian Neural Networks: Statistical Guarantees and Implementation Details

Sparse deep neural networks have proven to be efficient for predictive model building in large-scale studies. Although several works have studied theoretical and numerical properties of sparse neural architectures, they have primarily…

Machine Learning · Statistics 2023-09-18 Sanket Jantre , Shrijita Bhattacharya , Tapabrata Maiti

PAC-Bayesian Learning of Optimization Algorithms

We apply the PAC-Bayes theory to the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-bounds) and explicit trade-off…

Machine Learning · Computer Science 2023-02-16 Michael Sucker , Peter Ochs

Misclassification excess risk bounds for PAC-Bayesian classification via convexified loss

PAC-Bayesian bounds have proven to be a valuable tool for deriving generalization bounds and for designing new learning algorithms in machine learning. However, it typically focus on providing generalization bounds with respect to a chosen…

Machine Learning · Statistics 2024-08-19 The Tien Mai

Generalization bounds for deep learning

Generalization in deep learning has been the topic of much recent theoretical and empirical research. Here we introduce desiderata for techniques that predict generalization errors for deep learning models in supervised learning. Such…

Machine Learning · Statistics 2020-12-10 Guillermo Valle-Pérez , Ard A. Louis

Improving Generalization of Complex Models under Unbounded Loss Using PAC-Bayes Bounds

Previous research on PAC-Bayes learning theory has focused extensively on establishing tight upper bounds for test errors. A recently proposed training procedure called PAC-Bayes training, updates the model toward minimizing these bounds.…

Machine Learning · Statistics 2024-10-22 Xitong Zhang , Avrajit Ghosh , Guangliang Liu , Rongrong Wang

Conditionally Gaussian PAC-Bayes

Recent studies have empirically investigated different methods to train stochastic neural networks on a classification task by optimising a PAC-Bayesian bound via stochastic gradient descent. Most of these procedures need to replace the…

Machine Learning · Computer Science 2022-07-01 Eugenio Clerico , George Deligiannidis , Arnaud Doucet

Learning-to-Optimize with PAC-Bayesian Guarantees: Theoretical Considerations and Practical Implementation

We use the PAC-Bayesian theory for the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-Bayesian bounds) and explicit…

Machine Learning · Computer Science 2025-02-26 Michael Sucker , Jalal Fadili , Peter Ochs

PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners

Multiclass neural networks are a common tool in modern unsupervised domain adaptation, yet an appropriate theoretical description for their non-uniform sample complexity is lacking in the adaptation literature. To fill this gap, we propose…

Machine Learning · Computer Science 2022-07-13 Anthony Sicilia , Katherine Atwell , Malihe Alikhani , Seong Jae Hwang

Learning under Model Misspecification: Applications to Variational and Ensemble methods

Virtually any model we use in machine learning to make predictions does not perfectly represent reality. So, most of the learning happens under model misspecification. In this work, we present a novel analysis of the generalization…

Machine Learning · Computer Science 2020-10-23 Andres R. Masegosa

PAC-Bayesian risk bounds for fully connected deep neural network with Gaussian priors

Deep neural networks (DNNs) have emerged as a powerful methodology with significant practical successes in fields such as computer vision and natural language processing. Recent works have demonstrated that sparsely connected DNNs with…

Statistics Theory · Mathematics 2025-05-08 The Tien Mai

Bayesian Sparse learning with preconditioned stochastic gradient MCMC and its applications

In this work, we propose a Bayesian type sparse deep learning algorithm. The algorithm utilizes a set of spike-and-slab priors for the parameters in the deep neural network. The hierarchical Bayesian mixture will be trained using an…

Numerical Analysis · Mathematics 2021-03-17 Yating Wang , Wei Deng , Lin Guang

Efficient hyperparameter optimization by way of PAC-Bayes bound minimization

Identifying optimal values for a high-dimensional set of hyperparameters is a problem that has received growing attention given its importance to large-scale machine learning applications such as neural architecture search. Recently…

Machine Learning · Statistics 2020-08-17 John J. Cherian , Andrew G. Taube , Robert T. McGibbon , Panagiotis Angelikopoulos , Guy Blanc , Michael Snarski , Daniel D. Richman , John L. Klepeis , David E. Shaw

PACMAN: PAC-style bounds accounting for the Mismatch between Accuracy and Negative log-loss

The ultimate performance of machine learning algorithms for classification tasks is usually measured in terms of the empirical error probability (or accuracy) based on a testing dataset. Whereas, these algorithms are optimized through the…

Machine Learning · Computer Science 2021-12-13 Matias Vera , Leonardo Rey Vega , Pablo Piantanida

Demystify Optimization and Generalization of Over-parameterized PAC-Bayesian Learning

PAC-Bayesian is an analysis framework where the training error can be expressed as the weighted average of the hypotheses in the posterior distribution whilst incorporating the prior knowledge. In addition to being a pure generalization…

Machine Learning · Computer Science 2022-02-07 Wei Huang , Chunrui Liu , Yilan Chen , Tianyu Liu , Richard Yi Da Xu

An Adaptive Empirical Bayesian Method for Sparse Deep Learning

We propose a novel adaptive empirical Bayesian method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternatively sampling from an adaptive…

Machine Learning · Statistics 2020-04-15 Wei Deng , Xiao Zhang , Faming Liang , Guang Lin

PAC Bayesian Performance Guarantees for Deep (Stochastic) Networks in Medical Imaging

Application of deep neural networks to medical imaging tasks has in some sense become commonplace. Still, a "thorn in the side" of the deep learning movement is the argument that deep networks are prone to overfitting and are thus unable to…

Machine Learning · Computer Science 2021-07-12 Anthony Sicilia , Xingchen Zhao , Anastasia Sosnovskikh , Seong Jae Hwang

Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Standard Bayesian learning is known to have suboptimal generalization capabilities under misspecification and in the presence of outliers. PAC-Bayes theory demonstrates that the free energy criterion minimized by Bayesian learning is a…

Machine Learning · Computer Science 2023-04-25 Matteo Zecchin , Sangwoo Park , Osvaldo Simeone , Marios Kountouris , David Gesbert

Sparse Deep Learning: A New Framework Immune to Local Traps and Miscalibration

Deep learning has powered recent successes of artificial intelligence (AI). However, the deep neural network, as the basic model of deep learning, has suffered from issues such as local traps and miscalibration. In this paper, we provide a…

Machine Learning · Statistics 2021-12-03 Yan Sun , Wenjun Xiong , Faming Liang

Overfitting and Generalizing with (PAC) Bayesian Prediction in Noisy Binary Classification

We consider a PAC-Bayes type learning rule for binary classification, balancing the training error of a randomized ''posterior'' predictor with its KL divergence to a pre-specified ''prior''. This can be seen as an extension of a modified…

Machine Learning · Statistics 2026-03-25 Xiaohan Zhu , Mesrob I. Ohannessian , Nathan Srebro

Toward Optimal Probabilistic Active Learning Using a Bayesian Approach

Gathering labeled data to train well-performing machine learning models is one of the critical challenges in many applications. Active learning aims at reducing the labeling costs by an efficient and effective allocation of costly labeling…

Machine Learning · Computer Science 2020-06-03 Daniel Kottke , Marek Herde , Christoph Sandrock , Denis Huseljic , Georg Krempl , Bernhard Sick