English
Related papers

Related papers: Linear regression through PAC-Bayesian truncation

200 papers

We consider the problem of predicting as well as the best linear combination of d given functions in least squares regression, and variants of this problem including constraints on the parameters of the linear combination. When the input…

Machine Learning · Statistics 2010-07-06 Jean-Yves Audibert , Olivier Catoni

We consider the problem of robustly predicting as well as the best linear combination of $d$ given functions in least squares regression, and variants of this problem including constraints on the parameters of the linear combination. For…

Statistics Theory · Mathematics 2012-02-24 Jean-Yves Audibert , Olivier Catoni

Empirically, the PAC-Bayesian analysis is known to produce tight risk bounds for practical machine learning algorithms. However, in its naive form, it can only deal with stochastic predictors while such predictors are rarely used and…

Machine Learning · Statistics 2019-11-22 Kohei Miyaguchi

We present a new PAC-Bayesian generalization bound. Standard bounds contain a $\sqrt{L_n \cdot \KL/n}$ complexity term which dominates unless $L_n$, the empirical error of the learning algorithm's randomized predictions, vanishes. We manage…

Machine Learning · Computer Science 2021-12-16 Zakaria Mhammedi , Peter D. Grunwald , Benjamin Guedj

This paper studies the truncation method from Alquier [1] to derive high-probability PAC-Bayes bounds for unbounded losses with heavy tails. Assuming that the $p$-th moment is bounded, the resulting bounds interpolate between a slow rate $1…

Machine Learning · Statistics 2024-03-26 Borja Rodríguez-Gálvez , Omar Rivasplata , Ragnar Thobaben , Mikael Skoglund

We apply the PAC-Bayes theory to the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-bounds) and explicit trade-off…

Machine Learning · Computer Science 2023-02-16 Michael Sucker , Peter Ochs

We study the problem of aggregation under the squared loss in the model of regression with deterministic design. We obtain sharp PAC-Bayesian risk bounds for aggregates defined via exponential weights, under general assumptions on the…

Statistics Theory · Mathematics 2013-03-25 Arnak Dalalyan , Alexandre Tsybakov

We use the PAC-Bayesian theory for the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-Bayesian bounds) and explicit…

Machine Learning · Computer Science 2025-02-26 Michael Sucker , Jalal Fadili , Peter Ochs

In this paper, we improve the PAC-Bayesian error bound for linear regression derived in Germain et al. [10]. The improvements are twofold. First, the proposed error bound is tighter, and converges to the generalization loss with a…

Machine Learning · Computer Science 2019-12-09 Vera Shalaeva , Alireza Fakhrizadeh Esfahani , Pascal Germain , Mihaly Petreczky

PAC-Bayesian bounds have proven to be a valuable tool for deriving generalization bounds and for designing new learning algorithms in machine learning. However, it typically focus on providing generalization bounds with respect to a chosen…

Machine Learning · Statistics 2024-08-19 The Tien Mai

In truncated linear regression, samples $(x,y)$ are shown only when the outcome $y$ falls inside a certain survival set $S^\star$ and the goal is to estimate the unknown $d$-dimensional regressor $w^\star$. This problem has a long history…

Machine Learning · Statistics 2026-05-25 Alexandros Kouridakis , Anay Mehrotra , Alkis Kalavasis , Constantine Caramanis

PAC-Bayesian is an analysis framework where the training error can be expressed as the weighted average of the hypotheses in the posterior distribution whilst incorporating the prior knowledge. In addition to being a pure generalization…

Machine Learning · Computer Science 2022-02-07 Wei Huang , Chunrui Liu , Yilan Chen , Tianyu Liu , Richard Yi Da Xu

The topics dicussed in this paper take their origin inthe estimation of the Gram matrix of a random vector from a sample made of n independent copies. They comprise the estimation of the covariance matrix and the study of least squares…

Statistics Theory · Mathematics 2016-03-17 Olivier Catoni

Motivated by the increasing use of and rapid changes in array technologies, we consider the prediction problem of fitting a linear regression relating a continuous outcome $Y$ to a large number of covariates $\mathbf {X}$, for example,…

Applications · Statistics 2014-01-13 Philip S. Boonstra , Bhramar Mukherjee , Jeremy M. G. Taylor

Variational approximation techniques and inference for stochastic models in machine learning has gained much attention the last years. Especially in the case of Gaussian Processes (GP) and their deep versions, Deep Gaussian Processes…

Statistics Theory · Mathematics 2019-09-24 Roman Föll , Ingo Steinwart

We study an approach to learning pruning masks by optimizing the expected loss of stochastic pruning masks, i.e., masks which zero out each weight independently with some weight-specific probability. We analyze the training dynamics of the…

Machine Learning · Statistics 2021-10-25 Soufiane Hayou , Bobby He , Gintare Karolina Dziugaite

We focus on a stochastic learning model where the learner observes a finite set of training examples and the output of the learning process is a data-dependent distribution over a space of hypotheses. The learned data-dependent distribution…

Machine Learning · Statistics 2020-12-29 Omar Rivasplata , Ilja Kuzborskij , Csaba Szepesvari , John Shawe-Taylor

We study the generalization error of randomized learning algorithms -- focusing on stochastic gradient descent (SGD) -- using a novel combination of PAC-Bayes and algorithmic stability. Importantly, our generalization bounds hold for all…

Machine Learning · Computer Science 2020-06-23 Ben London

Considering a probability distribution over parameters is known as an efficient strategy to learn a neural network with non-differentiable activation functions. We study the expectation of a probabilistic neural network as a predictor by…

Machine Learning · Computer Science 2023-04-17 Louis Fortier-Dubois , Gaël Letarte , Benjamin Leblanc , François Laviolette , Pascal Germain

In Bayesian regression models with categorical predictors, constraints are needed to ensure identifiability when using all $K$ levels of a factor. The sum-to-zero constraint is particularly useful as it allows coefficients to represent…

Methodology · Statistics 2025-04-15 Zhi Ling , Shozen Dan
‹ Prev 1 2 3 10 Next ›