Related papers: Equations of States in Singular Statistical Estima…

Generalization for Least Squares Regression With Simple Spiked Covariances

Random matrix theory has proven to be a valuable tool in analyzing the generalization of linear models. However, the generalization properties of even two-layer neural networks trained by gradient descent remain poorly understood. To…

Statistics Theory · Mathematics 2024-10-21 Jiping Li , Rishi Sonthalia

Lower Bounds for Public-Private Learning under Distribution Shift

The most effective differentially private machine learning algorithms in practice rely on an additional source of purportedly public data. This paradigm is most interesting when the two sources combine to be more than the sum of their…

Machine Learning · Computer Science 2025-07-25 Amrith Setlur , Pratiksha Thaker , Jonathan Ullman

Learning the Truth From Only One Side of the Story

Learning under one-sided feedback (i.e., where we only observe the labels for examples we predicted positively on) is a fundamental problem in machine learning -- applications include lending and recommendation systems. Despite this, there…

Machine Learning · Computer Science 2020-10-14 Heinrich Jiang , Qijia Jiang , Aldo Pacchiano

Semi-parametric estimation and prediction intervals in state space models

Literatures in state space models focus on parametric inference and prediction, which fail if the state space model is not fully specified and the maximum likelihood estimation does not work. In this paper, we assume the state transition…

Statistics Theory · Mathematics 2020-12-15 Yunyi Zhang , Tingting Wang , Dimitris N. Politis

On the Posterior Computation Under the Dirichlet-Laplace Prior

Modern applications routinely collect high-dimensional data, leading to statistical models having more parameters than there are samples available. A common solution is to impose sparsity in parameter estimation, often using penalized…

Methodology · Statistics 2025-07-08 Paolo Onorati , David B. Dunson , Antonio Canale

Generalization bounds for averaged classifiers

We study a simple learning algorithm for binary classification. Instead of predicting with the best hypothesis in the hypothesis class, that is, the hypothesis that minimizes the training error, our algorithm predicts with a weighted…

Statistics Theory · Mathematics 2007-06-13 Yoav Freund , Yishay Mansour , Robert E. Schapire

Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors

We establish in-expectation and tail bounds on the generalization error of representation learning type algorithms. The bounds are in terms of the relative entropy between the distribution of the representations extracted from the training…

Machine Learning · Statistics 2025-03-21 Milad Sefidgaran , Abdellatif Zaidi , Piotr Krasnowski

An Accurate Standard Error Estimation for Quadratic Exponential Logistic Regressions by Applying Generalized Estimating Equations to Pseudo-Likelihoods

For a set of binary response variables, conditional mean models characterize the expected value of a response variable given the others and are popularly applied in longitudinal and network data analyses. The quadratic exponential binary…

Methodology · Statistics 2025-10-02 Ong Wei Yong , Lee Shao-Man , Hsueh Chia-Ming , Chang Sheng-Mao

Minimax rate of consistency for linear models with missing values

Missing values arise in most real-world data sets due to the aggregation of multiple sources and intrinsically missing information (sensor failure, unanswered questions in surveys...). In fact, the very nature of missing values usually…

Machine Learning · Statistics 2022-02-04 Alexis Ayme , Claire Boyer , Aymeric Dieuleveut , Erwan Scornet

Theoretical characterization of uncertainty in high-dimensional linear classification

Being able to reliably assess not only the \emph{accuracy} but also the \emph{uncertainty} of models' predictions is an important endeavour in modern machine learning. Even if the model generating the data and labels is known, computing the…

Machine Learning · Computer Science 2023-09-12 Lucas Clarté , Bruno Loureiro , Florent Krzakala , Lenka Zdeborová

Bayes meets Bernstein at the Meta Level: an Analysis of Fast Rates in Meta-Learning with PAC-Bayes

Bernstein's condition is a key assumption that guarantees fast rates in machine learning. For example, the Gibbs algorithm with prior $\pi$ has an excess risk in $O(d_{\pi}/n)$, as opposed to the standard $O(\sqrt{d_{\pi}/n})$, where $n$…

Machine Learning · Statistics 2025-03-03 Charles Riou , Pierre Alquier , Badr-Eddine Chérief-Abdellatif

Bridged Posterior: Optimization, Profile Likelihood and a New Approach to Generalized Bayes

Optimization is widely used in statistics, and often efficiently delivers point estimates on useful spaces involving structural constraints or combinatorial structure. To quantify uncertainty, Gibbs posterior exponentiates the negative loss…

Methodology · Statistics 2025-07-23 Cheng Zeng , Eleni Dilma , Jason Xu , Leo L Duan

Improved robust Bayes estimators of the error variance in linear models

We consider the problem of estimating the error variance in a general linear model when the error distribution is assumed to be spherically symmetric, but not necessary Gaussian. In particular we study the case of a scale mixture of…

Statistics Theory · Mathematics 2013-03-18 Yuzo Maruyama , William E. Strawderman

Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Standard Bayesian learning is known to have suboptimal generalization capabilities under misspecification and in the presence of outliers. PAC-Bayes theory demonstrates that the free energy criterion minimized by Bayesian learning is a…

Machine Learning · Computer Science 2023-04-25 Matteo Zecchin , Sangwoo Park , Osvaldo Simeone , Marios Kountouris , David Gesbert

On Posterior consistency of Bayesian Changepoint models

While there have been a lot of recent developments in the context of Bayesian model selection and variable selection for high dimensional linear models, there is not much work in the presence of change point in literature, unlike the…

Methodology · Statistics 2021-02-26 Nilabja Guha , Jyotishka Datta

Bayesian Calibration of Computer Models with Informative Failures

There are many practical difficulties in the calibration of computer models to experimental data. One such complication is the fact that certain combinations of the calibration inputs can cause the code to output data lacking fundamental…

Methodology · Statistics 2020-06-16 Peter W. Marcy , Curtis B. Storlie

Optimal Binary Classification Beyond Accuracy

The vast majority of statistical theory on binary classification characterizes performance in terms of accuracy. However, accuracy is known in many cases to poorly reflect the practical consequences of classification error, most famously in…

Statistics Theory · Mathematics 2022-09-27 Shashank Singh , Justin Khim

Can Bayesian Neural Networks Make Confident Predictions?

Bayesian inference promises a framework for principled uncertainty quantification of neural network predictions. Barriers to adoption include the difficulty of fully characterizing posterior distributions on network parameters and the…

Machine Learning · Statistics 2025-01-22 Katharine Fisher , Youssef Marzouk

Statistical mechanics of extensive-width Bayesian neural networks near interpolation

For three decades statistical mechanics has been providing a framework to analyse neural networks. However, the theoretically tractable models, e.g., perceptrons, random features models and kernel machines, or multi-index models and…

Machine Learning · Statistics 2025-06-02 Jean Barbier , Francesco Camilli , Minh-Toan Nguyen , Mauro Pastore , Rudy Skerk

Testing for Overfitting

High complexity models are notorious in machine learning for overfitting, a phenomenon in which models well represent data but fail to generalize an underlying data generating process. A typical procedure for circumventing overfitting…

Machine Learning · Statistics 2025-03-11 James Schmidt