Related papers: Equations of States in Singular Statistical Estima…

On the Efficacy of Generalization Error Prediction Scoring Functions

Generalization error predictors (GEPs) aim to predict model performance on unseen distributions by deriving dataset-level error estimates from sample-level scores. However, GEPs often utilize disparate mechanisms (e.g., regressors,…

Machine Learning · Computer Science 2023-05-30 Puja Trivedi, Danai Koutra, Jayaraman J. Thiagarajan

Hidden Markov and state space models: asymptotic analysis of exact and approximate methods for prediction, filtering, smoothing and statistical inference

State space models have long played an important role in signal processing. The Gaussian case can be treated algorithmically using the famous Kalman filter. Similarly since the 1970s there has been extensive application of Hidden Markov…

Statistics Theory · Mathematics 2007-06-13 Peter Bickel, Yaacov Ritov, Tobias Rydén

Refined Statistical Bounds for Classification Error Mismatches with Constrained Bayes Error

In statistical classification/multiple hypothesis testing and machine learning, a model distribution estimated from the training data is usually applied to replace the unknown true distribution in the Bayes decision rule, which introduces a…

Information Theory · Computer Science 2024-09-24 Zijian Yang, Vahe Eminyan, Ralf Schlüter, Hermann Ney

Good Classifiers are Abundant in the Interpolating Regime

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen, Jason M. Klusowski, Michael W. Mahoney

An Information-Theoretic Approach to Generalization Theory

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

Singular Bayesian Neural Networks

Bayesian neural networks promise calibrated uncertainty but require $O(mn)$ parameters for standard mean-field Gaussian posteriors. We argue this cost is often unnecessary, particularly when weight matrices exhibit fast singular value…

Machine Learning · Statistics 2026-05-05 Mame Diarra Toure, David A. Stephens

Unlearnable phases of matter

We identify fundamental limitations in machine learning by demonstrating that non-trivial mixed-state phases of matter are computationally hard to learn. Focusing on unsupervised learning of distributions, we show that autoregressive neural…

Disordered Systems and Neural Networks · Physics 2026-03-19 Tarun Advaith Kumar, Yijian Zou, Amir-Reza Negari, Roger G. Melko, Timothy H. Hsieh

A Statistical Model for Predicting Generalization in Few-Shot Classification

The estimation of the generalization error of classifiers often relies on a validation set. Such a set is hardly available in few-shot learning scenarios, a highly disregarded shortcoming in the field. In these scenarios, it is common to…

Machine Learning · Computer Science 2023-03-29 Yassir Bendou, Vincent Gripon, Bastien Pasdeloup, Lukas Mauch, Stefan Uhlich, Fabien Cardinaux, Ghouthi Boukli Hacene, Javier Alonso Garcia

Learning tapestries, a statistical learning substrate for open chaotic systems measured with error

The problem of statistical inference for open chaotic systems measured with error is complicated by the interaction of the uncertainty introduced by chaos, and the various sources of random or external variation. Here a method of…

Applications · Statistics 2024-03-11 Michael LuValle

Generalised Bayesian Inference for Discrete Intractable Likelihood

Discrete state spaces represent a major computational challenge to statistical inference, since the computation of normalisation constants requires summation over large or possibly infinite sets, which can be impractical. This paper…

Methodology · Statistics 2023-09-04 Takuo Matsubara, Jeremias Knoblauch, François-Xavier Briol, Chris J. Oates

Efficient Marginalization-based MCMC Methods for Hierarchical Bayesian Inverse Problems

Hierarchical models in Bayesian inverse problems are characterized by an assumed prior probability distribution for the unknown state and measurement error precision, and hyper-priors for the prior parameters. Combining these probability…

Computation · Statistics 2019-06-10 Arvind K. Saibaba, Johnathan Bardsley, D. Andrew Brown, Alen Alexanderian

Safe-Bayesian Generalized Linear Regression

We study generalized Bayesian inference under misspecification, i.e. when the model is 'wrong but useful'. Generalized Bayes equips the likelihood with a learning rate $\eta$. We show that for generalized linear models (GLMs),…

Statistics Theory · Mathematics 2021-06-01 Rianne de Heide, Alisa Kirichenko, Nishant Mehta, Peter Grünwald

The Interplay between Distribution Parameters and the Accuracy-Robustness Tradeoff in Classification

Adversarial training tends to result in models that are less accurate on natural (unperturbed) examples compared to standard models. This can be attributed to either an algorithmic shortcoming or a fundamental property of the training data…

Machine Learning · Computer Science 2021-07-02 Alireza Mousavi Hosseini, Amir Mohammad Abouei, Mohammad Hossein Rohban

Adaptive ABC model choice and geometric summary statistics for hidden Gibbs random fields

Selecting between different dependency structures of hidden Markov random field can be very challenging, due to the intractable normalizing constant in the likelihood. We answer this question with approximate Bayesian computation (ABC)…

Statistics Theory · Mathematics 2019-09-04 Julien Stoehr, Pierre Pudlo, Lionel Cucala

Unstable Rankings in Bayesian Deep Learning Evaluation

Standard evaluations of Bayesian deep learning methods assume that metric estimates are reliable, but we show this assumption fails under data scarcity. Method rankings are not only unreliable at small $n$, but also dataset-dependent in…

Machine Learning · Computer Science 2026-04-28 Qishi Zhan, Minxuan Hu, Guansu Wang, Jiaxin Liu, Liang He

Bias-Variance Tradeoffs in Single-Sample Binary Gradient Estimators

Discrete and especially binary random variables occur in many machine learning models, notably in variational autoencoders with binary latent states and in stochastic binary networks. When learning such models, a key tool is an estimator of…

Machine Learning · Computer Science 2021-10-18 Alexander Shekhovtsov

Gaussian Universality for Diffusion Models

We investigate Gaussian Universality for data distributions generated via diffusion models. By Gaussian Universality we mean that the test error of a generalized linear model $f(\mathbf{W})$ trained for a classification task on the…

Machine Learning · Statistics 2025-09-30 Reza Ghane, Anthony Bao, Danil Akhtiamov, Babak Hassibi

Bayesian Estimators in Uncertain Nested Error Regression Models

Nested error regression models are useful tools for analysis of grouped data, especially in the case of small area estimation. This paper suggests a nested error regression model using uncertain random effects in which the random effect in…

Methodology · Statistics 2017-02-28 Shonosuke Sugasawa, Tatsuya Kubokawa

Learning Robust Statistics for Simulation-based Inference under Model Misspecification

Simulation-based inference (SBI) methods such as approximate Bayesian computation (ABC), synthetic likelihood, and neural posterior estimation (NPE) rely on simulating statistics to infer parameters of intractable likelihood models.…

Machine Learning · Statistics 2023-10-06 Daolang Huang, Ayush Bharti, Amauri Souza, Luigi Acerbi, Samuel Kaski

Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles

For many applications, an ensemble of base classifiers is an effective solution. The tuning of its parameters (number of classes, amount of data on which each classifier is to be trained, etc.) requires G, the generalization error of a…

Machine Learning · Computer Science 2017-11-16 Dhruv Mahajan, Vivek Gupta, S Sathiya Keerthi, Sellamanickam Sundararajan, Shravan Narayanamurthy, Rahul Kidambi