Related papers: Equations of States in Singular Statistical Estima…

200 papers

Generalization error predictors (GEPs) aim to predict model performance on unseen distributions by deriving dataset-level error estimates from sample-level scores. However, GEPs often utilize disparate mechanisms (e.g., regressors,…

Machine Learning · Computer Science 2023-05-30 Puja Trivedi, Danai Koutra, Jayaraman J. Thiagarajan

State space models have long played an important role in signal processing. The Gaussian case can be treated algorithmically using the famous Kalman filter. Similarly since the 1970s there has been extensive application of Hidden Markov…

Statistics Theory · Mathematics 2007-06-13 Peter Bickel, Yaacov Ritov, Tobias Rydén

In statistical classification/multiple hypothesis testing and machine learning, a model distribution estimated from the training data is usually applied to replace the unknown true distribution in the Bayes decision rule, which introduces a…

Information Theory · Computer Science 2024-09-24 Zijian Yang, Vahe Eminyan, Ralf Schlüter, Hermann Ney

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen, Jason M. Klusowski, Michael W. Mahoney

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

Bayesian neural networks promise calibrated uncertainty but require $O(mn)$ parameters for standard mean-field Gaussian posteriors. We argue this cost is often unnecessary, particularly when weight matrices exhibit fast singular value…

Machine Learning · Statistics 2026-05-05 Mame Diarra Toure, David A. Stephens

We identify fundamental limitations in machine learning by demonstrating that non-trivial mixed-state phases of matter are computationally hard to learn. Focusing on unsupervised learning of distributions, we show that autoregressive neural…

Disordered Systems and Neural Networks · Physics 2026-03-19 Tarun Advaith Kumar, Yijian Zou, Amir-Reza Negari, Roger G. Melko, Timothy H. Hsieh

The estimation of the generalization error of classifiers often relies on a validation set. Such a set is hardly available in few-shot learning scenarios, a highly disregarded shortcoming in the field. In these scenarios, it is common to…

The problem of statistical inference for open chaotic systems measured with error is complicated by the interaction of the uncertainty introduced by chaos, and the various sources of random or external variation. Here a method of…

Applications · Statistics 2024-03-11 Michael LuValle

Discrete state spaces represent a major computational challenge to statistical inference, since the computation of normalisation constants requires summation over large or possibly infinite sets, which can be impractical. This paper…

Methodology · Statistics 2023-09-04 Takuo Matsubara, Jeremias Knoblauch, François-Xavier Briol, Chris. J. Oates

Hierarchical models in Bayesian inverse problems are characterized by an assumed prior probability distribution for the unknown state and measurement error precision, and hyper-priors for the prior parameters. Combining these probability…

Computation · Statistics 2019-06-10 Arvind K. Saibaba, Johnathan Bardsley, D. Andrew Brown, Alen Alexanderian

We study generalized Bayesian inference under misspecification, i.e. when the model is 'wrong but useful'. Generalized Bayes equips the likelihood with a learning rate $\eta$. We show that for generalized linear models (GLMs),…

Statistics Theory · Mathematics 2021-06-01 Rianne de Heide, Alisa Kirichenko, Nishant Mehta, Peter Grünwald

Adversarial training tends to result in models that are less accurate on natural (unperturbed) examples compared to standard models. This can be attributed to either an algorithmic shortcoming or a fundamental property of the training data…

Machine Learning · Computer Science 2021-07-02 Alireza Mousavi Hosseini, Amir Mohammad Abouei, Mohammad Hossein Rohban

Selecting between different dependency structures of a hidden Markov random field can be very challenging, due to the intractable normalizing constant in the likelihood. We answer this question with approximate Bayesian computation (ABC)…

Statistics Theory · Mathematics 2019-09-04 Julien Stoehr, Pierre Pudlo, Lionel Cucala

Standard evaluations of Bayesian deep learning methods assume that metric estimates are reliable, but we show this assumption fails under data scarcity. Method rankings are not only unreliable at small $n$, but also dataset-dependent in…

Machine Learning · Computer Science 2026-04-28 Qishi Zhan, Minxuan Hu, Guansu Wang, Jiaxin Liu, Liang He

Discrete and especially binary random variables occur in many machine learning models, notably in variational autoencoders with binary latent states and in stochastic binary networks. When learning such models, a key tool is an estimator of…

Machine Learning · Computer Science 2021-10-18 Alexander Shekhovtsov

We investigate Gaussian Universality for data distributions generated via diffusion models. By Gaussian Universality we mean that the test error of a generalized linear model $f(\mathbf{W})$ trained for a classification task on the…

Machine Learning · Statistics 2025-09-30 Reza Ghane, Anthony Bao, Danil Akhtiamov, Babak Hassibi

Nested error regression models are useful tools for analysis of grouped data, especially in the case of small area estimation. This paper suggests a nested error regression model using uncertain random effects in which the random effect in…

Methodology · Statistics 2017-02-28 Shonosuke Sugasawa, Tatsuya Kubokawa

Simulation-based inference (SBI) methods such as approximate Bayesian computation (ABC), synthetic likelihood, and neural posterior estimation (NPE) rely on simulating statistics to infer parameters of intractable likelihood models.…

Machine Learning · Statistics 2023-10-06 Daolang Huang, Ayush Bharti, Amauri Souza, Luigi Acerbi, Samuel Kaski

For many applications, an ensemble of base classifiers is an effective solution. The tuning of its parameters (number of classes, amount of data on which each classifier is to be trained, etc.) requires G, the generalization error of a…