Related papers: Equations of States in Singular Statistical Estima…

Asymptotic Accuracy of Bayesian Estimation for a Single Latent Variable

In data science and machine learning, hierarchical parametric models, such as mixture models, are often used. They contain two kinds of variables: observable variables, which represent the parts of the data that can be directly measured,…

Machine Learning · Statistics 2015-04-20 Keisuke Yamazaki

Stability of the Gibbs Sampler for Bayesian Hierarchical Models

We characterise the convergence of the Gibbs sampler which samples from the joint posterior distribution of parameters and missing data in hierarchical linear models with arbitrary symmetric error distributions. We show that the convergence…

Methodology · Statistics 2007-10-24 Omiros Papaspiliopoulos , Gareth Roberts

Calibrating generalized predictive distributions

In prediction problems, it is common to model the data-generating process and then use a model-based procedure, such as a Bayesian predictive distribution, to quantify uncertainty about the next observation. However, if the posited model is…

Methodology · Statistics 2021-07-06 Pei-Shien Wu , Ryan Martin

Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures

In statistical learning theory, a generalization bound usually involves a complexity measure imposed by the considered theoretical framework. This limits the scope of such bounds, as other forms of capacity measures or regularizations are…

Machine Learning · Statistics 2024-02-22 Paul Viallard , Rémi Emonet , Amaury Habrard , Emilie Morvant , Valentina Zantedeschi

Evaluating machine learning models in non-standard settings: An overview and new findings

Estimating the generalization error (GE) of machine learning models is fundamental, with resampling methods being the most common approach. However, in non-standard settings, particularly those where observations are not independently and…

Machine Learning · Statistics 2023-10-24 Roman Hornung , Malte Nalenz , Lennart Schneider , Andreas Bender , Ludwig Bothmann , Bernd Bischl , Thomas Augustin , Anne-Laure Boulesteix

Geostatistical Learning: Challenges and Opportunities

Statistical learning theory provides the foundation to applied machine learning, and its various successful applications in computer vision, natural language processing and other scientific domains. The theory, however, does not take into…

Machine Learning · Statistics 2021-02-18 Júlio Hoffimann , Maciel Zortea , Breno de Carvalho , Bianca Zadrozny

PAC-Bayes Bounds for Gibbs Posteriors via Singular Learning Theory

We derive explicit non-asymptotic PAC-Bayes generalization bounds for Gibbs posteriors, that is, data-dependent distributions over model parameters obtained by exponentially tilting a prior with the empirical risk. Unlike classical…

Machine Learning · Statistics 2026-04-21 Chenyang Wang , Yun Yang

Asymptotic Learning Curve and Renormalizable Condition in Statistical Learning Theory

Bayes statistics and statistical physics have the common mathematical structure, where the log likelihood function corresponds to the random Hamiltonian. Recently, it was discovered that the asymptotic learning curves in Bayes estimation…

Machine Learning · Computer Science 2015-05-18 Sumio Watanabe

Generalization Error of Generalized Linear Models in High Dimensions

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami , Mojtaba Sahraee-Ardakan , Parthe Pandit , Sundeep Rangan , Alyson K. Fletcher

A Widely Applicable Bayesian Information Criterion

A statistical model or a learning machine is called regular if the map taking a parameter to a probability distribution is one-to-one and if its Fisher information matrix is always positive definite. If otherwise, it is called singular. In…

Machine Learning · Computer Science 2012-09-03 Sumio Watanabe

Parameter identifiability of discrete Bayesian networks with hidden variables

Identifiability of parameters is an essential property for a statistical model to be useful in most settings. However, establishing parameter identifiability for Bayesian networks with hidden variables remains challenging. In the context of…

Statistics Theory · Mathematics 2014-06-04 Elizabeth S. Allman , John A. Rhodes , Elena Stanghellini , Marco Valtorta

Unknown Examples & Machine Learning Model Generalization

Over the past decades, researchers and ML practitioners have come up with better and better ways to build, understand and improve the quality of ML models, but mostly under the key assumption that the training data is distributed…

Machine Learning · Computer Science 2019-10-14 Yeounoh Chung , Peter J. Haas , Eli Upfal , Tim Kraska

Singularity, Misspecification, and the Convergence Rate of EM

A line of recent work has analyzed the behavior of the Expectation-Maximization (EM) algorithm in the well-specified setting, in which the population likelihood is locally strongly concave around its maximizing argument. Examples include…

Statistics Theory · Mathematics 2020-04-30 Raaz Dwivedi , Nhat Ho , Koulik Khamaru , Michael I. Jordan , Martin J. Wainwright , Bin Yu

Computationally sufficient statistics for Ising models

Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, computationally efficient algorithms for learning Gibbs distributions rely on access to full sample…

Machine Learning · Computer Science 2026-02-16 Abhijith Jayakumar , Shreya Shukla , Marc Vuffray , Andrey Y. Lokhov , Sidhant Misra

Mathematical Theory of Bayesian Statistics for Unknown Information Source

In statistical inference, uncertainty is unknown and all models are wrong. That is to say, a person who makes a statistical model and a prior distribution is simultaneously aware that both are fictional candidates. To study such cases,…

Machine Learning · Computer Science 2023-02-13 Sumio Watanabe

Learning the Dimensionality of Hidden Variables

A serious problem in learning probabilistic models is the presence of hidden variables. These variables are not observed, yet interact with several of the observed variables. Detecting hidden variables poses two problems: determining the…

Machine Learning · Computer Science 2013-01-14 Gal Elidan , Nir Friedman

Is Memorization Helpful or Harmful? Prior Information Sets the Threshold

We examine the connection between training error and generalization error for arbitrary estimating procedures, working in an overparameterized linear model under general priors in a Bayesian setup. We find determining factors inherent to…

Machine Learning · Statistics 2026-02-11 Chen Cheng , Rina Foygel Barber

An Impossibility Result for High Dimensional Supervised Learning

We study high-dimensional asymptotic performance limits of binary supervised classification problems where the class conditional densities are Gaussian with unknown means and covariances and the number of signal dimensions scales faster…

Machine Learning · Statistics 2016-11-17 Mohammad Hossein Rohban , Prakash Ishwar , Birant Orten , William C. Karl , Venkatesh Saligrama

State-Reification Networks: Improving Generalization by Modeling the Distribution of Hidden Representations

Machine learning promises methods that generalize well from finite labeled data. However, the brittleness of existing neural net approaches is revealed by notable failures, such as the existence of adversarial examples that are…

Machine Learning · Computer Science 2019-05-29 Alex Lamb , Jonathan Binas , Anirudh Goyal , Sandeep Subramanian , Ioannis Mitliagkas , Denis Kazakov , Yoshua Bengio , Michael C. Mozer

On the Validation of Gibbs Algorithms: Training Datasets, Test Datasets and their Aggregation

The dependence on training data of the Gibbs algorithm (GA) is analytically characterized. By adopting the expected empirical risk as the performance metric, the sensitivity of the GA is obtained in closed form. In this case, sensitivity is…

Machine Learning · Computer Science 2023-06-22 Samir M. Perlaza , Iñaki Esnaola , Gaetan Bisson , H. Vincent Poor