Related papers: Equations of States in Singular Statistical Estima…

Equations of States in Statistical Learning for a Nonparametrizable and Regular Case

Many learning machines that have hierarchical structure or hidden variables are now being used in information science, artificial intelligence, and bioinformatics. However, several learning machines used in such fields are not regular but…

Machine Learning · Computer Science 2015-05-13 Sumio Watanabe

Asymptotic Accuracy of Bayes Estimation for Latent Variables with Redundancy

Hierarchical parametric models consisting of observable and latent variables are widely used for unsupervised learning tasks. For example, a mixture model is a representative hierarchical model for clustering. From the statistical point of…

Machine Learning · Statistics 2014-01-24 Keisuke Yamazaki

Stochastic complexity of Bayesian networks

Bayesian networks are now being used in enormous fields, for example, diagnosis of a system, data mining, clustering and so on. In spite of their wide range of applications, the statistical properties have not yet been clarified, because…

Machine Learning · Computer Science 2012-12-12 Keisuke Yamazaki , Sumio Watanbe

Information-theoretic Characterizations of Generalization Error for the Gibbs Algorithm

Various approaches have been developed to upper bound the generalization error of a supervised learning algorithm. However, existing bounds are often loose and even vacuous when evaluated in practice. As a result, they may fail to…

Information Theory · Computer Science 2022-10-19 Gholamali Aminian , Yuheng Bu , Laura Toni , Miguel R. D. Rodrigues , Gregory W. Wornell

Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $\alpha$-weighted-ERM and two-stage-ERM. Our key result is an…

Machine Learning · Computer Science 2021-11-03 Yuheng Bu , Gholamali Aminian , Laura Toni , Miguel Rodrigues , Gregory Wornell

Characterizing the Generalization Error of Gibbs Algorithm with Symmetrized KL information

Bounding the generalization error of a supervised learning algorithm is one of the most important problems in learning theory, and various approaches have been developed. However, existing bounds are often loose and lack of guarantees. As a…

Machine Learning · Computer Science 2021-07-30 Gholamali Aminian , Yuheng Bu , Laura Toni , Miguel R. D. Rodrigues , Gregory Wornell

Statistical inference for quantum singular models

Deep learning has seen substantial achievements, with numerical and theoretical evidence suggesting that singularities of statistical models are considered a contributing factor to its performance. From this remarkable success of classical…

Quantum Physics · Physics 2024-11-26 Hiroshi Yano , Yota Maeda , Naoki Yamamoto

On the Generalization Error of Meta Learning for the Gibbs Algorithm

We analyze the generalization ability of joint-training meta learning algorithms via the Gibbs algorithm. Our exact characterization of the expected meta generalization error for the meta Gibbs algorithm is based on symmetrized KL…

Machine Learning · Computer Science 2023-04-28 Yuheng Bu , Harsha Vardhan Tetali , Gholamali Aminian , Miguel Rodrigues , Gregory Wornell

Asymptotic Behavior of Bayesian Generalization Error in Multinomial Mixtures

Multinomial mixtures are widely used in the information engineering field, however, their mathematical properties are not yet clarified because they are singular learning models. In fact, the models are non-identifiable and their Fisher…

Machine Learning · Computer Science 2022-03-15 Takumi Watanabe , Sumio Watanabe

Singular Learning Theory for Factor Analysis

Watanabe's singular learning theory provides a framework for asymptotic analysis of Bayesian model selection for statistical models with singularities, where traditional statistical regularity assumptions fail. Learning coefficients, also…

Statistics Theory · Mathematics 2025-11-20 Mathias Drton , Elizabeth Gross , Dimitra Kosta , Anton Leykin , Seth Sullivant , Daniel Windisch

Hypothesis Testing over Observable Regimes in Singular Models

Hypothesis testing in singular statistical models is often regarded as inherently problematic due to non-identifiability and degeneracy of the Fisher information. We show that the fundamental obstruction to testing in such models is not…

Statistics Theory · Mathematics 2026-03-02 Sean Plummer

Statistical Learning Theory of Quasi-Regular Cases

Many learning machines such as normal mixtures and layered neural networks are not regular but singular statistical models, because the map from a parameter to a probability distribution is not one-to-one. The conventional statistical…

Statistics Theory · Mathematics 2015-06-03 Koshi Yamada , Sumio Watanabe

A Bayesian information criterion for singular models

We consider approximate Bayesian model choice for model selection problems that involve models whose Fisher-information matrices may fail to be invertible along other competing submodels. Such singular models do not obey the regularity…

Methodology · Statistics 2016-03-24 Mathias Drton , Martyn Plummer

Learning under Model Misspecification: Applications to Variational and Ensemble methods

Virtually any model we use in machine learning to make predictions does not perfectly represent reality. So, most of the learning happens under model misspecification. In this work, we present a novel analysis of the generalization…

Machine Learning · Computer Science 2020-10-23 Andres R. Masegosa

The Generalization Error of Supervised Machine Learning Algorithms

In this paper, the method of gaps, a technique for deriving closed-form expressions in terms of information measures for the generalization error of supervised machine learning algorithms is introduced. The method relies on the notion of…

Machine Learning · Computer Science 2026-01-01 Samir M. Perlaza , Xinying Zou

Asymptotic Accuracy of Distribution-Based Estimation for Latent Variables

Hierarchical statistical models are widely employed in information science and data engineering. The models consist of two types of variables: observable variables that represent the given data and latent variables for the unobservable…

Machine Learning · Statistics 2014-02-21 Keisuke Yamazaki

Inferring the Gibbs state of a small quantum system

Gibbs states are familiar from statistical mechanics, yet their use is not limited to that domain. For instance, they also feature in the maximum entropy reconstruction of quantum states from incomplete measurement data. Outside the…

Quantum Physics · Physics 2011-07-04 Jochen Rau

Singularity structures and impacts on parameter estimation in finite mixtures of distributions

Singularities of a statistical model are the elements of the model's parameter space which make the corresponding Fisher information matrix degenerate. These are the points for which estimation techniques such as the maximum likelihood…

Statistics Theory · Mathematics 2019-07-25 Nhat Ho , XuanLong Nguyen

Information-theoretic Analysis of the Gibbs Algorithm: An Individual Sample Approach

Recent progress has shown that the generalization error of the Gibbs algorithm can be exactly characterized using the symmetrized KL information between the learned hypothesis and the entire training dataset. However, evaluating such a…

Information Theory · Computer Science 2024-10-17 Youheng Zhu , Yuheng Bu

Model Mis-specification and Algorithmic Bias

Machine learning algorithms are increasingly used to inform critical decisions. There is a growing concern about bias, that algorithms may produce uneven outcomes for individuals in different demographic groups. In this work, we measure…

Machine Learning · Computer Science 2021-06-01 Runshan Fu , Yangfan Liang , Peter Zhang