English
Related papers

Related papers: Equations of States in Singular Statistical Estima…

200 papers

In data science and machine learning, hierarchical parametric models, such as mixture models, are often used. They contain two kinds of variables: observable variables, which represent the parts of the data that can be directly measured,…

Machine Learning · Statistics 2015-04-20 Keisuke Yamazaki

We characterise the convergence of the Gibbs sampler which samples from the joint posterior distribution of parameters and missing data in hierarchical linear models with arbitrary symmetric error distributions. We show that the convergence…

Methodology · Statistics 2007-10-24 Omiros Papaspiliopoulos , Gareth Roberts

In prediction problems, it is common to model the data-generating process and then use a model-based procedure, such as a Bayesian predictive distribution, to quantify uncertainty about the next observation. However, if the posited model is…

Methodology · Statistics 2021-07-06 Pei-Shien Wu , Ryan Martin

In statistical learning theory, a generalization bound usually involves a complexity measure imposed by the considered theoretical framework. This limits the scope of such bounds, as other forms of capacity measures or regularizations are…

Machine Learning · Statistics 2024-02-22 Paul Viallard , Rémi Emonet , Amaury Habrard , Emilie Morvant , Valentina Zantedeschi

Estimating the generalization error (GE) of machine learning models is fundamental, with resampling methods being the most common approach. However, in non-standard settings, particularly those where observations are not independently and…

Statistical learning theory provides the foundation to applied machine learning, and its various successful applications in computer vision, natural language processing and other scientific domains. The theory, however, does not take into…

Machine Learning · Statistics 2021-02-18 Júlio Hoffimann , Maciel Zortea , Breno de Carvalho , Bianca Zadrozny

We derive explicit non-asymptotic PAC-Bayes generalization bounds for Gibbs posteriors, that is, data-dependent distributions over model parameters obtained by exponentially tilting a prior with the empirical risk. Unlike classical…

Machine Learning · Statistics 2026-04-21 Chenyang Wang , Yun Yang

Bayes statistics and statistical physics have the common mathematical structure, where the log likelihood function corresponds to the random Hamiltonian. Recently, it was discovered that the asymptotic learning curves in Bayes estimation…

Machine Learning · Computer Science 2015-05-18 Sumio Watanabe

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami , Mojtaba Sahraee-Ardakan , Parthe Pandit , Sundeep Rangan , Alyson K. Fletcher

A statistical model or a learning machine is called regular if the map taking a parameter to a probability distribution is one-to-one and if its Fisher information matrix is always positive definite. If otherwise, it is called singular. In…

Machine Learning · Computer Science 2012-09-03 Sumio Watanabe

Identifiability of parameters is an essential property for a statistical model to be useful in most settings. However, establishing parameter identifiability for Bayesian networks with hidden variables remains challenging. In the context of…

Statistics Theory · Mathematics 2014-06-04 Elizabeth S. Allman , John A. Rhodes , Elena Stanghellini , Marco Valtorta

Over the past decades, researchers and ML practitioners have come up with better and better ways to build, understand and improve the quality of ML models, but mostly under the key assumption that the training data is distributed…

Machine Learning · Computer Science 2019-10-14 Yeounoh Chung , Peter J. Haas , Eli Upfal , Tim Kraska

A line of recent work has analyzed the behavior of the Expectation-Maximization (EM) algorithm in the well-specified setting, in which the population likelihood is locally strongly concave around its maximizing argument. Examples include…

Statistics Theory · Mathematics 2020-04-30 Raaz Dwivedi , Nhat Ho , Koulik Khamaru , Michael I. Jordan , Martin J. Wainwright , Bin Yu

Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, computationally efficient algorithms for learning Gibbs distributions rely on access to full sample…

Machine Learning · Computer Science 2026-02-16 Abhijith Jayakumar , Shreya Shukla , Marc Vuffray , Andrey Y. Lokhov , Sidhant Misra

In statistical inference, uncertainty is unknown and all models are wrong. That is to say, a person who makes a statistical model and a prior distribution is simultaneously aware that both are fictional candidates. To study such cases,…

Machine Learning · Computer Science 2023-02-13 Sumio Watanabe

A serious problem in learning probabilistic models is the presence of hidden variables. These variables are not observed, yet interact with several of the observed variables. Detecting hidden variables poses two problems: determining the…

Machine Learning · Computer Science 2013-01-14 Gal Elidan , Nir Friedman

We examine the connection between training error and generalization error for arbitrary estimating procedures, working in an overparameterized linear model under general priors in a Bayesian setup. We find determining factors inherent to…

Machine Learning · Statistics 2026-02-11 Chen Cheng , Rina Foygel Barber

We study high-dimensional asymptotic performance limits of binary supervised classification problems where the class conditional densities are Gaussian with unknown means and covariances and the number of signal dimensions scales faster…

Machine Learning · Statistics 2016-11-17 Mohammad Hossein Rohban , Prakash Ishwar , Birant Orten , William C. Karl , Venkatesh Saligrama

Machine learning promises methods that generalize well from finite labeled data. However, the brittleness of existing neural net approaches is revealed by notable failures, such as the existence of adversarial examples that are…

The dependence on training data of the Gibbs algorithm (GA) is analytically characterized. By adopting the expected empirical risk as the performance metric, the sensitivity of the GA is obtained in closed form. In this case, sensitivity is…

Machine Learning · Computer Science 2023-06-22 Samir M. Perlaza , Iñaki Esnaola , Gaetan Bisson , H. Vincent Poor