Related papers: Equations of States in Singular Statistical Estima…

Deep Learning is Singular, and That's Good

In singular models, the optimal set of parameters forms an analytic set with singularities and classical statistical inference cannot be applied to such models. This is significant for deep learning as neural networks are singular and thus…

Machine Learning · Computer Science 2023-12-05 Daniel Murfet , Susan Wei , Mingming Gong , Hui Li , Jesse Gell-Redman , Thomas Quella

A Simplicity Bubble Problem in Formal-Theoretic Learning Systems

When mining large datasets in order to predict new data, limitations of the principles behind statistical machine learning pose a serious challenge not only to the Big Data deluge, but also to the traditional assumptions that data…

Information Theory · Computer Science 2023-04-26 Felipe S. Abrahão , Hector Zenil , Fabio Porto , Michael Winter , Klaus Wehmuth , Itala M. L. D'Ottaviano

Generalization of Quantum Machine Learning Models Using Quantum Fisher Information Metric

Generalization is the ability of machine learning models to make accurate predictions on new data by learning from training data. However, understanding generalization of quantum machine learning models has been a major challenge. Here, we…

Quantum Physics · Physics 2024-08-07 Tobias Haug , M. S. Kim

Bayesian Sparse Linear Regression with Unknown Symmetric Error

We study full Bayesian procedures for sparse linear regression when errors have a symmetric but otherwise unknown distribution. The unknown error distribution is endowed with a symmetrized Dirichlet process mixture of Gaussians. For the…

Statistics Theory · Mathematics 2019-03-26 Minwoo Chae , Lizhen Lin , David B. Dunson

Learning under Distribution Mismatch and Model Misspecification

We study learning algorithms when there is a mismatch between the distributions of the training and test datasets of a learning algorithm. The effect of this mismatch on the generalization error and model misspecification are quantified.…

Information Theory · Computer Science 2022-08-11 Saeed Masiha , Amin Gohari , Mohammad Hossein Yassaee , Mohammad Reza Aref

Insufficient Gibbs Sampling

In some applied scenarios, the availability of complete data is restricted, often due to privacy concerns; only aggregated, robust and inefficient statistics derived from the data are made accessible. These robust statistics are not…

Methodology · Statistics 2024-02-23 Antoine Luciano , Christian P. Robert , Robin J. Ryder

The Exact Asymptotic Form of Bayesian Generalization Error in Latent Dirichlet Allocation

Latent Dirichlet allocation (LDA) obtains essential information from data by using Bayesian inference. It is applied to knowledge discovery via dimension reducing and clustering in many fields. However, its generalization error had not been…

Machine Learning · Statistics 2021-01-26 Naoki Hayashi

On PAC-Bayesian Bounds for Random Forests

Existing guarantees in terms of rigorous upper bounds on the generalization error for the original random forest algorithm, one of the most frequently used machine learning methods, are unsatisfying. We discuss and evaluate various…

Machine Learning · Computer Science 2019-03-07 Stephan Sloth Lorenzen , Christian Igel , Yevgeny Seldin

Generalization in Quantum Machine Learning: a Quantum Information Perspective

Quantum classification and hypothesis testing are two tightly related subjects, the main difference being that the former is data driven: how to assign to quantum states $\rho(x)$ the corresponding class $c$ (or hypothesis) is learnt from…

Quantum Physics · Physics 2021-11-30 Leonardo Banchi , Jason Pereira , Stefano Pirandola

Fundamental Limits of Matrix Sensing: Exact Asymptotics, Universality, and Applications

In the matrix sensing problem, one wishes to reconstruct a matrix from (possibly noisy) observations of its linear projections along given directions. We consider this model in the high-dimensional limit: while previous works on this model…

Machine Learning · Statistics 2025-11-13 Yizhou Xu , Antoine Maillard , Lenka Zdeborová , Florent Krzakala

Symmetries and Expressive Requirements for Learning General Policies

State symmetries play an important role in planning and generalized planning. In the first case, state symmetries can be used to reduce the size of the search; in the second, to reduce the size of the training set. In the case of general…

Artificial Intelligence · Computer Science 2024-09-25 Dominik Drexler , Simon Ståhlberg , Blai Bonet , Hector Geffner

Adaptive Synaptic Failure Enables Sampling from Posterior Predictive Distributions in the Brain

Bayesian interpretations of neural processing require that biological mechanisms represent and operate upon probability distributions in accordance with Bayes' theorem. Many have speculated that synaptic failure constitutes a mechanism of…

Neurons and Cognition · Quantitative Biology 2022-10-05 Kevin McKee , Ian Crandell , Rishidev Chaudhuri , Randall O'Reilly

Improved Information Theoretic Generalization Bounds for Distributed and Federated Learning

We consider information-theoretic bounds on expected generalization error for statistical learning problems in a networked setting. In this setting, there are $K$ nodes, each with its own independent dataset, and the models from each node…

Information Theory · Computer Science 2024-01-17 L. P. Barnes , Alex Dytso , H. V. Poor

Information Complexity and Generalization Bounds

We present a unifying picture of PAC-Bayesian and mutual information-based upper bounds on the generalization error of randomized learning algorithms. As we show, Tong Zhang's information exponential inequality (IEI) gives a general recipe…

Machine Learning · Computer Science 2021-10-26 Pradeep Kr. Banerjee , Guido Montúfar

Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data

Statistical models are central to machine learning with broad applicability across a range of downstream tasks. The models are controlled by free parameters that are typically estimated from data by maximum-likelihood estimation or…

Machine Learning · Computer Science 2023-08-16 Vaidotas Simkus , Benjamin Rhodes , Michael U. Gutmann

A Primer on Causal and Statistical Dataset Biases for Fair and Robust Image Analysis

Machine learning methods often fail when deployed in the real world. Worse still, they fail in high-stakes situations and across socially sensitive lines. These issues have a chilling effect on the adoption of machine learning methods in…

Machine Learning · Computer Science 2025-09-05 Charles Jones , Ben Glocker

Asymptotic Theory for Regularized System Identification Part I: Empirical Bayes Hyper-parameter Estimator

Regularized system identification is the major advance in system identification in the last decade. Although many promising results have been achieved, it is far from complete and there are still many key problems to be solved. One of them…

Systems and Control · Electrical Eng. & Systems 2023-04-05 Yue Ju , Biqiang Mu , Lennart Ljung , Tianshi Chen

A Good Measure for Bayesian Inference

The Gaussian theory of errors has been generalized to situations, where the Gaussian distribution and, hence, the Gaussian rules of error propagation are inadequate. The generalizations are based on Bayes' theorem and a suitable measure.…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Hanns L. Harney

Inconsistency of Bayesian Inference for Misspecified Linear Models, and a Proposal for Repairing It

We empirically show that Bayesian inference can be inconsistent under misspecification in simple linear regression problems, both in a model averaging/selection and in a Bayesian ridge regression setting. We use the standard linear model,…

Statistics Theory · Mathematics 2018-10-30 Peter Grünwald , Thijs van Ommen

Weighted Particle-Based Optimization for Efficient Generalized Posterior Calibration

In the realm of statistical learning, the increasing volume of accessible data and increasing model complexity necessitate robust methodologies. This paper explores two branches of robust Bayesian methods in response to this trend. The…

Methodology · Statistics 2024-12-02 Masahiro Tanaka