English
Related papers

Related papers: Accurate Bayesian Data Classification without Hype…

200 papers

\noindent Hyper-parameter selection is a central practical problem in modern machine learning, governing regularization strength, model capacity, and robustness choices. Cross-validation is often computationally prohibitive at scale, while…

Machine Learning · Statistics 2025-12-24 Hedibert Lopes , Nick Polson , Vadim Sokolov

Classifiers based on probabilistic graphical models are very effective. In continuous domains, maximum likelihood is usually used to assess the predictions of those classifiers. When data is scarce, this can easily lead to overfitting. In…

Machine Learning · Computer Science 2013-08-29 Victor Bellon , Jesus Cerquides , Ivo Grosse

A new methodology for model determination in decomposable graphical Gaussian models is developed. The Bayesian paradigm is used and, for each given graph, a hyper inverse Wishart prior distribution on the covariance matrix is considered.…

Computation · Statistics 2015-03-13 Sophie Donnet , Jean-Michel Marin

The behavior of many Bayesian models used in machine learning critically depends on the choice of prior distributions, controlled by some hyperparameters that are typically selected by Bayesian optimization or cross-validation. This…

Machine Learning · Statistics 2023-10-09 Eliezer de Souza da Silva , Tomasz Kuśmierczyk , Marcelo Hartmann , Arto Klami

Semi-supervised learning is a model training method that uses both labeled and unlabeled data. This paper proposes a fully Bayes semi-supervised learning algorithm that can be applied to any multi-category classification problem. We assume…

Machine Learning · Statistics 2024-07-22 Rui Zhu , Shuvrarghya Ghosh , Subhashis Ghosal

In high-dimensional Bayesian statistics, various methods have been developed, including prior distributions that induce parameter sparsity to handle many parameters. Yet, these approaches often overlook the rich spectral structure of the…

Statistics Theory · Mathematics 2025-05-06 Tomoya Wakayama , Masaaki Imaizumi

We discuss Bayesian inference for a known-mean Gaussian model with a compound symmetric variance-covariance matrix. Since the space of such matrices is a linear subspace of that of positive definite matrices, we utilize the methods of…

Methodology · Statistics 2023-03-20 Zachary M. Pisano

High dimensional data analysis is known to be as a challenging problem. In this article, we give a theoretical analysis of high dimensional classification of Gaussian data which relies on a geometrical analysis of the error measure. It…

Statistics Theory · Mathematics 2008-07-10 Robin Girard

In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets…

Methodology · Statistics 2023-08-16 Yabo Niu , Yang Ni , Debdeep Pati , Bani K. Mallick

Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (nhst). Yet the nhst tests suffer from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We…

Machine Learning · Computer Science 2016-11-23 Giorgio Corani , Alessio Benavoli , Janez Demšar , Francesca Mangili , Marco Zaffalon

Multivariate categorical data occur in many applications of machine learning. One of the main difficulties with these vectors of categorical variables is sparsity. The number of possible observations grows exponentially with vector length,…

Machine Learning · Statistics 2015-03-10 Yarin Gal , Yutian Chen , Zoubin Ghahramani

We consider Bayesian model selection in generalized linear models that are high-dimensional, with the number of covariates p being large relative to the sample size n, but sparse in that the number of active covariates is small compared to…

Statistics Theory · Mathematics 2011-12-26 Rina Foygel , Mathias Drton

Bayesian optimisation is a popular technique for hyperparameter learning but typically requires initial exploration even in cases where similar prior tasks have been solved. We propose to transfer information across tasks using learnt…

Machine Learning · Statistics 2019-05-28 Ho Chung Leon Law , Peilin Zhao , Lucian Chan , Junzhou Huang , Dino Sejdinovic

There is a widespread need for statistical methods that can analyze high-dimensional datasets with- out imposing restrictive or opaque modeling assumptions. This paper describes a domain-general data analysis method called CrossCat.…

Artificial Intelligence · Computer Science 2015-12-07 Vikash Mansinghka , Patrick Shafto , Eric Jonas , Cap Petschulat , Max Gasner , Joshua B. Tenenbaum

We derive closed-form expressions for the Bayes optimal decision boundaries in binary classification of high dimensional overlapping Gaussian mixture model (GMM) data, and show how they depend on the eigenstructure of the class covariances,…

Machine Learning · Statistics 2024-05-29 Khen Cohen , Noam Levi , Yaron Oz

Prior design is one of the most important problems in both statistics and machine learning. The cross validation (CV) and the widely applicable information criterion (WAIC) are predictive measures of the Bayesian estimation, however, it has…

Machine Learning · Computer Science 2015-03-30 Sumio Watanabe

When modeling a probability distribution with a Bayesian network, we are faced with the problem of how to handle continuous variables. Most previous work has either solved the problem by discretizing, or assumed that the data are generated…

Machine Learning · Computer Science 2013-02-21 George H. John , Pat Langley

Cross-validation is the workhorse of modern applied statistics and machine learning, as it provides a principled framework for selecting the model that maximizes generalization performance. In this paper, we show that the cross-validation…

Machine Learning · Statistics 2018-05-21 Shane Barratt , Rishi Sharma

We present a methodology for model evaluation and selection where the sampling mechanism violates the i.i.d. assumption. Our methodology involves a formulation of the bias between the standard Cross-Validation (CV) estimator and the mean…

Methodology · Statistics 2025-03-14 Oren Yuval , Saharon Rosset

Current causal discovery approaches require restrictive model assumptions in the absence of interventional data to ensure structure identifiability. These assumptions often do not hold in real-world applications leading to a loss of…

Machine Learning · Statistics 2025-06-25 Anish Dhir , Ruby Sedgwick , Avinash Kori , Ben Glocker , Mark van der Wilk
‹ Prev 1 2 3 10 Next ›