Related papers: Training samples in objective Bayesian model selec…

On a Class of Objective Priors from Scoring Rules

Objective prior distributions represent an important tool that allows one to have the advantages of using the Bayesian framework even when information about the parameters of a model is not available. The usual objective approaches work off…

Methodology · Statistics 2018-09-25 Fabrizio Leisen , Cristiano Villa , Stephen G. Walker

A Bayesian/Information Theoretic Model of Bias Learning

In this paper the problem of learning appropriate bias for an environment of related tasks is examined from a Bayesian perspective. The environment of related tasks is shown to be naturally modelled by the concept of an {\em objective}…

Machine Learning · Computer Science 2019-11-15 Jonathan Baxter

Sampling Bias Correction for Supervised Machine Learning: A Bayesian Inference Approach with Practical Applications

Given a supervised machine learning problem where the training set has been subject to a known sampling bias, how can a model be trained to fit the original dataset? We achieve this through the Bayesian inference framework by altering the…

Machine Learning · Statistics 2022-03-16 Max Sklar

A Bayesian Method for Causal Modeling and Discovery Under Selection

This paper describes a Bayesian method for learning causal networks using samples that were selected in a non-random manner from a population of interest. Examples of data obtained by non-random sampling include convenience samples and…

Artificial Intelligence · Computer Science 2013-01-18 Gregory F. Cooper

Criteria for Bayesian model choice with application to variable selection

In objective Bayesian model selection, no single criterion has emerged as dominant in defining objective prior distributions. Indeed, many criteria have been separately proposed and utilized to propose differing prior choices. We first…

Statistics Theory · Mathematics 2012-09-25 M. J. Bayarri , J. O. Berger , A. Forte , G. García-Donato

Bayesian analysis of the prevalence bias: learning and predicting from imbalanced data

Datasets are rarely a realistic approximation of the target population. Say, prevalence is misrepresented, image quality is above clinical standards, etc. This mismatch is known as sampling bias. Sampling biases are a major hindrance for…

Machine Learning · Computer Science 2021-08-03 Loic Le Folgoc , Vasileios Baltatzis , Amir Alansary , Sujal Desai , Anand Devaraj , Sam Ellis , Octavio E. Martinez Manzanera , Fahdi Kanavati , Arjun Nair , Julia Schnabel , Ben Glocker

Towards Accelerated Model Training via Bayesian Data Selection

Mislabeled, duplicated, or biased data in real-world scenarios can lead to prolonged training and even hinder model convergence. Traditional solutions prioritizing easy or hard samples lack the flexibility to handle such a variety…

Machine Learning · Computer Science 2023-11-08 Zhijie Deng , Peng Cui , Jun Zhu

On the safe use of prior densities for Bayesian model selection

The application of Bayesian inference for the purpose of model selection is very popular nowadays. In this framework, models are compared through their marginal likelihoods, or their quotients, called Bayes factors. However, marginal…

Methodology · Statistics 2022-07-27 F. Llorente , L. Martino , E. Curbelo , J. Lopez-Santiago , D. Delgado

Comparison of Bayesian predictive methods for model selection

The goal of this paper is to compare several widely used Bayesian model selection methods in practical model selection problems, highlight their differences and give recommendations about the preferred approaches. We focus on the variable…

Methodology · Statistics 2017-12-18 Juho Piironen , Aki Vehtari

Bayesian model choice and information criteria in sparse generalized linear models

We consider Bayesian model selection in generalized linear models that are high-dimensional, with the number of covariates p being large relative to the sample size n, but sparse in that the number of active covariates is small compared to…

Statistics Theory · Mathematics 2011-12-26 Rina Foygel , Mathias Drton

Posterior Impropriety of some Sparse Bayesian Learning Models

Sparse Bayesian learning models are typically used for prediction in datasets with significantly greater number of covariates than observations. Such models often take a reproducing kernel Hilbert space (RKHS) approach to carry out the task…

Statistics Theory · Mathematics 2021-06-22 Anand Dixit , Vivekananda Roy

Bayesian sample size determination using commensurate priors to leverage pre-experimental data

This paper develops Bayesian sample size formulae for experiments comparing two groups. We assume the experimental data will be analysed in the Bayesian framework, where pre-experimental information from multiple sources can be represented…

Methodology · Statistics 2022-03-09 Haiyan Zheng , Thomas Jaki , James M. S. Wason

Towards a statistical theory of data selection under weak supervision

Given a sample of size $N$, it is often useful to select a subsample of smaller size $n<N$ to be used for statistical estimation or learning. Such a data selection step is useful to reduce the requirements of data labeling and the…

Machine Learning · Statistics 2023-10-05 Germain Kolossov , Andrea Montanari , Pulkit Tandon

Bayesian inference through encompassing priors and importance sampling for a class of marginal models for categorical data

We develop a Bayesian approach for selecting the model which is the most supported by the data within a class of marginal models for categorical variables formulated through equality and/or inequality constraints on generalised logits…

Statistics Theory · Mathematics 2012-02-21 Francesco Bartolucci , Luisa Scaccia , Alessio Farcomeni

Bayesian Optimization for Selecting Efficient Machine Learning Models

The performance of many machine learning models depends on their hyper-parameter settings. Bayesian Optimization has become a successful tool for hyper-parameter optimization of machine learning algorithms, which aims to identify optimal…

Machine Learning · Computer Science 2020-08-04 Lidan Wang , Franck Dernoncourt , Trung Bui

Bayesian Negative Sampling for Recommendation

How to sample high quality negative instances from unlabeled data, i.e., negative sampling, is important for training implicit collaborative filtering and contrastive learning models. Although previous studies have proposed some approaches…

Information Retrieval · Computer Science 2022-07-12 Bin Liu , Bang Wang

Bayesian subset selection and variable importance for interpretable prediction and classification

Subset selection is a valuable tool for interpretable learning, scientific discovery, and data compression. However, classical subset selection is often avoided due to selection instability, lack of regularization, and difficulties with…

Machine Learning · Statistics 2022-02-17 Daniel R. Kowal

Bayesian Sampling Bias Correction: Training with the Right Loss Function

We derive a family of loss functions to train models in the presence of sampling bias. Examples are when the prevalence of a pathology differs from its sampling rate in the training dataset, or when a machine learning practioner rebalances…

Machine Learning · Computer Science 2020-06-25 L. Le Folgoc , V. Baltatzis , A. Alansary , S. Desai , A. Devaraj , S. Ellis , O. E. Martinez Manzanera , F. Kanavati , A. Nair , J. Schnabel , B. Glocker

Progressive Sampling-Based Bayesian Optimization for Efficient and Automatic Machine Learning Model Selection

Purpose: Machine learning is broadly used for clinical data analysis. Before training a model, a machine learning algorithm must be selected. Also, the values of one or more model parameters termed hyper-parameters must be set. Selecting…

Machine Learning · Computer Science 2018-12-10 Xueqiang Zeng , Gang Luo

Objective Priors: An Introduction for Frequentists

Bayesian methods are increasingly applied in these days in the theory and practice of statistics. Any Bayesian inference depends on a likelihood and a prior. Ideally one would like to elicit a prior from related sources of information or…

Methodology · Statistics 2011-08-11 Malay Ghosh