Related papers: Bayesian data selection

High-dimensional Ising model selection with Bayesian information criteria

We consider the use of Bayesian information criteria for selection of the graph underlying an Ising model. In an Ising model, the full conditional distributions of each variable form logistic regression models, and variable selection…

Statistics Theory · Mathematics 2015-03-09 Rina Foygel Barber , Mathias Drton

Fair Bayesian Data Selection via Generalized Discrepancy Measures

Fairness concerns are increasingly critical as machine learning models are deployed in high-stakes applications. While existing fairness-aware methods typically intervene at the model level, they often suffer from high computational costs,…

Machine Learning · Computer Science 2025-11-11 Yixuan Zhang , Jiabin Luo , Zhenggang Wang , Feng Zhou , Quyu Kong

Consistent Bayesian Information Criterion Based on a Mixture Prior for Possibly High-Dimensional Multivariate Linear Regression Models

In the problem of selecting variables in a multivariate linear regression model, we derive new Bayesian information criteria based on a prior mixing a smooth distribution and a delta distribution. Each of them can be interpreted as a fusion…

Statistics Theory · Mathematics 2022-09-29 Haruki Kono , Tatsuya Kubokawa

Inferring the shape of data: A probabilistic framework for analyzing experiments in the natural sciences

A critical step in data analysis for many different types of experiments is the identification of features with theoretically defined shapes in N-dimensional datasets; examples of this process include finding peaks in multi-dimensional…

Data Analysis, Statistics and Probability · Physics 2022-08-25 Korak Kumar Ray , Anjali R. Verma , Ruben L. Gonzalez , Colin D. Kinz-Thompson

Variational Nonparametric Discriminant Analysis

Variable selection and classification are common objectives in the analysis of high-dimensional data. Most such methods make distributional assumptions that may not be compatible with the diverse families of distributions data can take. A…

Methodology · Statistics 2019-08-28 Weichang Yu , Lamiae Azizi , John T. Ormerod

On the geometry of Stein variational gradient descent

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan , N. Nuesken , L. Szpruch

Bayesian estimate of the degree of a polynomial given a noisy data sample

A widely used method to create a continuous representation of a discrete data-set is regression analysis. When the regression model is not based on a mathematical description of the physics underlying the data, heuristic techniques play a…

Statistics Theory · Mathematics 2013-07-18 Giovanni Mana , Paolo Alberto Giuliano Albo , Simona Lago

Hierarchical Bayesian data selection

There are many issues that can cause problems when attempting to infer model parameters from data. Data and models are both imperfect, and as such there are multiple scenarios in which standard methods of inference will lead to misleading…

Computation · Statistics 2024-05-01 Simon L. Cotter

Model Selection in High-Dimensional Block-Sparse Linear Regression

Model selection is an indispensable part of data analysis dealing very frequently with fitting and prediction purposes. In this paper, we tackle the problem of model selection in a general linear regression where the parameter matrix…

Signal Processing · Electrical Eng. & Systems 2022-09-19 Prakash B. Gohain , Magnus Jansson

Bayesian Variable Selection in a Million Dimensions

Bayesian variable selection is a powerful tool for data analysis, as it offers a principled method for variable selection that accounts for prior information and uncertainty. However, wider adoption of Bayesian variable selection has been…

Methodology · Statistics 2023-12-06 Martin Jankowiak

Bayesian Nonparametric Variable Selection as an Exploratory Tool for Finding Genes that Matter

High-throughput scientific studies involving no clear a'priori hypothesis are common. For example, a large-scale genomic study of a disease may examine thousands of genes without hypothesizing that any specific gene is responsible for the…

Methodology · Statistics 2012-03-02 Babak Shahbaba

Bayesian Model Selection of Stochastic Block Models

A central problem in analyzing networks is partitioning them into modules or communities. One of the best tools for this is the stochastic block model, which clusters vertices into blocks with statistically homogeneous pattern of links.…

Machine Learning · Statistics 2016-05-24 Xiaoran Yan

Gradient-based data and parameter dimension reduction for Bayesian models: an information theoretic perspective

We consider the problem of reducing the dimensions of parameters and data in non-Gaussian Bayesian inference problems. Our goal is to identify an "informed" subspace of the parameters and an "informative" subspace of the data so that a…

Computation · Statistics 2022-07-19 Ricardo Baptista , Youssef Marzouk , Olivier Zahm

Bandwidth selection for kernel estimation in mixed multi-dimensional spaces

Kernel estimation techniques, such as mean shift, suffer from one major drawback: the kernel bandwidth selection. The bandwidth can be fixed for all the data set or can vary at each points. Automatic bandwidth selection becomes a real…

Computer Vision and Pattern Recognition · Computer Science 2011-11-10 Aurelie Bugeau , Patrick Pérez

Applications of Bayesian model selection to cosmological parameters

Bayesian model selection is a tool to decide whether the introduction of a new parameter is warranted by data. I argue that the usual sampling statistic significance tests for a null hypothesis can be misleading, since they do not take into…

Astrophysics · Physics 2008-11-26 Roberto Trotta

Cosmological model selection

Model selection aims to determine which theoretical models are most plausible given some data, without necessarily asking about the preferred values of the model parameters. A common model selection question is to ask when new data require…

Astrophysics · Physics 2008-11-26 Andrew R. Liddle , Pia Mukherjee , David Parkinson

High-dimensional posterior consistency for hierarchical non-local priors in regression

The choice of tuning parameters in Bayesian variable selection is a critical problem in modern statistics. In particular, for Bayesian linear regression with non-local priors, the scale parameter in the non-local prior density is an…

Statistics Theory · Mathematics 2019-02-25 Xuan Cao , Kshitij Khare , Malay Ghosh

Bayesian Model Selection on Random Networks

A general Bayesian framework for model selection on random network models regarding their features is considered. The goal is to develop a principle Bayesian model selection approach to compare different fittable, not necessarily nested,…

Methodology · Statistics 2020-04-30 Papamichalis Marios

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify…

Machine Learning · Statistics 2021-04-28 Themistoklis P. Sapsis

Model selection in the average of inconsistent data: an analysis of the measured Planck-constant values

When the data do not conform to the hypothesis of a known sampling-variance, the fitting of a constant to a set of measured values is a long debated problem. Given the data, fitting would require to find what measurand value is the most…

Data Analysis, Statistics and Probability · Physics 2020-07-21 Giovanni Mana , Enrico Massa , Maria Predescu