Related papers: Distributed Computation for Marginal Likelihood ba…

Robust and Parallel Bayesian Model Selection

Effective and accurate model selection is an important problem in modern data analysis. One of the major challenges is the computational burden required to handle large data sets that cannot be stored or processed on one machine. Another…

Machine Learning · Statistics 2018-06-26 Michael Minyi Zhang , Henry Lam , Lizhen Lin

Proximal nested sampling for high-dimensional Bayesian model selection

Bayesian model selection provides a powerful framework for objectively comparing models directly from observed data, without reference to ground truth data. However, Bayesian model selection requires the computation of the marginal…

Methodology · Statistics 2024-01-17 Xiaohao Cai , Jason D. McEwen , Marcelo Pereyra

Divide-and-Conquer Bayesian Inference in Hidden Markov Models

Divide-and-conquer Bayesian methods consist of three steps: dividing the data into smaller computationally manageable subsets, running a sampling algorithm in parallel on all the subsets, and combining parameter draws from all the subsets.…

Methodology · Statistics 2021-06-01 Chunlei Wang , Sanvesh Srivastava

High-Dimensional Inference in Bayesian Networks

Inference of the marginal probability distribution is defined as the calculation of the probability of a subset of the variables and is relevant for handling missing data and hidden variables. While inference of the marginal probability…

Machine Learning · Statistics 2022-07-22 Fritz M. Bayer , Giusi Moffa , Niko Beerenwinkel , Jack Kuipers

Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In…

Machine Learning · Statistics 2024-01-30 Alexandros E. Tzikas , Licio Romao , Mert Pilanci , Alessandro Abate , Mykel J. Kochenderfer

Computationally Efficient Bayesian Estimation of High Dimensional Copulas with Discrete and Mixed Margins

Estimating copulas with discrete marginal distributions is challenging, especially in high dimensions, because computing the likelihood contribution of each observation requires evaluating $2^{J}$ terms, with $J$ the number of discrete…

Methodology · Statistics 2018-11-12 D. Gunawan , M. -N. Tran , K. Suzuki , J. Dick , R. Kohn

A Divide and Conquer Strategy for High Dimensional Bayesian Factor Models

We propose a distributed computing framework, based on a divide and conquer strategy and hierarchical modeling, to accelerate posterior inference for high-dimensional Bayesian factor models. Our approach distributes the task of…

Methodology · Statistics 2016-12-30 Gautam Sabnis , Debdeep Pati , Barbara Engelhardt , Natesh Pillai

Bayesian model selection on linear mixed-effects models for comparisons between multiple treatments and a control

We propose a novel Bayesian model selection technique on linear mixed-effects models to compare multiple treatments with a control. A fully Bayesian approach is implemented to estimate the marginal inclusion probabilities that provide a…

Applications · Statistics 2015-09-28 Lei Gong , James M. Flegal , Stephen R. Spindler , Patricia L. Mote

Parallelising MCMC via Random Forests

For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an…

Computation · Statistics 2019-11-25 Wu Changye , Christian P. Robert

A subsampling approach for Bayesian model selection

It is common practice to use Laplace approximations to compute marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to…

Methodology · Statistics 2022-02-01 Jon Lachmann , Geir Storvik , Florian Frommlet , Aliaksadr Hubin

Scalable Bayesian computation for crossed and nested hierarchical models

We develop sampling algorithms to fit Bayesian hierarchical models, the computational complexity of which scales linearly with the number of observations and the number of parameters in the model. We focus on crossed random effect and…

Computation · Statistics 2025-01-03 Omiros Papaspiliopoulos , Timothée Stumpf-Fétizon , Giacomo Zanella

Sampling Conditionally on a Rare Event via Generalized Splitting

We propose and analyze a generalized splitting method to sample approximately from a distribution conditional on the occurrence of a rare event. This has important applications in a variety of contexts in operations research, engineering,…

Methodology · Statistics 2019-09-10 Zdravko I. Botev , Pierre L'Ecuyer

A Bayesian Method for Causal Modeling and Discovery Under Selection

This paper describes a Bayesian method for learning causal networks using samples that were selected in a non-random manner from a population of interest. Examples of data obtained by non-random sampling include convenience samples and…

Artificial Intelligence · Computer Science 2013-01-18 Gregory F. Cooper

Bayesian Fusion of Data Partitioned Particle Estimates

We present a Bayesian data fusion method to approximate a posterior distribution from an ensemble of particle estimates that only have access to subsets of the data. Our approach relies on approximate probabilistic inference of model…

Computation · Statistics 2020-10-28 Caleb Miller , Michael D. Schneider , Jem N. Corcoran , Jason Bernstein

An Algorithm for Distributed Bayesian Inference in Generalized Linear Models

Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference in generalized linear models; however, these algorithms are prohibitively slow in massive data…

Computation · Statistics 2020-08-31 Nariankadu D. Shyamalkumar , Sanvesh Srivastava

Distributed, partially collapsed MCMC for Bayesian Nonparametrics

Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used…

Machine Learning · Statistics 2020-07-17 Avinava Dubey , Michael Minyi Zhang , Eric P. Xing , Sinead A. Williamson

Bayesian inference through encompassing priors and importance sampling for a class of marginal models for categorical data

We develop a Bayesian approach for selecting the model which is the most supported by the data within a class of marginal models for categorical variables formulated through equality and/or inequality constraints on generalised logits…

Statistics Theory · Mathematics 2012-02-21 Francesco Bartolucci , Luisa Scaccia , Alessio Farcomeni

Distributed Bayesian clustering using finite mixture of mixtures

In many modern applications, there is interest in analyzing enormous data sets that cannot be easily moved across computers or loaded into memory on a single computer. In such settings, it is very common to be interested in clustering.…

Computation · Statistics 2020-05-15 Hanyu Song , Yingjian Wang , David B. Dunson

Inference for Trans-dimensional Bayesian Models with Diffusive Nested Sampling

Many inference problems involve inferring the number $N$ of components in some region, along with their properties $\{\mathbf{x}_i\}_{i=1}^N$, from a dataset $\mathcal{D}$. A common statistical example is finite mixture modelling. In the…

Computation · Statistics 2015-01-15 Brendon J. Brewer

Differentially Private Distributed Bayesian Linear Regression with MCMC

We propose a novel Bayesian inference framework for distributed differentially private linear regression. We consider a distributed setting where multiple parties hold parts of the data and share certain summary statistics of their portions…

Machine Learning · Statistics 2023-06-08 Barış Alparslan , Sinan Yıldırım , Ş. İlker Birbil