English
Related papers

Related papers: Distributed Computation for Marginal Likelihood ba…

200 papers

Effective and accurate model selection is an important problem in modern data analysis. One of the major challenges is the computational burden required to handle large data sets that cannot be stored or processed on one machine. Another…

Machine Learning · Statistics 2018-06-26 Michael Minyi Zhang , Henry Lam , Lizhen Lin

Bayesian model selection provides a powerful framework for objectively comparing models directly from observed data, without reference to ground truth data. However, Bayesian model selection requires the computation of the marginal…

Methodology · Statistics 2024-01-17 Xiaohao Cai , Jason D. McEwen , Marcelo Pereyra

Divide-and-conquer Bayesian methods consist of three steps: dividing the data into smaller computationally manageable subsets, running a sampling algorithm in parallel on all the subsets, and combining parameter draws from all the subsets.…

Methodology · Statistics 2021-06-01 Chunlei Wang , Sanvesh Srivastava

Inference of the marginal probability distribution is defined as the calculation of the probability of a subset of the variables and is relevant for handling missing data and hidden variables. While inference of the marginal probability…

Machine Learning · Statistics 2022-07-22 Fritz M. Bayer , Giusi Moffa , Niko Beerenwinkel , Jack Kuipers

Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In…

Machine Learning · Statistics 2024-01-30 Alexandros E. Tzikas , Licio Romao , Mert Pilanci , Alessandro Abate , Mykel J. Kochenderfer

Estimating copulas with discrete marginal distributions is challenging, especially in high dimensions, because computing the likelihood contribution of each observation requires evaluating $2^{J}$ terms, with $J$ the number of discrete…

Methodology · Statistics 2018-11-12 D. Gunawan , M. -N. Tran , K. Suzuki , J. Dick , R. Kohn

We propose a distributed computing framework, based on a divide and conquer strategy and hierarchical modeling, to accelerate posterior inference for high-dimensional Bayesian factor models. Our approach distributes the task of…

Methodology · Statistics 2016-12-30 Gautam Sabnis , Debdeep Pati , Barbara Engelhardt , Natesh Pillai

We propose a novel Bayesian model selection technique on linear mixed-effects models to compare multiple treatments with a control. A fully Bayesian approach is implemented to estimate the marginal inclusion probabilities that provide a…

Applications · Statistics 2015-09-28 Lei Gong , James M. Flegal , Stephen R. Spindler , Patricia L. Mote

For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an…

Computation · Statistics 2019-11-25 Wu Changye , Christian P. Robert

It is common practice to use Laplace approximations to compute marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to…

Methodology · Statistics 2022-02-01 Jon Lachmann , Geir Storvik , Florian Frommlet , Aliaksadr Hubin

We develop sampling algorithms to fit Bayesian hierarchical models, the computational complexity of which scales linearly with the number of observations and the number of parameters in the model. We focus on crossed random effect and…

Computation · Statistics 2025-01-03 Omiros Papaspiliopoulos , Timothée Stumpf-Fétizon , Giacomo Zanella

We propose and analyze a generalized splitting method to sample approximately from a distribution conditional on the occurrence of a rare event. This has important applications in a variety of contexts in operations research, engineering,…

Methodology · Statistics 2019-09-10 Zdravko I. Botev , Pierre L'Ecuyer

This paper describes a Bayesian method for learning causal networks using samples that were selected in a non-random manner from a population of interest. Examples of data obtained by non-random sampling include convenience samples and…

Artificial Intelligence · Computer Science 2013-01-18 Gregory F. Cooper

We present a Bayesian data fusion method to approximate a posterior distribution from an ensemble of particle estimates that only have access to subsets of the data. Our approach relies on approximate probabilistic inference of model…

Computation · Statistics 2020-10-28 Caleb Miller , Michael D. Schneider , Jem N. Corcoran , Jason Bernstein

Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference in generalized linear models; however, these algorithms are prohibitively slow in massive data…

Computation · Statistics 2020-08-31 Nariankadu D. Shyamalkumar , Sanvesh Srivastava

Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used…

Machine Learning · Statistics 2020-07-17 Avinava Dubey , Michael Minyi Zhang , Eric P. Xing , Sinead A. Williamson

We develop a Bayesian approach for selecting the model which is the most supported by the data within a class of marginal models for categorical variables formulated through equality and/or inequality constraints on generalised logits…

Statistics Theory · Mathematics 2012-02-21 Francesco Bartolucci , Luisa Scaccia , Alessio Farcomeni

In many modern applications, there is interest in analyzing enormous data sets that cannot be easily moved across computers or loaded into memory on a single computer. In such settings, it is very common to be interested in clustering.…

Computation · Statistics 2020-05-15 Hanyu Song , Yingjian Wang , David B. Dunson

Many inference problems involve inferring the number $N$ of components in some region, along with their properties $\{\mathbf{x}_i\}_{i=1}^N$, from a dataset $\mathcal{D}$. A common statistical example is finite mixture modelling. In the…

Computation · Statistics 2015-01-15 Brendon J. Brewer

We propose a novel Bayesian inference framework for distributed differentially private linear regression. We consider a distributed setting where multiple parties hold parts of the data and share certain summary statistics of their portions…

Machine Learning · Statistics 2023-06-08 Barış Alparslan , Sinan Yıldırım , Ş. İlker Birbil
‹ Prev 1 2 3 10 Next ›