English
Related papers

Related papers: Parallelizing MCMC with Random Partition Trees

200 papers

For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an…

Computation · Statistics 2019-11-25 Wu Changye , Christian P. Robert

Markov Chain Monte Carlo (MCMC) is a well-established family of algorithms primarily used in Bayesian statistics to sample from a target distribution when direct sampling is challenging. Existing work on Bayesian decision trees uses MCMC.…

Computation · Statistics 2023-01-24 Efthyvoulos Drousiotis , Paul G. Spirakis , Simon Maskell

While MCMC methods have become a main work-horse for Bayesian inference, scaling them to large distributed datasets is still a challenge. Embarrassingly parallel MCMC strategies take a divide-and-conquer stance to achieve this by writing…

Machine Learning · Computer Science 2021-06-16 Diego Mesquita , Paul Blomstedt , Samuel Kaski

In big data context, traditional MCMC methods, such as Metropolis-Hastings algorithms and hybrid Monte Carlo, scale poorly because of their need to evaluate the likelihood over the whole data set at each iteration. In order to resurrect…

Computation · Statistics 2017-06-20 Changye Wu , Christian P. Robert

In this paper, we propose a MCMC algorithm based on elliptical slice sampling with the purpose to improve sampling efficiency. During sampling, a mixture distribution is fitted periodically to previous samples. The components of the mixture…

Computation · Statistics 2019-03-14 Song Li , Geoffrey K. F. Tso

Due to the escalating growth of big data sets in recent years, new Bayesian Markov chain Monte Carlo (MCMC) parallel computing methods have been developed. These methods partition large data sets by observations into subsets. However, for…

Methodology · Statistics 2019-01-21 Zheng Wei , Erin M. Conlon

Performing Bayesian inference via Markov chain Monte Carlo (MCMC) can be exceedingly expensive when posterior evaluations invoke the evaluation of a computationally expensive model, such as a system of partial differential equations. In…

Computation · Statistics 2017-12-27 Patrick Conrad , Andrew Davis , Youssef Marzouk , Natesh Pillai , Aaron Smith

Communication costs, resulting from synchronization requirements during learning, can greatly slow down many parallel machine learning algorithms. In this paper, we present a parallel Markov chain Monte Carlo (MCMC) algorithm in which…

Machine Learning · Statistics 2014-03-24 Willie Neiswanger , Chong Wang , Eric Xing

Markov chain Monte Carlo (MCMC) is the predominant tool used in Bayesian parameter estimation for hierarchical models. When the model expands due to an increasing number of hierarchical levels, number of groups at a particular level, or…

Computation · Statistics 2016-06-22 Will Landau , Jarad Niemi

Probabilistic models are conceptually powerful tools for finding structure in data, but their practical effectiveness is often limited by our ability to perform inference in them. Exact inference is frequently intractable, so approximate…

Computation · Statistics 2014-07-25 Robert Nishihara , Iain Murray , Ryan P. Adams

Markov chain Monte Carlo (MCMC) algorithms have become powerful tools for Bayesian inference. However, they do not scale well to large-data problems. Divide-and-conquer strategies, which split the data into batches and, for each batch, run…

Computation · Statistics 2017-07-18 Christopher Nemeth , Chris Sherlock

In the era of Big Data, Markov chain Monte Carlo (MCMC) methods, which are currently essential for Bayesian estimation, face significant computational challenges owing to their sequential nature. To achieve a faster and more effective…

Computation · Statistics 2024-11-08 Tomoki Matsumoto

With the rapidly growing scales of statistical problems, subset based communication-free parallel MCMC methods are a promising future for large scale Bayesian analysis. In this article, we propose a new Weierstrass sampler for parallel MCMC…

Computation · Statistics 2014-05-27 Xiangyu Wang , David B. Dunson

We develop an Evolutionary Markov Chain Monte Carlo (EMCMC) algorithm for sampling spatial partitions that lie within a large and complex spatial state space. Our algorithm combines the advantages of evolutionary algorithms (EAs) as…

Computation · Statistics 2021-01-19 Wendy K. Tam Cho , Yan Y. Liu

Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large…

Computation · Statistics 2015-12-07 Roberto Casarin , Radu V. Craiu , Fabrizio Leisen

Decision tree learning is a popular approach for classification and regression in machine learning and statistics, and Bayesian formulations---which introduce a prior distribution over decision trees, and formulate learning as posterior…

Machine Learning · Statistics 2013-08-26 Balaji Lakshminarayanan , Daniel M. Roy , Yee Whye Teh

Standard MCMC methods can scale poorly to big data settings due to the need to evaluate the likelihood at each iteration. There have been a number of approximate MCMC algorithms that use sub-sampling ideas to reduce this computational…

Computation · Statistics 2020-09-29 Joris Bierkens , Paul Fearnhead , Gareth Roberts

Bayesian inference with Markov Chain Monte Carlo (MCMC) is challenging when the likelihood function is irregular and expensive to compute. We explore several sampling algorithms that make use of subset evaluations to reduce computational…

Machine Learning · Statistics 2025-05-16 Conor Rosato , Harvinder Lehal , Simon Maskell , Lee Devlin , Malcolm Strens

Parallel Markov Chain Monte Carlo (pMCMC) algorithms generate clouds of proposals at each step to efficiently resolve a target probability distribution. We build a rigorous foundational framework for pMCMC algorithms that situates these…

This paper proposes a novel approach to generate samples from target distributions that are difficult to sample from using Markov Chain Monte Carlo (MCMC) methods. Traditional MCMC algorithms often face slow convergence due to the…

Cosmology and Nongalactic Astrophysics · Physics 2023-08-11 Sandro Dias Pinto Vitenti , Eduardo J. Barroso
‹ Prev 1 2 3 10 Next ›