English
Related papers

Related papers: Divide-and-Conquer MCMC for Multivariate Binary Da…

200 papers

In large-scale genomic applications vast numbers of molecular features are scanned in order to find a small number of candidates which are linked to a particular disease or phenotype. This is a variable selection problem in the "large p,…

Computation · Statistics 2014-02-13 Manuela Zucknick , Sylvia Richardson

Varying coefficient models (VCMs) are widely used for estimating nonlinear regression functions for functional data. Their Bayesian variants using Gaussian process priors on the functional coefficients, however, have received limited…

Methodology · Statistics 2022-03-01 Rajarshi Guhaniyogi , Cheng Li , Terrance D. Savitsky , Sanvesh Srivastava

Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference in generalized linear models; however, these algorithms are prohibitively slow in massive data…

Computation · Statistics 2020-08-31 Nariankadu D. Shyamalkumar , Sanvesh Srivastava

Discrete data are abundant and often arise as counts or rounded data. These data commonly exhibit complex distributional features such as zero-inflation, over-/under-dispersion, boundedness, and heaping, which render many parametric models…

Methodology · Statistics 2023-02-27 Daniel R. Kowal , Bohan Wu

Functional mixed models are widely useful for regression analysis with dependent functional data, including longitudinal functional data with scalar predictors. However, existing algorithms for Bayesian inference with these models only…

Methodology · Statistics 2023-06-14 Thomas Y. Sun , Daniel R. Kowal

Divide-and-conquer MCMC is a strategy for parallelising Markov Chain Monte Carlo sampling by running independent samplers on disjoint subsets of a dataset and merging their output. An ongoing challenge in the literature is to efficiently…

Machine Learning · Statistics 2024-06-18 C. Trojan , P. Fearnhead , C. Nemeth

Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large…

Computation · Statistics 2015-12-07 Roberto Casarin , Radu V. Craiu , Fabrizio Leisen

Large-scale population-based studies in medicine are a key resource towards better diagnosis, monitoring, and treatment of diseases. They also serve as enablers of clinical decision support systems, in particular Computer Aided Diagnosis…

Machine Learning · Computer Science 2022-03-01 Gerome Vivar , Anees Kazi , Hendrik Burwinkel , Andreas Zwergal , Nassir Navab , Seyed-Ahmad Ahmadi

Binary optimization has a wide range of applications in combinatorial optimization problems such as MaxCut, MIMO detection, and MaxSAT. However, these problems are typically NP-hard due to the binary constraints. We develop a novel…

Optimization and Control · Mathematics 2023-07-04 Cheng Chen , Ruitao Chen , Tianyou Li , Ruichen Ao , Zaiwen Wen

For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an…

Computation · Statistics 2019-11-25 Wu Changye , Christian P. Robert

There has been considerable interest in making Bayesian inference more scalable. In big data settings, most literature focuses on reducing the computing time per iteration, with less focused on reducing the number of iterations needed in…

Methodology · Statistics 2017-09-28 Leo L. Duan , James E. Johndrow , David B. Dunson

Ordinal categorical data are routinely encountered in many practical applications. When the primary goal is to construct a regression model for ordinal outcomes, cumulative link models represent one of the most popular choices to link the…

Methodology · Statistics 2026-03-13 Emanuele Aliverti

Advances in digital sensors, digital data storage and communications have resulted in systems being capable of accumulating large collections of data. In the light of dealing with the challenges that massive data present, this work proposes…

Computation · Statistics 2015-12-09 Allan De Freitas , François Septier , Lyudmila Mihaylova

Markov chain Monte Carlo (MCMC) algorithms have become powerful tools for Bayesian inference. However, they do not scale well to large-data problems. Divide-and-conquer strategies, which split the data into batches and, for each batch, run…

Computation · Statistics 2017-07-18 Christopher Nemeth , Chris Sherlock

Bayesian profile regression mixture models (BPRM) allow to assess a health risk in a multi-exposed population. These mixture models cluster individuals according to their exposure profile and their health risk. However, their results, based…

Methodology · Statistics 2025-12-30 Fendler Julie , Guihenneuc Chantal , Ancelet Sophie

The multinomial probit model is a popular tool for analyzing choice behaviour as it allows for correlation between choice alternatives. Because current model specifications employ a full covariance matrix of the latent utilities for the…

Econometrics · Economics 2021-03-25 Ruben Loaiza-Maya , Didier Nibbering

To conduct Bayesian inference with large data sets, it is often convenient or necessary to distribute the data across multiple machines. We consider a likelihood function expressed as a product of terms, each associated with a subset of the…

Computation · Statistics 2020-04-09 Lewis J. Rendell , Adam M. Johansen , Anthony Lee , Nick Whiteley

Many modern applications collect highly imbalanced categorical data, with some categories relatively rare. Bayesian hierarchical models combat data sparsity by borrowing information, while also quantifying uncertainty. However, posterior…

Statistics Theory · Mathematics 2017-06-27 James E. Johndrow , Aaron Smith , Natesh Pillai , David B. Dunson

Markov chain Monte Carlo (MCMC) algorithms are widely used to sample from complicated distributions, especially to sample from the posterior distribution in Bayesian inference. However, MCMC is not directly applicable when facing the doubly…

Computation · Statistics 2019-03-29 Guanyang Wang

Estimating model parameters of a general family of cure models is always a challenging task mainly due to flatness and multimodality of the likelihood function. In this work, we propose a fully Bayesian approach in order to overcome these…

Methodology · Statistics 2024-08-20 Panagiotis Papastamoulis , Fotios Milienos
‹ Prev 1 2 3 10 Next ›