Related papers: Massive parallelization boosts big Bayesian multid…

200 papers

Bayesian multidimensional scaling (BMDS) is a probabilistic dimension reduction tool that allows one to model and visualize data consisting of dissimilarities between pairs of objects. Although BMDS has proven useful within, e.g., Bayesian…

Methodology · Statistics 2025-05-23 Ami Sheth, Aaron Smith, Andrew J. Holbrook

Multidimensional scaling (MDS) is widely used to reconstruct a low-dimensional representation of high-dimensional data while preserving pairwise distances. However, Bayesian MDS approaches based on Markov chain Monte Carlo (MCMC) face…

Methodology · Statistics 2026-02-26 Jiarui Zhang, Jiguo Cao, Liangliang Wang

Machine learning models, and deep neural networks in particular, are increasingly deployed in risk-sensitive domains such as healthcare, environmental forecasting, and finance, where reliable quantification of predictive uncertainty is…

Machine Learning · Computer Science 2026-04-07 Asena Karolin Özdemir, Lars H. Heyen, Arvid Weyrauch, Achim Streit, Markus Götz, Charlotte Debus

Markov chain Monte Carlo (MCMC) is the predominant tool used in Bayesian parameter estimation for hierarchical models. When the model expands due to an increasing number of hierarchical levels, number of groups at a particular level, or…

Computation · Statistics 2016-06-22 Will Landau, Jarad Niemi

In the era of Big Data, scalable and accurate clustering algorithms for high-dimensional data are essential. We present new Bayesian Distance Clustering (BDC) models and inference algorithms with improved scalability while maintaining the…

Methodology · Statistics 2024-09-02 Rafael Cabral, Maria de Iorio, Andrew Harris

We introduce a class of scalable Bayesian hierarchical models for the analysis of massive geostatistical datasets. The underlying idea combines ideas on high-dimensional geostatistics by partitioning the spatial domain and modeling the…

Methodology · Statistics 2020-10-09 Michele Peruzzi, Sudipto Banerjee, Andrew O. Finley

Massive datasets in the gigabyte and terabyte range combined with the availability of increasingly sophisticated statistical tools yield analyses at the boundary of what is computationally feasible. Compromising in the face of this…

Applications · Statistics 2011-01-06 Jennifer A. Tom, Janet S. Sinsheimer, Marc A. Suchard

Nonstationary non-Gaussian spatial data are common in many disciplines, including climate science, ecology, epidemiology, and social sciences. Examples include count data on disease incidence and binary satellite data on cloud mask…

Computation · Statistics 2020-11-30 Benjamin Seiyon Lee, Jaewoo Park

We introduce a novel and scalable Bayesian framework for multivariate-density-density regression (DDR), designed to model relationships between multivariate distributions. Our approach addresses the critical issue of distributions residing…

Methodology · Statistics 2025-09-24 Khai Nguyen, Yang Ni, Peter Mueller

Following a series of high-profile drug safety disasters in recent years, many countries are redoubling their efforts to ensure the safety of licensed medical products. Large-scale observational databases such as claims databases or…

Computation · Statistics 2012-08-07 Marc A. Suchard, Shawn E. Simpson, Ivan Zorych, Patrick Ryan, David Madigan

We present a set of algorithms implementing multidimensional scaling (MDS) for large data sets. MDS is a family of dimensionality reduction techniques using an $n \times n$ distance matrix as input, where $n$ is the number of individuals,…

Computation · Statistics 2024-02-02 Pedro Delicado, Cristian Pachón-García

Although Bayesian density estimation using discrete mixtures has good performance in modest dimensions, there is a lack of statistical and computational scalability to high-dimensional multivariate cases. To combat the curse of…

Methodology · Statistics 2014-10-29 Ye Wang, Antonio Canale, David Dunson

Multidimensional scaling (MDS) is a dimensionality reduction tool used for information analysis, data visualization and manifold learning. Most MDS procedures embed data points in low-dimensional Euclidean (flat) domains, such that…

Computational Geometry · Computer Science 2018-10-23 Gil Shamai, Michael Zibulevsky, Ron Kimmel

Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for…

Viral mutations pose significant threats to public health by increasing infectivity, strengthening vaccine resistance, and altering disease severity. To track these evolving patterns, agencies like the CDC annually evaluate thousands of…

Populations and Evolution · Quantitative Biology 2025-04-07 Emilee Walden, Jiahui Chen, Guo-Wei Wei

Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty…

Machine Learning · Statistics 2023-04-25 Steven Winter, Trevor Campbell, Lizhen Lin, Sanvesh Srivastava, David B. Dunson

Multidimensional Scaling (MDS) is a classic technique that seeks vectorial representations for data points, given the pairwise distances between them. However, in recent years, data are usually collected from diverse sources or have…

Computer Vision and Pattern Recognition · Computer Science 2017-08-29 Song Bai, Xiang Bai, Longin Jan Latecki, Qi Tian

With continued advances in Geographic Information Systems and related computational technologies, statisticians are often required to analyze very large spatial datasets. This has generated substantial interest over the last decade, already…

Methodology · Statistics 2019-05-14 Lu Zhang, Abhirup Datta, Sudipto Banerjee

Large-scale observational health databases are increasingly popular for conducting comparative effectiveness and safety studies of medical products. However, increasing number of patients poses computational challenges when fitting survival…

Computation · Statistics 2023-10-26 Jianxiao Yang, Martijn J. Schuemie, Xiang Ji, Marc A. Suchard

Multivariate Gaussian processes (GPs) offer a powerful probabilistic framework to represent complex interdependent phenomena. They pose, however, significant computational challenges in high-dimensional settings, which frequently arise in…
