Related papers: Bayesian inference via sparse Hamiltonian flows

Bayesian Coresets: Revisiting the Nonconvex Optimization Perspective

Bayesian coresets have emerged as a promising approach for implementing scalable Bayesian inference. The Bayesian coreset problem involves selecting a (weighted) subset of the data samples, such that the posterior inference using the…

Machine Learning · Statistics 2021-03-01 Jacky Y. Zhang, Rajiv Khanna, Anastasios Kyrillidis, Oluwasanmi Koyejo

Sparse Variational Inference: Bayesian Coresets from Scratch

The proliferation of automated inference algorithms in Bayesian statistics has provided practitioners newfound access to fast, reproducible data analysis and powerful statistical models. Designing automated methods that are also both…

Machine Learning · Statistics 2019-10-29 Trevor Campbell, Boyan Beronov

Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement

Bayesian coresets approximate a posterior distribution by building a small weighted subset of the data points. Any inference procedure that is too computationally expensive to be run on the full posterior can instead be run inexpensively on…

Machine Learning · Statistics 2023-01-18 Cian Naik, Judith Rousseau, Trevor Campbell

Coresets for Scalable Bayesian Logistic Regression

The use of Bayesian methods in large-scale data settings is attractive because of the rich hierarchical models, uncertainty quantification, and prior specification they provide. Standard Bayesian inference algorithms are computationally…

Computation · Statistics 2017-02-07 Jonathan H. Huggins, Trevor Campbell, Tamara Broderick

On Divergence Measures for Bayesian Pseudocoresets

A Bayesian pseudocoreset is a small synthetic dataset for which the posterior over parameters approximates that of the original dataset. While promising, the scalability of Bayesian pseudocoresets is not yet validated in realistic problems…

Machine Learning · Computer Science 2022-10-13 Balhae Kim, Jungwon Choi, Seanie Lee, Yoonho Lee, Jung-Woo Ha, Juho Lee

Wasserstein Measure Coresets

The proliferation of large data sets and Bayesian inference techniques motivates demand for better data sparsification. Coresets provide a principled way of summarizing a large dataset via a smaller one that is guaranteed to match the…

Machine Learning · Statistics 2020-03-04 Sebastian Claici, Aude Genevay, Justin Solomon

Function Space Bayesian Pseudocoreset for Bayesian Neural Networks

A Bayesian pseudocoreset is a compact synthetic dataset summarizing essential information of a large-scale dataset and thus can be used as a proxy dataset for scalable Bayesian inference. Typically, a Bayesian pseudocoreset is constructed…

Machine Learning · Computer Science 2023-10-30 Balhae Kim, Hyungi Lee, Juho Lee

Black-box Coreset Variational Inference

Recent advances in coreset methods have shown that a selection of representative datapoints can replace massive volumes of data for Bayesian inference, preserving the relevant statistical information and significantly accelerating…

Machine Learning · Statistics 2023-01-18 Dionysis Manousakas, Hippolyt Ritter, Theofanis Karaletsos

Bayesian Pseudo-Coresets via Contrastive Divergence

Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge,…

Machine Learning · Computer Science 2024-05-10 Piyush Tiwary, Kumar Shubham, Vivek V. Kashyap, Prathosh A. P

Automated Scalable Bayesian Inference via Hilbert Coresets

The automation of posterior inference in Bayesian data analysis has enabled experts and nonexperts alike to use more sophisticated models, engage in faster exploratory modeling and analysis, and ensure experimental reproducibility. However,…

Machine Learning · Statistics 2019-03-01 Trevor Campbell, Tamara Broderick

Bayesian Inverse Problems Meet Flow Matching: Efficient and Flexible Inference via Transformers

The efficient resolution of Bayesian inverse problems remains challenging due to the high computational cost of traditional sampling methods. In this paper, we propose a novel framework that integrates Conditional Flow Matching (CFM) with a…

Machine Learning · Computer Science 2025-05-20 Daniil Sherki, Ivan Oseledets, Ekaterina Muravleva

BayesFlow: Amortized Bayesian Workflows With Neural Networks

Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows…

Machine Learning · Computer Science 2023-07-12 Stefan T Radev, Marvin Schmitt, Lukas Schumacher, Lasse Elsemüller, Valentin Pratz, Yannik Schälte, Ullrich Köthe, Paul-Christian Bürkner

Bayesian sparsity and class sparsity priors for dictionary learning and coding

Dictionary learning methods continue to gain popularity for the solution of challenging inverse problems. In the dictionary learning approach, the computational forward model is replaced by a large dictionary of possible outcomes, and the…

Machine Learning · Statistics 2023-09-06 Alberto Bocchinfuso, Daniela Calvetti, Erkki Somersalo

Fair Wasserstein Coresets

Data distillation and coresets have emerged as popular approaches to generate a smaller representative set of samples for downstream learning tasks to handle large-scale datasets. At the same time, machine learning is being increasingly…

Machine Learning · Statistics 2024-10-31 Zikai Xiong, Niccolò Dalmasso, Shubham Sharma, Freddy Lecue, Daniele Magazzeni, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

A Novel Sequential Coreset Method for Gradient Descent Algorithms

A wide range of optimization problems arising in machine learning can be solved by gradient descent algorithms, and a central question in this area is how to efficiently compress a large-scale dataset so as to reduce the computational…

Machine Learning · Computer Science 2022-10-11 Jiawei Huang, Ruomin Huang, Wenjie Liu, Nikolaos M. Freris, Hu Ding

Learning with Sparsely Permuted Data: A Robust Bayesian Approach

Data dispersed across multiple files are commonly integrated through probabilistic linkage methods, where even minimal error rates in record matching can significantly contaminate subsequent statistical analyses. In regression problems, we…

Statistics Theory · Mathematics 2024-09-18 Abhisek Chakraborty, Saptati Datta

Variational Bayesian Pseudo-Coreset

The success of deep learning requires large datasets and extensive training, which can create significant computational challenges. To address these challenges, pseudo-coresets, small learnable datasets that mimic the entire data, have been…

Machine Learning · Computer Science 2025-03-03 Hyungi Lee, Seungyoo Lee, Juho Lee

Physics-Constrained Bayesian Neural Network for Fluid Flow Reconstruction with Sparse and Noisy Data

In many applications, flow measurements are usually sparse and possibly noisy. The reconstruction of a high-resolution flow field from limited and imperfect flow information is significant yet challenging. In this work, we propose an…

Computational Physics · Physics 2020-01-17 Luning Sun, Jian-Xun Wang

$\beta$-Cores: Robust Large-Scale Bayesian Data Summarization in the Presence of Outliers

Modern machine learning applications should be able to address the intrinsic challenges arising over inference on massive real-world datasets, including scalability and robustness to outliers. Despite the multiple benefits of Bayesian…

Machine Learning · Computer Science 2020-11-10 Dionysis Manousakas, Cecilia Mascolo

Scalable k-Means Clustering via Lightweight Coresets

Coresets are compact representations of data sets such that models trained on a coreset are provably competitive with models trained on the full data set. As such, they have been successfully used to scale up clustering models to massive…

Machine Learning · Statistics 2018-06-08 Olivier Bachem, Mario Lucic, Andreas Krause