Related papers: On Data Analysis Pipelines and Modular Bayesian Mo…

A General Framework for Cutting Feedback within Modularised Bayesian Inference

Standard Bayesian inference can build models that combine information from various sources, but this inference may not be reliable if components of a model are misspecified. Cut inference, as a particular type of modularized Bayesian…

Methodology · Statistics 2026-03-18 Yang Liu , Robert J. B. Goudie

Bayesian inference in hierarchical models by combining independent posteriors

Hierarchical models are versatile tools for joint modeling of data sets arising from different, but related, sources. Fully Bayesian inference may, however, become computationally prohibitive if the source-specific data models are complex,…

Computation · Statistics 2016-05-06 Ritabrata Dutta , Paul Blomstedt , Samuel Kaski

Modularized Bayesian analyses and cutting feedback in likelihood-free inference

There has been much recent interest in modifying Bayesian inference for misspecified models so that it is useful for specific purposes. One popular modified Bayesian inference method is "cutting feedback" which can be used when the model…

Methodology · Statistics 2022-03-21 Atlanta Chakraborty , David J. Nott , Christopher Drovandi , David T. Frazier , Scott A. Sisson

Cutting feedback and modularized analyses in generalized Bayesian inference

This work considers Bayesian inference under misspecification for complex statistical models comprised of simpler submodels, referred to as modules, that are coupled together. Such ``multi-modular" models often arise when combining…

Statistics Theory · Mathematics 2023-08-02 David T. Frazier , David J. Nott

Bayesian Workflow

The Bayesian approach to data analysis provides a powerful way to handle uncertainty in all observations, model parameters, and model structure using probability theory. Probabilistic programming languages make it easier to specify and fit…

Methodology · Statistics 2020-11-04 Andrew Gelman , Aki Vehtari , Daniel Simpson , Charles C. Margossian , Bob Carpenter , Yuling Yao , Lauren Kennedy , Jonah Gabry , Paul-Christian Bürkner , Martin Modrák

Posterior risk of modular and semi-modular Bayesian inference

Modular Bayesian methods perform inference in models that are specified through a collection of coupled sub-models, known as modules. These modules often arise from modelling different data sources or from combining domain knowledge from…

Methodology · Statistics 2024-11-26 David T. Frazier , David J. Nott

BayesFlow: Learning complex stochastic models with invertible neural networks

Estimating the parameters of mathematical models is a common problem in almost all branches of science. However, this problem can prove notably difficult when processes and model descriptions become increasingly complex and an explicit…

Machine Learning · Statistics 2024-02-09 Stefan T. Radev , Ulf K. Mertens , Andreas Voss , Lynton Ardizzone , Ullrich Köthe

Stochastic Approximation Cut Algorithm for Inference in Modularized Bayesian Models

Bayesian modelling enables us to accommodate complex forms of data and make a comprehensive inference, but the effect of partial misspecification of the model is a concern. One approach in this setting is to modularize the model, and…

Methodology · Statistics 2026-03-18 Yang Liu , Robert J. B. Goudie

Bayes Linear Analysis for Statistical Modelling with Uncertain Inputs

Statistical models typically capture uncertainties in our knowledge of the corresponding real-world processes, however, it is less common for this uncertainty specification to capture uncertainty surrounding the values of the inputs to the…

Methodology · Statistics 2023-05-10 Samuel E. Jackson , David C. Woods

Hierarchical Bayesian data selection

There are many issues that can cause problems when attempting to infer model parameters from data. Data and models are both imperfect, and as such there are multiple scenarios in which standard methods of inference will lead to misleading…

Computation · Statistics 2024-05-01 Simon L. Cotter

Statistical Testing Framework for Clustering Pipelines by Selective Inference

A data analysis pipeline is a structured sequence of steps that transforms raw data into meaningful insights by integrating multiple analysis algorithms. In many practical applications, analytical findings are obtained only after data pass…

Machine Learning · Statistics 2026-05-04 Yugo Miyata , Tomohiro Shiraishi , Shuichi Nishino , Ichiro Takeuchi

Quantifying contribution and propagation of error from computational steps, algorithms and hyperparameter choices in image classification pipelines

Data science relies on pipelines that are organized in the form of interdependent computational steps. Each step consists of various candidate algorithms that maybe used for performing a particular function. Each algorithm consists of…

Computer Vision and Pattern Recognition · Computer Science 2019-03-04 Aritra Chowdhury , Malik Magdon-Ismail , Bulent Yener

A Bayesian Approach to the Partitioning of Workflows

When partitioning workflows in realistic scenarios, the knowledge of the processing units is often vague or unknown. A naive approach to addressing this issue is to perform many controlled experiments for different workloads, each…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-11-03 Freddy C. Chua , Bernardo A. Huberman

Automated Evolutionary Approach for the Design of Composite Machine Learning Pipelines

The effectiveness of the machine learning methods for real-world tasks depends on the proper structure of the modeling pipeline. The proposed approach is aimed to automate the design of composite machine learning pipelines, which is…

Machine Learning · Computer Science 2021-09-09 Nikolay O. Nikitin , Pavel Vychuzhanin , Mikhail Sarafanov , Iana S. Polonskaia , Ilia Revin , Irina V. Barabanova , Gleb Maximov , Anna V. Kalyuzhnaya , Alexander Boukhanovsky

BayesFlow: Amortized Bayesian Workflows With Neural Networks

Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows…

Machine Learning · Computer Science 2023-07-12 Stefan T Radev , Marvin Schmitt , Lukas Schumacher , Lasse Elsemüller , Valentin Pratz , Yannik Schälte , Ullrich Köthe , Paul-Christian Bürkner

Statistical Test for Feature Selection Pipelines by Selective Inference

A data analysis pipeline is a structured sequence of steps that transforms raw data into meaningful insights by integrating various analysis algorithms. In this paper, we propose a novel statistical test to assess the significance of data…

Machine Learning · Statistics 2024-10-15 Tomohiro Shiraishi , Tatsuya Matsukawa , Shuichi Nishino , Ichiro Takeuchi

Bayesian Optimization for Design Parameters of 3D Image Data Analysis

Deep learning-based segmentation and classification are crucial to large-scale biomedical imaging, particularly for 3D data, where manual analysis is impractical. Although many methods exist, selecting suitable models and tuning parameters…

Computer Vision and Pattern Recognition · Computer Science 2026-02-18 David Exler , Joaquin Eduardo Urrutia Gómez , Martin Krüger , Maike Schliephake , John Jbeily , Mario Vitacolonna , Rüdiger Rudolf , Markus Reischl

Quantifying error contributions of computational steps, algorithms and hyperparameter choices in image classification pipelines

Data science relies on pipelines that are organized in the form of interdependent computational steps. Each step consists of various candidate algorithms that maybe used for performing a particular function. Each algorithm consists of…

Computer Vision and Pattern Recognition · Computer Science 2019-03-07 Aritra Chowdhury , Malik Magdin-Ismail , Bulent Yener

A strategy to avoid particle depletion in recursive Bayesian inference

Recursive Bayesian inference, in which posterior beliefs are updated in light of accumulating data, is a tool for implementing Bayesian models in applications with streaming and/or very large data sets. As the posterior of one iteration…

Methodology · Statistics 2025-08-05 Henry R. Scharf

FLASH: Fast Bayesian Optimization for Data Analytic Pipelines

Modern data science relies on data analytic pipelines to organize interdependent computational steps. Such analytic pipelines often involve different algorithms across multiple steps, each with its own hyperparameters. To achieve the best…

Machine Learning · Computer Science 2016-06-27 Yuyu Zhang , Mohammad Taha Bahadori , Hang Su , Jimeng Sun