Related papers: Bootstrapping and Multiple Imputation Ensemble App…

Bootstrap Inference when Using Multiple Imputation

Many modern estimators require bootstrapping to calculate confidence intervals because either no analytic standard error is available or the distribution of the parameter of interest is non-symmetric. It remains however unclear how to…

Methodology · Statistics 2018-09-13 Michael Schomaker , Christian Heumann

Multiple Imputation Methods under Extreme Values

Missing data are ubiquitous in empirical databases, yet statistical analyses typically require complete data matrices. Multiple imputation offers a principled solution for filling these gaps. This study evaluates the performance of several…

Computation · Statistics 2026-02-05 Enzo Porto Brasil

A comparison of multiple imputation methods for bivariate hierarchical outcomes

Missing observations are common in cluster randomised trials. Approaches taken to handling such missing data include: complete case analysis, single-level multiple imputation that ignores the clustering, multiple imputation with a fixed…

Methodology · Statistics 2014-07-18 Karla Diaz-Ordaz , Michael G. Kenward , Manuel Gomes , Richard Grieve

Multiple imputation in data that grow over time: A comparison of three strategies

Multiple imputation is a highly recommended technique to deal with missing data, but the application to longitudinal datasets can be done in multiple ways. When a new wave of longitudinal data arrives, we can treat the combined data of…

Methodology · Statistics 2026-05-18 X. M. Kavelaars , S. van Buuren , J. R. van Ginkel

Meta-Imputation Balanced (MIB): An Ensemble Approach for Handling Missing Data in Biomedical Machine Learning

Missing data represents a fundamental challenge in machine learning applications, often reducing model performance and reliability. This problem is particularly acute in fields like bioinformatics and clinical machine learning, where…

Machine Learning · Computer Science 2025-09-04 Fatemeh Azad , Zoran Bosnić , Matjaž Kukar

Accounting for model uncertainty in multiple imputation under complex sampling

Multiple imputation provides an effective way to handle missing data. When several possible models are under consideration for the data, the multiple imputation is typically performed under a single-best model selected from the candidate…

Methodology · Statistics 2018-11-30 Gyuhyeong Goh , Jae Kwang Kim

An ensemble learning method for variable selection: application to high dimensional data and missing values

Standard approaches for variable selection in linear models are not tailored to deal properly with high-dimensional and incomplete data. Currently, methods dedicated to high-dimensional data handle missing values by ad-hoc strategies, like…

Methodology · Statistics 2021-06-09 Avner Bar-Hen , Vincent Audigier

Combining missing data imputation and internal validation in clinical risk prediction models

Methods to handle missing data have been extensively explored in the context of estimation and descriptive studies, with multiple imputation being the most widely used method in clinical research. However, in the context of clinical risk…

Methodology · Statistics 2024-11-25 Junhui Mi , Rahul D. Tendulkar , Sarah M. C. Sittenfeld , Sujata Patil , Emily C. Zabor

Bootstrap Inference for Multiple Imputation under Uncongeniality and Misspecification

Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin's simple combination rules. These give frequentist valid inferences when the…

Methodology · Statistics 2019-11-28 Jonathan W. Bartlett , Rachael A. Hughes

A Unified Inference Framework for Multiple Imputation Using Martingales

Multiple imputation is widely used to handle missing data. Although Rubin's combining rule is simple, it is not clear whether or not the standard multiple imputation inference is consistent when coupled with the commonly-used full sample…

Methodology · Statistics 2023-01-03 Qian Guan , Shu Yang

Handling missing data in model-based clustering

Gaussian Mixture models (GMMs) are a powerful tool for clustering, classification and density estimation when clustering structures are embedded in the data. The presence of missing values can largely impact the GMMs estimation process,…

Machine Learning · Statistics 2020-06-05 Alessio Serafini , Thomas Brendan Murphy , Luca Scrucca

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing data in a principled fashion. This paper presents an overview of multiple imputation, including important theoretical results and their practical implications for…

Methodology · Statistics 2018-01-15 Jared S. Murray

Enhancing Data Completeness in Time Series: Imputation Strategies for Missing Data Using Significant Periodically Correlated Components

Missing data is a pervasive issue in statistical analyses, affecting the reliability and validity of research across diverse scientific disciplines. Failure to adequately address missing data can lead to biased estimates and consequently…

Methodology · Statistics 2025-05-06 Asmaa Ahmad , Eric J Rose , Michael Roy , Edward Valachovic

Dealing with missing data using attention and latent space regularization

Most practical data science problems encounter missing data. A wide variety of solutions exist, each with strengths and weaknesses that depend upon the missingness-generating process. Here we develop a theoretical framework for training and…

Machine Learning · Computer Science 2022-11-15 Jahan C. Penny-Dimri , Christoph Bergmeir , Julian Smith

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping

In machine learning, an agent needs to estimate uncertainty to efficiently explore and adapt and to make effective decisions. A common approach to uncertainty estimation maintains an ensemble of models. In recent years, several approaches…

Machine Learning · Computer Science 2022-06-09 Vikranth Dwaracherla , Zheng Wen , Ian Osband , Xiuyuan Lu , Seyed Mohammad Asghari , Benjamin Van Roy

No imputation without representation

By filling in missing values in datasets, imputation allows these datasets to be used with algorithms that cannot handle missing values by themselves. However, missing values may in principle contribute useful information that is lost…

Machine Learning · Computer Science 2024-10-31 Oliver Urs Lenz , Daniel Peralta , Chris Cornelis

Beyond Accuracy: An Empirical Study of Uncertainty Estimation in Imputation

Handling missing data is a central challenge in data-driven analysis. Modern imputation methods not only aim for accurate reconstruction but also differ in how they represent and quantify uncertainty. Yet, the reliability and calibration of…

Databases · Computer Science 2025-11-27 Zarin Tahia Hossain , Mostafa Milani

Treatment bootstrapping: A new approach to quantify uncertainty of average treatment effect estimates

This paper proposes a new non-parametric bootstrap method to quantify the uncertainty of average treatment effect estimate for the treated from matching estimators. More specifically, it seeks to quantify the uncertainty associated with the…

Methodology · Statistics 2024-08-21 Jing Li

Integrating multi-source block-wise missing data in model selection

For multi-source data, blocks of variable information from certain sources are likely missing. Existing methods for handling missing data do not take structures of block-wise missing data into consideration. In this paper, we propose a…

Methodology · Statistics 2020-04-07 Fei Xue , Annie Qu

MAIN: Multihead-Attention Imputation Networks

The problem of missing data, usually absent incurated and competition-standard datasets, is an unfortunate reality for most machine learning models used in industry applications. Recent work has focused on understanding the nature and the…

Machine Learning · Computer Science 2022-01-25 Spyridon Mouselinos , Kyriakos Polymenakos , Antonis Nikitakis , Konstantinos Kyriakopoulos