Related papers: Multiple imputation using dimension reduction tech…

High-dimensional Imputation for the Social Sciences: a Comparison of State-of-the-art Methods

Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been…

Methodology · Statistics 2023-08-15 Edoardo Costantini, Kyle M. Lang, Tim Reeskens, Klaas Sijtsma

Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data

Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample…

Machine Learning · Computer Science 2022-12-23 Zongyu Dai, Zhiqi Bu, Qi Long

The Bias and Efficiency of Incomplete-Data Estimators in Small Univariate Normal Samples

Widely used methods for analyzing missing data can be biased in small samples. To understand these biases, we evaluate in detail the situation where a small univariate normal sample, with values missing at random, is analyzed using either…

Statistics Theory · Mathematics 2017-03-27 Paul T. von Hippel

Multiple Imputation with Massive Data: An Application to the Panel Study of Income Dynamics

Multiple imputation (MI) is a popular and well-established method for handling missing data in multivariate data sets, but its practicality for use in massive and complex data sets has been questioned. One such data set is the Panel Study…

Methodology · Statistics 2021-08-18 Yajuan Si, Steve Heeringa, David Johnson, Roderick Little, Wenshuo Liu, Fabian Pfeffer, Trivellore Raghunathan

Supervised dimensionality reduction for multiple imputation by chained equations

Multivariate imputation by chained equations (MICE) is one of the most popular approaches to address missing values in a data set. This approach requires specifying a univariate imputation model for every variable under imputation. The…

Methodology · Statistics 2023-11-01 Edoardo Costantini, Kyle M. Lang, Klaas Sijtsma

Multiple imputation with missing data indicators

Multiple imputation is a well-established general technique for analyzing data with missing values. A convenient way to implement multiple imputation is sequential regression multiple imputation (SRMI), also called chained equations…

Methodology · Statistics 2021-03-04 Lauren J Beesley, Irina Bondarenko, Michael R Elliott, Allison W Kurian, Steven J Katz, Jeremy M G Taylor

Multiple Imputation Method for High-Dimensional Neuroimaging Data

Missingness is a common issue for neuroimaging data, and neglecting it in downstream statistical analysis can introduce bias and lead to misguided inferential conclusions. It is therefore crucial to conduct appropriate statistical methods…

Methodology · Statistics 2025-03-25 Tong Lu, Chixiang Chen, Hsin-Hsiung Huang, Peter Kochunov, Elliot Hong, Shuo Chen

Maximum likelihood multiple imputation: Faster imputations and consistent standard errors without posterior draws

Multiple imputation (MI) is a method for repairing and analyzing data with missing values. MI replaces missing values with a sample of random values drawn from an imputation model. The most popular form of MI, which we call posterior draw…

Methodology · Statistics 2019-11-18 Paul T. von Hippel, Jonathan Bartlett

Imputation and Missing Indicators for handling missing data in the development and implementation of clinical prediction models: a simulation study

Background: Existing guidelines for handling missing data are generally not consistent with the goals of prediction modelling, where missing data can occur at any stage of the model pipeline. Multiple imputation (MI), often heralded as the…

Methodology · Statistics 2022-06-27 Rose Sisk, Matthew Sperrin, Niels Peek, Maarten van Smeden, Glen P. Martin

Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems

Missing data are present in most real world problems and need careful handling to preserve the prediction accuracy and statistical consistency in the downstream analysis. As the gold standard of handling missing data, multiple imputation…

Machine Learning · Computer Science 2021-12-23 Zongyu Dai, Zhiqi Bu, Qi Long

Multiple imputation for longitudinal data: A tutorial

Longitudinal studies are frequently used in medical research and involve collecting repeated measures on individuals over time. Observations from the same individual are invariably correlated and thus an analytic approach that accounts for…

Methodology · Statistics 2024-04-11 Rushani Wijesuriya, Margarita Moreno-Betancur, John B Carlin, Ian R White, Matteo Quartagno, Katherine J Lee

MIDA: Multiple Imputation using Denoising Autoencoders

Missing data is a significant problem impacting all domains. State-of-the-art framework for minimizing missing data bias is multiple imputation, for which the choice of an imputation model remains nontrivial. We propose a multiple…

Machine Learning · Computer Science 2018-02-20 Lovedeep Gondara, Ke Wang

Solving the "many variables" problem in MICE with principal component regression

Multiple Imputation (MI) is one of the most popular approaches to addressing missing values in questionnaires and surveys. MI with multivariate imputation by chained equations (MICE) allows flexible imputation of many types of data. In…

Methodology · Statistics 2023-04-24 Edoardo Costantini, Kyle M. Lang, Klaas Sijtsma, Tim Reeskens

Evaluation of multiple imputation to address intended and unintended missing data in case-cohort studies with a binary endpoint

Case-cohort studies are conducted within cohort studies, wherein collection of exposure data is limited to a subset of the cohort, leading to a large proportion of missing data by design. Standard analysis uses inverse probability weighting…

Methodology · Statistics 2022-10-21 Melissa Middleton, Cattram Nguyen, John B. Carlin, Margarita Moreno-Betancur, Katherine J. Lee

Are deep learning models superior for missing data imputation in large surveys? Evidence from an empirical comparison

Multiple imputation (MI) is a popular approach for dealing with missing data arising from non-response in sample surveys. Multiple imputation by chained equations (MICE) is one of the most widely used MI algorithms for multivariate data,…

Machine Learning · Computer Science 2022-03-22 Zhenhua Wang, Olanrewaju Akande, Jason Poulos, Fan Li

How to apply multiple imputation in propensity score matching with partially observed confounders: a simulation study and practical recommendations

Propensity score matching (PSM) has been widely used to mitigate confounding in observational studies, although complications arise when the covariates used to estimate the PS are only partially observed. Multiple imputation (MI) is a…

Applications · Statistics 2021-07-22 Albee Y. Ling, Maria E. Montez-Rath, Maya B. Mathur, Kris Kapphahn, Manisha Desai

Multiple imputation of missing covariate values in multilevel models with random slopes: A cautionary note

Multiple imputation (MI) has become one of the main procedures used to treat missing data, but the guidelines from the methodological literature are not easily transferred to multilevel research. For models including random slopes, proper…

Methodology · Statistics 2016-06-30 Simon Grund, Oliver Lüdtke, Alexander Robitzsch

A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Missing data remains a very common problem in large datasets, including survey and census data containing many ordinal responses, such as political polls and opinion surveys. Multiple imputation (MI) is usually the go-to approach for…

Methodology · Statistics 2024-12-25 Chayut Wongkamthong, Olanrewaju Akande

MISNN: Multiple Imputation via Semi-parametric Neural Networks

Multiple imputation (MI) has been widely applied to missing value problems in biomedical, social and econometric research, in order to avoid improper inference in the downstream data analysis. In the presence of high-dimensional data,…

Methodology · Statistics 2023-05-04 Zhiqi Bu, Zongyu Dai, Yiliang Zhang, Qi Long

Internal Data Imputation in Data Warehouse Dimensions

Missing values occur commonly in the multidimensional data warehouses. They may generate problems of usefulness of data since the analysis performed on a multidimensional data warehouse is through different dimensions with hierarchies where…

Databases · Computer Science 2021-10-05 Yuzhao Yang, Fatma Abdelhedi, Jérôme Darmont, Franck Ravat, Olivier Teste