Related papers: Multiple imputation for longitudinal data: A tutor…

Multiple imputation in data that grow over time: A comparison of three strategies

Multiple imputation is a highly recommended technique to deal with missing data, but the application to longitudinal datasets can be done in multiple ways. When a new wave of longitudinal data arrives, we can treat the combined data of…

Methodology · Statistics 2026-05-18 X. M. Kavelaars , S. van Buuren , J. R. van Ginkel

Implementing multiple imputation for missing data in longitudinal studies when models are not feasible: A tutorial on the random hot deck approach

Objective: Researchers often use model-based multiple imputation to handle missing at random data to minimize bias while making the best use of all available data. However, there are sometimes constraints within the data that make…

Methodology · Statistics 2020-11-03 Chinchin Wang , Tyrel Stokes , Russell Steele , Niels Wedderkopp , Ian Shrier

Multiple imputation using dimension reduction techniques for high-dimensional data

Missing data present challenges in data analysis. Naive analyses such as complete-case and available-case analysis may introduce bias and loss of efficiency, and produce unreliable results. Multiple imputation (MI) is one of the most widely…

Methodology · Statistics 2019-05-15 Domonique W. Hodge , Sandra E. Safo , Qi Long

Missing data imputation for a multivariate outcome of mixed variable types

Data collected in clinical trials are often composed of multiple types of variables. For example, laboratory measurements and vital signs are longitudinal data of continuous or categorical variables, adverse events may be recurrent events,…

Methodology · Statistics 2023-01-12 Tuo Wang , Rachel Zilinskas , Ying Li , Yongming Qu

Clustering with missing data: which equivalent for Rubin's rules?

Multiple imputation (MI) is a popular method for dealing with missing values. However, the suitable way for applying clustering after MI remains unclear: how to pool partitions? How to assess the clustering instability when data are…

Methodology · Statistics 2022-05-16 Vincent Audigier , Ndèye Niang

Time-dependent Iterative Imputation for Multivariate Longitudinal Clinical Data

Missing data is a major challenge in clinical research. In electronic medical records, often a large fraction of the values in laboratory tests and vital signs are missing. The missingness can lead to biased estimates and limit our ability…

Machine Learning · Computer Science 2023-04-18 Omer Noy , Ron Shamir

Multiple Improvements of Multiple Imputation Likelihood Ratio Tests

Multiple imputation (MI) inference handles missing data by imputing the missing values $m$ times, and then combining the results from the $m$ complete-data analyses. However, the existing method for combining likelihood ratio tests (LRTs)…

Statistics Theory · Mathematics 2022-01-03 Kin Wai Chan , Xiao-Li Meng

Imputation and Missing Indicators for handling missing data in the development and implementation of clinical prediction models: a simulation study

Background: Existing guidelines for handling missing data are generally not consistent with the goals of prediction modelling, where missing data can occur at any stage of the model pipeline. Multiple imputation (MI), often heralded as the…

Methodology · Statistics 2022-06-27 Rose Sisk , Matthew Sperrin , Niels Peek , Maarten van Smeden , Glen P. Martin

Sensitivity analysis in longitudinal clinical trials via distributional imputation

Missing data is inevitable in longitudinal clinical trials. Conventionally, the missing at random assumption is assumed to handle missingness, which however is unverifiable empirically. Thus, sensitivity analysis is critically important to…

Methodology · Statistics 2022-03-18 Siyi Liu , Shu Yang , Yilong Zhang , Guanghan , Liu

Evaluation of approaches for accommodating interactions and non-linear terms in multiple imputation of incomplete three-level data

Three-level data structures arising from repeated measures on individuals clustered within larger units are common in health research studies. Missing data are prominent in such studies and are often handled via multiple imputation (MI).…

Methodology · Statistics 2020-11-02 Rushani Wijesuriya , Margarita Moreno-Betancur , John B. Carlin , Anurika P. De Silva , Katherine J. Lee

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing data in a principled fashion. This paper presents an overview of multiple imputation, including important theoretical results and their practical implications for…

Methodology · Statistics 2018-01-15 Jared S. Murray

Multiple Imputation Methods for Missing Multilevel Ordinal Outcomes

Multiple imputation (MI) is an established technique to handle missing data in observational studies. Joint modeling (JM) and fully conditional specification (FCS) are commonly used methods for imputing multilevel clustered data. However,…

Methodology · Statistics 2022-09-28 Mei Dong , Aya Mitani

Evaluation of multiple imputation to address intended and unintended missing data in case-cohort studies with a binary endpoint

Case-cohort studies are conducted within cohort studies, wherein collection of exposure data is limited to a subset of the cohort, leading to a large proportion of missing data by design. Standard analysis uses inverse probability weighting…

Methodology · Statistics 2022-10-21 Melissa Middleton , Cattram Nguyen , John B. Carlin , Margarita Moreno-Betancur , Katherine J. Lee

Multiple imputation of multilevel missing data: An introduction to the R package pan

The treatment of missing data can be difficult in multilevel research because state-of-the-art procedures such as multiple imputation (MI) may require advanced statistical knowledge or a high degree of familiarity with certain statistical…

Computation · Statistics 2016-11-11 Simon Grund , Oliver Lüdtke , Alexander Robitzsch

G-formula for causal inference via multiple imputation

G-formula is a popular approach for estimating treatment or exposure effects from longitudinal data that are subject to time-varying confounding. G-formula estimation is typically performed by Monte-Carlo simulation, with non-parametric…

Methodology · Statistics 2023-10-12 Jonathan W. Bartlett , Camila Olarte Parra , Emily Granger , Ruth H. Keogh , Erik W. van Zwet , Rhian M. Daniel

The Bias and Efficiency of Incomplete-Data Estimators in Small Univariate Normal Samples

Widely used methods for analyzing missing data can be biased in small samples. To understand these biases, we evaluate in detail the situation where a small univariate normal sample, with values missing at random, is analyzed using either…

Statistics Theory · Mathematics 2017-03-27 Paul T. von Hippel

Handling missing values in cost-effectiveness analyses that use data from cluster randomised trials

Public policy-makers use cost-effectiveness analyses (CEA) to decide which health and social care interventions to provide. Appropriate methods have not been developed for handling missing data in complex settings, such as for CEA that use…

Methodology · Statistics 2012-06-27 Karla Diaz-Ordaz , Michael G. Kenward , Richard Grieve

A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Missing data remains a very common problem in large datasets, including survey and census data containing many ordinal responses, such as political polls and opinion surveys. Multiple imputation (MI) is usually the go-to approach for…

Methodology · Statistics 2024-12-25 Chayut Wongkamthong , Olanrewaju Akande

Variational Bayesian Multiple Imputation in High-Dimensional Regression Models With Missing Responses

Multiple imputation has become one of the standard methods in drawing inferences in many incomplete data applications. Applications of multiple imputation in relatively more complex settings, such as high-dimensional clustered data, require…

Methodology · Statistics 2025-04-08 Qiushuang Li , Recai Yucel

General and Feasible Tests with Multiply-Imputed Datasets

Multiple imputation (MI) is a technique especially designed for handling missing data in public-use datasets. It allows analysts to perform incomplete-data inference straightforwardly by using several already imputed datasets released by…

Methodology · Statistics 2022-01-03 Kin Wai Chan