English

Imputing Missing Values with External Data

Methodology 2024-10-07 v1

Abstract

Missing data is a common challenge across scientific disciplines. Current imputation methods require the availability of individual data to impute missing values. Often, however, missingness requires using external data for the imputation. In this paper, we introduce a new Stata command, mi impute from, designed to impute missing values using linear predictors and their related covariance matrix from imputation models estimated in one or multiple external studies. This allows for the imputation of any missing values without sharing individual data between studies. We describe the underlying method and present the syntax of mi impute from alongside practical examples of missing data in collaborative research projects.

Keywords

Cite

@article{arxiv.2410.02982,
  title  = {Imputing Missing Values with External Data},
  author = {Robert Thiesmeier and Matteo Bottai and Nicola Orsini},
  journal= {arXiv preprint arXiv:2410.02982},
  year   = {2024}
}

Comments

Submitted to the Stata Journal