Related papers: Choosing Imputation Models

Imputation of missing data using multivariate Gaussian Linear Cluster-Weighted Modeling

Missing data arises when certain values are not recorded or observed for variables of interest. However, most of the statistical theory assume complete data availability. To address incomplete databases, one approach is to fill the gaps…

Methodology · Statistics 2023-08-15 Luis Alejandro Masmela-Caita , Thais Paiva Galletti , Marcos Oliveira Prates

MatchThem:: Matching and Weighting after Multiple Imputation

Balancing the distributions of the confounders across the exposure levels in an observational study through matching or weighting is an accepted method to control for confounding due to these variables when estimating the association…

Methodology · Statistics 2021-08-24 Farhad Pishgar , Noah Greifer , Clémence Leyrat , Elizabeth Stuart

Imputation of Missing Data Using Linear Gaussian Cluster-Weighted Modeling

Missing data theory deals with the statistical methods in the occurrence of missing data. Missing data occurs when some values are not stored or observed for variables of interest. However, most of the statistical theory assumes that data…

Methodology · Statistics 2021-10-26 Luis Alejandro Masmela-Caita , Thais Paiva Galletti , Marcos Oliveira Prates

Regression-based imputation of explanatory discrete missing data

Imputation of missing values is a strategy for handling non-responses in surveys or data loss in measurement processes, which may be more effective than ignoring them. When the variable represents a count, the literature dealing with this…

Applications · Statistics 2020-07-31 Gilma Hernández-Herrera , Albert Navarro , David Moriña

Imputation estimators for unnormalized models with missing data

Several statistical models are given in the form of unnormalized densities, and calculation of the normalization constant is intractable. We propose estimation methods for such unnormalized models with missing data. The key concept is to…

Machine Learning · Statistics 2020-06-11 Masatoshi Uehara , Takeru Matsuda , Jae Kwang Kim

Accounting for model uncertainty in multiple imputation under complex sampling

Multiple imputation provides an effective way to handle missing data. When several possible models are under consideration for the data, the multiple imputation is typically performed under a single-best model selected from the candidate…

Methodology · Statistics 2018-11-30 Gyuhyeong Goh , Jae Kwang Kim

Imputations for High Missing Rate Data in Covariates via Semi-supervised Learning Approach

Advancements in data collection techniques and the heterogeneity of data resources can yield high percentages of missing observations on variables, such as block-wise missing data. Under missing-data scenarios, traditional methods such as…

Methodology · Statistics 2022-05-17 Wei Lan , Xuerong Chen , Tao Zou , Chih-Ling Tsai

Efficient Estimation under Multiple Missing Patterns via Balancing Weights

As one of the most commonly seen data challenges, missing data, in particular, multiple, non-monotone missing patterns, complicates estimation and inference due to the fact that missingness mechanisms are often not missing at random, and…

Methodology · Statistics 2025-04-21 Jianing Dong , Raymond K. W. Wong , Kwun Chuen Gary Chan

Imputing missing values with unsupervised random trees

This work proposes a non-iterative strategy for missing value imputations which is guided by similarity between observations, but instead of explicitly determining distances or nearest neighbors, it assigns observations to overlapping…

Machine Learning · Statistics 2019-11-25 David Cortes

Exact balanced random imputation for sample survey data

Surveys usually suffer from non-response, which decreases the effective sample size. Item non-response is typically handled by means of some form of random imputation if we wish to preserve the distribution of the imputed variable. This…

Methodology · Statistics 2017-08-04 Guillaume Chauvet , Wilfried Do Paco

The `Why' behind including `Y' in your imputation model

Missing data is a common challenge when analyzing epidemiological data, and imputation is often used to address this issue. Here, we investigate the scenario where a covariate used in an analysis has missingness and will be imputed. There…

Methodology · Statistics 2024-03-04 Lucy D'Agostino McGowan , Sarah C. Lotspeich , Staci A. Hepler

Graphical and numerical diagnostic tools to assess multiple imputation models by posterior predictive checking

Missing data are often dealt with multiple imputation. A crucial part of the multiple imputation process is selecting sensible models to generate plausible values for incomplete data. A method based on posterior predictive checking is…

Computation · Statistics 2026-05-14 Mingyang Cai , Stef van Buuren , Gerko Vink

Multiple Imputation Methods under Extreme Values

Missing data are ubiquitous in empirical databases, yet statistical analyses typically require complete data matrices. Multiple imputation offers a principled solution for filling these gaps. This study evaluates the performance of several…

Computation · Statistics 2026-02-05 Enzo Porto Brasil

Variable Selection for Linear Regression Imputation in Surveys

Survey sampling is concerned with the estimation of finite population parameters. In practice, survey data suffer from item nonresponse, which is commonly handled through imputation, i.e., replacing missing values with predicted values. As…

Methodology · Statistics 2026-03-06 Ziming An , Mehdi Dagdoug , David Haziza

Multivariate Density Estimation with Missing Data

Multivariate density estimation is a popular technique in statistics with wide applications including regression models allowing for heteroskedasticity in conditional variances. The estimation problems become more challenging when…

Methodology · Statistics 2018-08-15 Zhen Li , Lili Wu , Weilian Zhou , Sujit Ghosh

Do we Need Dozens of Methods for Real World Missing Value Imputation?

Missing values pose a persistent challenge in modern data science. Consequently, there is an ever-growing number of publications introducing new imputation methods in various fields. While many studies compare imputation approaches, they…

Computation · Statistics 2025-11-10 Krystyna Grzesiak , Christophe Muller , Julie Josse , Jeffrey Näf

A Weighted Likelihood Approach Based on Statistical Data Depths

We propose a general approach to construct weighted likelihood estimating equations with the aim of obtain robust estimates. The weight, attached to each score contribution, is evaluated by comparing the statistical data depth at the model…

Methodology · Statistics 2018-02-16 Claudio Agostinelli

Transformed Distribution Matching for Missing Value Imputation

We study the problem of imputing missing values in a dataset, which has important applications in many domains. The key to missing value imputation is to capture the data distribution with incomplete samples and impute the missing values…

Machine Learning · Computer Science 2023-06-26 He Zhao , Ke Sun , Amir Dezfouli , Edwin Bonilla

Generative Imputation and Stochastic Prediction

In many machine learning applications, we are faced with incomplete datasets. In the literature, missing data imputation techniques have been mostly concerned with filling missing values. However, the existence of missing values is…

Machine Learning · Computer Science 2020-09-07 Mohammad Kachuee , Kimmo Karkkainen , Orpaz Goldstein , Sajad Darabi , Majid Sarrafzadeh

Stop or Continue Data Collection: A Nonignorable Missing Data Approach for Continuous Variables

We present an approach to inform decisions about nonresponse follow-up sampling. The basic idea is (i) to create completed samples by imputing nonrespondents' data under various assumptions about the nonresponse mechanisms, (ii) take…

Methodology · Statistics 2022-09-16 Thais Paiva , Jerry Reiter