Related papers: Statistical Inference with Different Missing-data …

Prediction with Missing Data: Target Probabilities and Missingness Mechanisms

Conditions ensuring optimal parameter estimation in the presence of missing data are well established in inference, typically relying on the Missing-at-Random (MAR) assumption. In prediction, similar principles are often assumed to apply.…

Methodology · Statistics 2026-03-19 Pierre Catoire , Robin Genuer , Cecile Proust-Lima

Evaluation of missing data mechanisms in two and three dimensional incomplete tables

The analysis of incomplete contingency tables is a practical and an interesting problem. In this paper, we provide characterizations for the various missing mechanisms of a variable in terms of response and non-response odds for two and…

Methodology · Statistics 2018-11-27 S. Ghosh , P. Vellaisamy

Review for Handling Missing Data with special missing mechanism

Missing data poses a significant challenge in data science, affecting decision-making processes and outcomes. Understanding what missing data is, how it occurs, and why it is crucial to handle it appropriately is paramount when working with…

Methodology · Statistics 2024-04-09 Youran Zhou , Sunil Aryal , Mohamed Reda Bouadjenek

Missing at random, likelihood ignorability and model completeness

This paper provides further insight into the key concept of missing at random (MAR) in incomplete data analysis. Following the usual selection modelling approach we envisage two models with separable parameters: a model for the response of…

Statistics Theory · Mathematics 2007-06-13 Guobing Lu , John B. Copas

Developing robust methods to handle missing data in real-world applications effectively

Missing data is a pervasive challenge spanning diverse data types, including tabular, sensor data, time-series, images and so on. Its origins are multifaceted, resulting in various missing mechanisms. Prior research in this field has…

Machine Learning · Computer Science 2025-03-03 Youran Zhou , Mohamed Reda Bouadjenek , Sunil Aryal

What Is Meant by "Missing at Random"?

The concept of missing at random is central in the literature on statistical analysis with missing data. In general, inference using incomplete data should be based not only on observed data values but should also take account of the…

Methodology · Statistics 2013-06-13 Shaun Seaman , John Galati , Dan Jackson , John Carlin

Identifiable Deep Latent Variable Models for MNAR Data

Missing data is a ubiquitous challenge in data analysis, often leading to biased and inaccurate results. Traditional imputation methods usually assume that the missingness mechanism is missing-at-random (MAR), where the missingness is…

Methodology · Statistics 2026-03-30 Huiming Xie , Fei Xue , Xiao Wang

Sequential identification of nonignorable missing data mechanisms

With nonignorable missing data, likelihood-based inference should be based on the joint distribution of the study variables and their missingness indicators. These joint models cannot be estimated from the data alone, thus requiring the…

Statistics Theory · Mathematics 2017-01-06 Mauricio Sadinle , Jerome P. Reiter

Score test for missing at random or not

Missing data are frequently encountered in various disciplines and can be divided into three categories: missing completely at random (MCAR), missing at random (MAR) and missing not at random (MNAR). Valid statistical approaches to missing…

Methodology · Statistics 2021-05-28 Hairu Wang , Zhiping Lu , Yukun Liu

A fresh look at ignorability for likelihood inference

When data are incomplete, a random vector Y for the data process together with a binary random vector R for the process that causes missing data, are modelled jointly. We review conditions under which R can be ignored for drawing likelihood…

Methodology · Statistics 2019-04-01 John C Galati

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing

When a missing-data mechanism is NMAR or non-ignorable, missingness is itself vital information and it must be taken into the likelihood, which, however, needs to introduce additional parameters to be estimated. The incompleteness of the…

Methodology · Statistics 2014-05-15 Kosuke Morikawa , Yutaka Kano

Sufficient Identification Conditions and Semiparametric Estimation under Missing Not at Random Mechanisms

Conducting valid statistical analyses is challenging in the presence of missing-not-at-random (MNAR) data, where the missingness mechanism is dependent on the missing values themselves even conditioned on the observed data. Here, we…

Methodology · Statistics 2023-06-13 Anna Guo , Jiwei Zhao , Razieh Nabi

Random Indicator Imputation for Missing Not At Random Data

Imputation methods for dealing with incomplete data typically assume that the missingness mechanism is at random (MAR). These methods can also be applied to missing not at random (MNAR) situations, where the user specifies some adjustment…

Methodology · Statistics 2024-04-24 Shahab Jolani , Stef van Buuren

Diagnosing missing always at random in multivariate data

Models for analyzing multivariate data sets with missing values require strong, often unassessable, assumptions. The most common of these is that the mechanism that created the missing data is ignorable - a twofold assumption dependent on…

Applications · Statistics 2020-02-17 Iavor Bojinov , Natesh Pillai , Donald Rubin

Estimation and imputation in Probabilistic Principal Component Analysis with Missing Not At Random data

Missing Not At Random (MNAR) values lead to significant biases in the data, since the probability of missingness depends on the unobserved values.They are ''not ignorable'' in the sense that they often require defining a model for the…

Statistics Theory · Mathematics 2020-06-11 Aude Sportisse , Claire Boyer , Julie Josse

Addressing missing data mechanism uncertainty using multiple-model multiple imputation: Application to a longitudinal clinical trial

We present a framework for generating multiple imputations for continuous data when the missing data mechanism is unknown. Imputations are generated from more than one imputation model in order to incorporate uncertainty regarding the…

Applications · Statistics 2013-01-14 Juned Siddique , Ofer Harel , Catherine M. Crespi

Block-Conditional Missing at Random Models for Missing Data

Two major ideas in the analysis of missing data are (a) the EM algorithm [Dempster, Laird and Rubin, J. Roy. Statist. Soc. Ser. B 39 (1977) 1--38] for maximum likelihood (ML) estimation, and (b) the formulation of models for the joint…

Methodology · Statistics 2011-04-14 Yan Zhou , Roderick J. A. Little , John D. Kalbfleisch

An integrated approach to test for missing not at random

Missing data can lead to inefficiencies and biases in analyses, in particular when data are missing not at random (MNAR). It is thus vital to understand and correctly identify the missing data mechanism. Recovering missing values through a…

Methodology · Statistics 2022-12-08 Jack Noonan , Adetola Adedamola Adediran , Robin Mitra , Stefanie Biedermann

Recoverability of Joint Distribution from Missing Data

A probabilistic query may not be estimable from observed data corrupted by missing values if the data are not missing at random (MAR). It is therefore of theoretical interest and practical importance to determine in principle whether a…

Machine Learning · Statistics 2016-11-16 Jin Tian

Bayesian data combination model with Gaussian process latent variable model for mixed observed variables under NMAR missingness

In the analysis of observational data in social sciences and businesses, it is difficult to obtain a "(quasi) single-source dataset" in which the variables of interest are simultaneously observed. Instead, multiple-source datasets are…

Methodology · Statistics 2021-09-02 Masaki Mitsuhiro , Takahiro Hoshino