Related papers: Missing at random: a stochastic process perspectiv…

What Is Meant by "Missing at Random"?

The concept of missing at random is central in the literature on statistical analysis with missing data. In general, inference using incomplete data should be based not only on observed data values but should also take account of the…

Methodology · Statistics 2013-06-13 Shaun Seaman , John Galati , Dan Jackson , John Carlin

Missing at Random or Not: A Semiparametric Testing Approach

Practical problems with missing data are common, and statistical methods have been developed concerning the validity and/or efficiency of statistical procedures. On a central focus, there have been longstanding interests on the mechanism…

Methodology · Statistics 2020-03-26 Rui Duan , C. Jason Liang , Pamela Shaw , Cheng Yong Tang , Yong Chen

Diagnosing missing always at random in multivariate data

Models for analyzing multivariate data sets with missing values require strong, often unassessable, assumptions. The most common of these is that the mechanism that created the missing data is ignorable - a twofold assumption dependent on…

Applications · Statistics 2020-02-17 Iavor Bojinov , Natesh Pillai , Donald Rubin

Missing at random, likelihood ignorability and model completeness

This paper provides further insight into the key concept of missing at random (MAR) in incomplete data analysis. Following the usual selection modelling approach we envisage two models with separable parameters: a model for the response of…

Statistics Theory · Mathematics 2007-06-13 Guobing Lu , John B. Copas

Alternative approaches for analysing repeated measures data that are missing not at random

We consider studies where multiple measures on an outcome variable are collected over time, but some subjects drop out before the end of follow up. Analyses of such data often proceed under either a 'last observation carried forward' or…

Methodology · Statistics 2022-07-26 Oliver Dukes , David Richardson , Eric Tchetgen Tchetgen

Penalized pairwise pseudo likelihood for variable selection with nonignorable missing data

The regularization approach for variable selection was well developed for a completely observed data set in the past two decades. In the presence of missing values, this approach needs to be tailored to different missing data mechanisms. In…

Methodology · Statistics 2017-07-31 Jiwei Zhao , Yang Yang , Yang Ning

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models

We are concerned in clustering continuous data sets subject to non-ignorable missingness. We perform clustering with a specific semi-parametric mixture, under the assumption of conditional independence given the component. The mixture model…

Methodology · Statistics 2021-07-20 Marie Du Roy de Chaumaray , Matthieu Marbac

Missing Data as Part of the Social Behavior in Real-World Financial Complex Systems

Many real-world networks are known to exhibit facts that counter our knowledge prescribed by the theories on network creation and communication patterns. A common prerequisite in network analysis is that information on nodes and links will…

Physics and Society · Physics 2018-04-03 Guy Kelman , Eran Manes , Marco Lamieri , David Breé

Learning from data with structured missingness

Missing data are an unavoidable complication in many machine learning tasks. When data are `missing at random' there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious,…

Machine Learning · Statistics 2023-04-05 Robin Mitra , Sarah F. McGough , Tapabrata Chakraborti , Chris Holmes , Ryan Copping , Niels Hagenbuch , Stefanie Biedermann , Jack Noonan , Brieuc Lehmann , Aditi Shenvi , Xuan Vinh Doan , David Leslie , Ginestra Bianconi , Ruben Sanchez-Garcia , Alisha Davies , Maxine Mackintosh , Eleni-Rosalina Andrinopoulou , Anahid Basiri , Chris Harbron , Ben D. MacArthur

Statistical Inference with Different Missing-data Mechanisms

When data are missing due to at most one cause from some time to next time, we can make sampling distribution inferences about the parameter of the data by modeling the missing-data mechanism correctly. Proverbially, in case its mechanism…

Methodology · Statistics 2014-07-21 Kosuke Morikawa , Yutaka Kano

Fast and Reliable Missing Data Contingency Analysis with Predicate-Constraints

Today, data analysts largely rely on intuition to determine whether missing or withheld rows of a dataset significantly affect their analyses. We propose a framework that can produce automatic contingency analysis, i.e., the range of values…

Databases · Computer Science 2020-04-09 Xi Liang , Zechao Shang , Aaron J. Elmore , Sanjay Krishnan , Michael J. Franklin

A fresh look at ignorability for likelihood inference

When data are incomplete, a random vector Y for the data process together with a binary random vector R for the process that causes missing data, are modelled jointly. We review conditions under which R can be ignored for drawing likelihood…

Methodology · Statistics 2019-04-01 John C Galati

The typical set and entropy in stochastic systems with arbitrary phase space growth

The existence of the {\em typical set} is key for data compression strategies and for the emergence of robust statistical observables in macroscopic physical systems. Standard approaches derive its existence from a restricted set of…

Statistical Mechanics · Physics 2022-02-10 Rudolf Hanel , Bernat Corominas-Murtra

Incorporating Missingness in a Framework for Generating Realistic Synthetic Randomized Controlled Trial Data

The current literature regarding generation of complex, realistic synthetic tabular data, particularly for randomized controlled trials (RCTs), often ignores missing data. However, missing data are common in RCT data and often are not…

Other Statistics · Statistics 2025-12-02 Niki Z. Petrakos , Erica E. M. Moodie , Nicolas Savy

Likelihood inference for incompletely observed stochastic processes: ignorability conditions

We develop a study of ignorability and conditions thereof for likelihood inference in the framework of stochastic processes. We define a coarsening model for processes which includes discrete-time observations as well as censored…

Statistics Theory · Mathematics 2015-11-16 Daniel Commenges , Anne Gegout-Petit

Prediction with Missing Data: Target Probabilities and Missingness Mechanisms

Conditions ensuring optimal parameter estimation in the presence of missing data are well established in inference, typically relying on the Missing-at-Random (MAR) assumption. In prediction, similar principles are often assumed to apply.…

Methodology · Statistics 2026-03-19 Pierre Catoire , Robin Genuer , Cecile Proust-Lima

Full-semiparametric-likelihood-based inference for non-ignorable missing data

During the past few decades, missing-data problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities. By contrast, research into non-ignorable…

Methodology · Statistics 2019-08-06 Yukun Liu , Pengfei Li , Jing Qin

An integrated approach to test for missing not at random

Missing data can lead to inefficiencies and biases in analyses, in particular when data are missing not at random (MNAR). It is thus vital to understand and correctly identify the missing data mechanism. Recovering missing values through a…

Methodology · Statistics 2022-12-08 Jack Noonan , Adetola Adedamola Adediran , Robin Mitra , Stefanie Biedermann

Bernoulli amputation

An approach to amputation, the process of introducing missing values to a complete dataset, is presented. It allows to construct missingness indicators in a flexible and principled way via copulas and Bernoulli margins and to incorporate…

Applications · Statistics 2025-07-28 Marius Hofert , James Jackson , Niels Hagenbuch

Missing Data in Discrete Time State-Space Modeling of Ecological Momentary Assessment Data: A Monte-Carlo Study of Imputation Methods

When using ecological momentary assessment data (EMA), missing data is pervasive as participant attrition is a common issue. Thus, any EMA study must have a missing data plan. In this paper, we discuss missingness in time series analysis…

Methodology · Statistics 2025-02-18 Lindley R. Slipetz , Ami Falk , Teague R. Henry