Related papers: Conformal Inference For Missing Data under Multipl…

A Comparison of Full Information Maximum Likelihood and Machine Learning Missing Data Analytical Methods in Growth Curve Modeling

Missing data are inevitable in longitudinal studies. Traditional methods, such as the full information maximum likelihood (FIML), are commonly used to handle ignorable missing data. However, they may lead to biased model estimation due to…

Applications · Statistics 2024-01-01 Dandan Tang , Xin Tong

A Calibration Framework for Inference with Partially Observed Data

Missing data is an universal problem in statistics. We develop a unified framework for estimating parameters defined by general estimating equations under a missing-at-random (MAR) mechanism, based on generalized entropy calibration…

Methodology · Statistics 2026-03-31 Mst Moushumi Pervin , Hengfang Wang , Jae Kwang Kim

Oracle, Multiple Robust and Multipurpose Calibration in a Missing Response Problem

In the presence of a missing response, reweighting the complete case subsample by the inverse of nonmissing probability is both intuitive and easy to implement. When the population totals of some auxiliary variables are known and when the…

Methodology · Statistics 2014-10-16 Kwun Chuen Gary Chan , Sheung Chi Phillip Yam

A Unified Framework for Inference with General Missingness Patterns and Machine Learning Imputation

Pre-trained machine learning (ML) predictions have been increasingly used to complement incomplete data to enable downstream scientific inquiries, but their naive integration risks biased inferences. Recently, multiple methods have been…

Methodology · Statistics 2025-11-12 Xingran Chen , Tyler McCormick , Bhramar Mukherjee , Zhenke Wu

Towards Better Modeling with Missing Data: A Contrastive Learning-based Visual Analytics Perspective

Missing data can pose a challenge for machine learning (ML) modeling. To address this, current approaches are categorized into feature imputation and label prediction and are primarily focused on handling missing data to enhance ML…

Machine Learning · Computer Science 2023-09-19 Laixin Xie , Yang Ouyang , Longfei Chen , Ziming Wu , Quan Li

Conformal Unlearning: A New Paradigm for Unlearning in Conformal Predictors

Conformal unlearning aims to ensure that a trained conformal predictor miscovers data points with specific shared characteristics, such as those from a particular label class, associated with a specific user, or belonging to a defined…

Machine Learning · Computer Science 2026-02-13 Yahya Alkhatib , Muhammad Ahmar Jamal , Wee Peng Tay

Parametric MMD Estimation with Missing Values: Robustness to Missingness and Data Model Misspecification

In the missing data literature, the Maximum Likelihood Estimator (MLE) is celebrated for its ignorability property under missing at random (MAR) data. However, its sensitivity to misspecification of the (complete) data model, even under…

Methodology · Statistics 2025-09-23 Badr-Eddine Chérief-Abdellatif , Jeffrey Näf

RIGID: Robust Linear Regression with Missing Data

We present a robust framework to perform linear regression with missing entries in the features. By considering an elliptical data distribution, and specifically a multivariate normal model, we are able to conditionally formulate a…

Machine Learning · Computer Science 2022-11-10 Alireza Aghasi , MohammadJavad Feizollahi , Saeed Ghadimi

DPER: Efficient Parameter Estimation for Randomly Missing Data

The missing data problem has been broadly studied in the last few decades and has various applications in different areas such as statistics or bioinformatics. Even though many methods have been developed to tackle this challenge, most of…

Machine Learning · Statistics 2021-06-10 Thu Nguyen , Khoi Minh Nguyen-Duy , Duy Ho Minh Nguyen , Binh T. Nguyen , Bruce Alan Wade

Model Evaluation in the Dark: Robust Classifier Metrics with Missing Labels

Missing data in supervised learning is well-studied, but the specific issue of missing labels during model evaluation has been overlooked. Ignoring samples with missing values, a common solution, can introduce bias, especially when data is…

Machine Learning · Computer Science 2025-04-28 Danial Dervovic , Michael Cashmore

Comparison of Parametric versus Machine-learning Multiple Imputation in Clinical Trials with Missing Continuous Outcomes

The use of flexible machine-learning (ML) models to generate imputations of missing data within the framework of Multiple Imputation (MI) has recently gained traction, particularly in observational settings. For randomised controlled trials…

Methodology · Statistics 2025-10-07 Mia S. Tackney , Jonathan W. Bartlett , Elizabeth Williamson , Kim May Lee

Missing values: sparse inverse covariance estimation and an extension to sparse regression

We propose an l1-regularized likelihood method for estimating the inverse covariance matrix in the high-dimensional multivariate normal model in presence of missing data. Our method is based on the assumption that the data are missing at…

Methodology · Statistics 2012-02-28 Nicolas Städler , Peter Bühlmann

A Marginal Maximum Likelihood Approach for Hierarchical Simultaneous Autoregressive Models with Missing Data

Efficient estimation methods for simultaneous autoregressive (SAR) models with missing data in the response variable have been well-explored in the literature. A common practice is to introduce measurement error into SAR models to separate…

Methodology · Statistics 2024-10-10 Anjana Wijayawardhana , Thomas Suesse , David Gunawan

Efficient and Differentiable Conformal Prediction with General Function Classes

Quantifying the data uncertainty in learning tasks is often done by learning a prediction interval or prediction set of the label given the input. Two commonly desired properties for learned prediction sets are \emph{valid coverage} and…

Machine Learning · Computer Science 2022-05-31 Yu Bai , Song Mei , Huan Wang , Yingbo Zhou , Caiming Xiong

Redundancy-Adaptive Multimodal Learning for Imperfect Data

Multimodal models trained on complete modality data often exhibit a substantial decrease in performance when faced with imperfect data containing corruptions or missing modalities. To address this robustness challenge, prior methods have…

Multimedia · Computer Science 2023-10-24 Mengxi Chen , Jiangchao Yao , Linyu Xing , Yu Wang , Ya Zhang , Yanfeng Wang

Model-Free Robust Reinforcement Learning with Sample Complexity Analysis

Distributionally Robust Reinforcement Learning (DR-RL) aims to derive a policy optimizing the worst-case performance within a predefined uncertainty set. Despite extensive research, previous DR-RL algorithms have predominantly favored…

Machine Learning · Computer Science 2024-06-26 Yudan Wang , Shaofeng Zou , Yue Wang

Conformalized matrix completion

Matrix completion aims to estimate missing entries in a data matrix, using the assumption of a low-complexity structure (e.g., low rank) so that imputation is possible. While many effective estimation algorithms exist in the literature,…

Methodology · Statistics 2023-10-24 Yu Gui , Rina Foygel Barber , Cong Ma

Noise-Adaptive Conformal Classification with Marginal Coverage

Conformal inference provides a rigorous statistical framework for uncertainty quantification in machine learning, enabling well-calibrated prediction sets with precise coverage guarantees for any classification model. However, its reliance…

Methodology · Statistics 2025-01-31 Teresa Bortolotti , Y. X. Rachel Wang , Xin Tong , Alessandra Menafoglio , Simone Vantini , Matteo Sesia

Robust location estimation with missing data

In a missing-data setting, we have a sample in which a vector of explanatory variables x_i is observed for every subject i, while scalar outcomes y_i are missing by happenstance on some individuals. In this work we propose robust estimates…

Statistics Theory · Mathematics 2010-09-20 Mariela Sued , Victor J. Yohai

Conformal Prediction with Missing Values

Conformal prediction is a theoretically grounded framework for constructing predictive intervals. We study conformal prediction with missing values in the covariates -- a setting that brings new challenges to uncertainty quantification. We…

Machine Learning · Statistics 2023-06-06 Margaux Zaffran , Aymeric Dieuleveut , Julie Josse , Yaniv Romano