Related papers: Modern Multiple Imputation with Functional Data

Highly Irregular Functional Generalized Linear Regression with Electronic Health Records

This work presents a new approach, called MISFIT, for fitting generalized functional linear regression models with sparsely and irregularly sampled data. Current methods do not allow for consistent estimation unless one assumes that the…

Methodology · Statistics 2022-05-10 Justin Petrovich , Matthew Reimherr , Carrie Daymont

Uncertainty-Aware Variational-Recurrent Imputation Network for Clinical Time Series

Electronic health records (EHR) consist of longitudinal clinical observations portrayed with sparsity, irregularity, and high-dimensionality, which become major obstacles in drawing reliable downstream clinical outcomes. Although there…

Machine Learning · Computer Science 2020-11-17 Ahmad Wisnu Mulyadi , Eunji Jun , Heung-Il Suk

Mixture-based Multiple Imputation Model for Clinical Data with a Temporal Dimension

The problem of missing values in multivariable time series is a key challenge in many applications such as clinical data mining. Although many imputation methods show their effectiveness in many applications, few of them are designed to…

Machine Learning · Computer Science 2020-03-04 Ye Xue , Diego Klabjan , Yuan Luo

MissForest - nonparametric missing value imputation for mixed-type data

Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set. Missing value imputation offers a…

Applications · Statistics 2014-06-03 Daniel J. Stekhoven , Peter Bühlmann

Multiple imputation in data that grow over time: A comparison of three strategies

Multiple imputation is a highly recommended technique to deal with missing data, but the application to longitudinal datasets can be done in multiple ways. When a new wave of longitudinal data arrives, we can treat the combined data of…

Methodology · Statistics 2026-05-18 X. M. Kavelaars , S. van Buuren , J. R. van Ginkel

Improved clinical data imputation via classical and quantum determinantal point processes

Imputing data is a critical issue for machine learning practitioners, including in the life sciences domain, where missing clinical data is a typical situation and the reliability of the imputation is of great importance. Currently, there…

Quantum Physics · Physics 2023-12-13 Skander Kazdaghli , Iordanis Kerenidis , Jens Kieckbusch , Philip Teare

A Graph-based Imputation Method for Sparse Medical Records

Electronic Medical Records (EHR) are extremely sparse. Only a small proportion of events (symptoms, diagnoses, and treatments) are observed in the lifetime of an individual. The high degree of missingness of EHR can be attributed to a large…

Artificial Intelligence · Computer Science 2021-11-18 Ramon Vinas , Xu Zheng , Jer Hayes

Multiple-level Point Embedding for Solving Human Trajectory Imputation with Prediction

Sparsity is a common issue in many trajectory datasets, including human mobility data. This issue frequently brings more difficulty to relevant learning tasks, such as trajectory imputation and prediction. Nowadays, little existing work…

Machine Learning · Computer Science 2023-01-13 Kyle K. Qin , Yongli Ren , Wei Shao , Brennan Lake , Filippo Privitera , Flora D. Salim

Multiple Imputation of Hierarchical Nonlinear Time Series Data with an Application to School Enrollment Data

International comparisons of hierarchical time series data sets based on survey data, such as annual country-level estimates of school enrollment rates, can suffer from large amounts of missing data due to differing coverage of surveys…

Methodology · Statistics 2025-03-31 Daphne H. Liu , Adrian E. Raftery

Uncertainty-Gated Stochastic Sequential Model for EHR Mortality Prediction

Electronic health records (EHR) are characterized as non-stationary, heterogeneous, noisy, and sparse data; therefore, it is challenging to learn the regularities or patterns inherent within them. In particular, sparseness caused mostly by…

Machine Learning · Computer Science 2020-03-03 Eunji Jun , Ahmad Wisnu Mulyadi , Jaehun Choi , Heung-Il Suk

How Deep is your Guess? A Fresh Perspective on Deep Learning for Medical Time-Series Imputation

We present a comprehensive analysis of deep learning approaches for Electronic Health Record (EHR) time-series imputation, examining how architectural and framework biases combine to influence model performance. Our investigation reveals…

Machine Learning · Computer Science 2025-02-05 Linglong Qian , Tao Wang , Jun Wang , Hugh Logan Ellis , Robin Mitra , Richard Dobson , Zina Ibrahim

Simultaneous inference for misaligned multivariate functional data

We consider inference for misaligned multivariate functional data that represents the same underlying curve, but where the functional samples have systematic differences in shape. In this paper we introduce a new class of generally…

Applications · Statistics 2023-01-23 Niels Lundtorp Olsen , Bo Markussen , Lars Lau Rakêt

Estimation of Over-parameterized Models from an Auto-Modeling Perspective

From a model-building perspective, we propose a paradigm shift for fitting over-parameterized models. Philosophically, the mindset is to fit models to future observations rather than to the observed sample. Technically, given an imputation…

Methodology · Statistics 2024-12-09 Yiran Jiang , Chuanhai Liu

Robust joint modeling of sparsely observed paired functional data

A reduced-rank mixed effects model is developed for robust modeling of sparsely observed paired functional data. In this model, the curves for each functional variable are summarized using a few functional principal components, and the…

Methodology · Statistics 2023-08-08 Huiya Zhou , Xiaomeng Yan , Lan Zhou

Multiple imputation for longitudinal data: A tutorial

Longitudinal studies are frequently used in medical research and involve collecting repeated measures on individuals over time. Observations from the same individual are invariably correlated and thus an analytic approach that accounts for…

Methodology · Statistics 2024-04-11 Rushani Wijesuriya , Margarita Moreno-Betancur , John B Carlin , Ian R White , Matteo Quartagno , Katherine J Lee

Addressing missing data mechanism uncertainty using multiple-model multiple imputation: Application to a longitudinal clinical trial

We present a framework for generating multiple imputations for continuous data when the missing data mechanism is unknown. Imputations are generated from more than one imputation model in order to incorporate uncertainty regarding the…

Applications · Statistics 2013-01-14 Juned Siddique , Ofer Harel , Catherine M. Crespi

Multiple Imputation Diagnostics when using Electronic Health Record Data in Observational Studies: A Case Study

Missing values in electronic health record (EHR) data pose a significant challenge for epidemiologic research. Traditional methods for handling missing data, like mean imputation, may introduce bias. Multiple imputation (MI) offers a…

Methodology · Statistics 2026-04-14 Nrupen A. Bhavsar , Lingyu Zhou , Samuel I. Berchuck , Matthew L. Maciejewski , Jerome P. Reiter

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing data in a principled fashion. This paper presents an overview of multiple imputation, including important theoretical results and their practical implications for…

Methodology · Statistics 2018-01-15 Jared S. Murray

Inference for Multivariate Regression Model based on synthetic data generated under Fixed-Posterior Predictive Sampling: comparison with Plug-in Sampling

The authors derive likelihood-based exact inference methods for the multivariate regression model, for singly imputed synthetic data generated via Posterior Predictive Sampling (PPS) and for multiply imputed synthetic data generated via a…

Statistics Theory · Mathematics 2017-07-26 Ricardo Moura , Martin Klein , Carlos A. Coelho , Bimal Sinha

A new approach to data assimilation initialization problems with sparse data using multiple cost functions

This article develops a novel data assimilation methodology, addressing challenges that are common in real-world settings, such as severe sparsity of observations, lack of reliable models, and non-stationarity of the system dynamics. These…

Optimization and Control · Mathematics 2024-11-05 David J. Abers , George Hripcsak , Lena Mamykina , Melike Sirlanci , Esteban G. Tabak