English
Related papers

Related papers: Modern Multiple Imputation with Functional Data

200 papers

This work presents a new approach, called MISFIT, for fitting generalized functional linear regression models with sparsely and irregularly sampled data. Current methods do not allow for consistent estimation unless one assumes that the…

Methodology · Statistics 2022-05-10 Justin Petrovich , Matthew Reimherr , Carrie Daymont

Electronic health records (EHR) consist of longitudinal clinical observations portrayed with sparsity, irregularity, and high-dimensionality, which become major obstacles in drawing reliable downstream clinical outcomes. Although there…

Machine Learning · Computer Science 2020-11-17 Ahmad Wisnu Mulyadi , Eunji Jun , Heung-Il Suk

The problem of missing values in multivariable time series is a key challenge in many applications such as clinical data mining. Although many imputation methods show their effectiveness in many applications, few of them are designed to…

Machine Learning · Computer Science 2020-03-04 Ye Xue , Diego Klabjan , Yuan Luo

Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set. Missing value imputation offers a…

Applications · Statistics 2014-06-03 Daniel J. Stekhoven , Peter Bühlmann

Multiple imputation is a highly recommended technique to deal with missing data, but the application to longitudinal datasets can be done in multiple ways. When a new wave of longitudinal data arrives, we can treat the combined data of…

Methodology · Statistics 2026-05-18 X. M. Kavelaars , S. van Buuren , J. R. van Ginkel

Imputing data is a critical issue for machine learning practitioners, including in the life sciences domain, where missing clinical data is a typical situation and the reliability of the imputation is of great importance. Currently, there…

Quantum Physics · Physics 2023-12-13 Skander Kazdaghli , Iordanis Kerenidis , Jens Kieckbusch , Philip Teare

Electronic Medical Records (EHR) are extremely sparse. Only a small proportion of events (symptoms, diagnoses, and treatments) are observed in the lifetime of an individual. The high degree of missingness of EHR can be attributed to a large…

Artificial Intelligence · Computer Science 2021-11-18 Ramon Vinas , Xu Zheng , Jer Hayes

Sparsity is a common issue in many trajectory datasets, including human mobility data. This issue frequently brings more difficulty to relevant learning tasks, such as trajectory imputation and prediction. Nowadays, little existing work…

Machine Learning · Computer Science 2023-01-13 Kyle K. Qin , Yongli Ren , Wei Shao , Brennan Lake , Filippo Privitera , Flora D. Salim

International comparisons of hierarchical time series data sets based on survey data, such as annual country-level estimates of school enrollment rates, can suffer from large amounts of missing data due to differing coverage of surveys…

Methodology · Statistics 2025-03-31 Daphne H. Liu , Adrian E. Raftery

Electronic health records (EHR) are characterized as non-stationary, heterogeneous, noisy, and sparse data; therefore, it is challenging to learn the regularities or patterns inherent within them. In particular, sparseness caused mostly by…

Machine Learning · Computer Science 2020-03-03 Eunji Jun , Ahmad Wisnu Mulyadi , Jaehun Choi , Heung-Il Suk

We present a comprehensive analysis of deep learning approaches for Electronic Health Record (EHR) time-series imputation, examining how architectural and framework biases combine to influence model performance. Our investigation reveals…

Machine Learning · Computer Science 2025-02-05 Linglong Qian , Tao Wang , Jun Wang , Hugh Logan Ellis , Robin Mitra , Richard Dobson , Zina Ibrahim

We consider inference for misaligned multivariate functional data that represents the same underlying curve, but where the functional samples have systematic differences in shape. In this paper we introduce a new class of generally…

Applications · Statistics 2023-01-23 Niels Lundtorp Olsen , Bo Markussen , Lars Lau Rakêt

From a model-building perspective, we propose a paradigm shift for fitting over-parameterized models. Philosophically, the mindset is to fit models to future observations rather than to the observed sample. Technically, given an imputation…

Methodology · Statistics 2024-12-09 Yiran Jiang , Chuanhai Liu

A reduced-rank mixed effects model is developed for robust modeling of sparsely observed paired functional data. In this model, the curves for each functional variable are summarized using a few functional principal components, and the…

Methodology · Statistics 2023-08-08 Huiya Zhou , Xiaomeng Yan , Lan Zhou

Longitudinal studies are frequently used in medical research and involve collecting repeated measures on individuals over time. Observations from the same individual are invariably correlated and thus an analytic approach that accounts for…

We present a framework for generating multiple imputations for continuous data when the missing data mechanism is unknown. Imputations are generated from more than one imputation model in order to incorporate uncertainty regarding the…

Applications · Statistics 2013-01-14 Juned Siddique , Ofer Harel , Catherine M. Crespi

Missing values in electronic health record (EHR) data pose a significant challenge for epidemiologic research. Traditional methods for handling missing data, like mean imputation, may introduce bias. Multiple imputation (MI) offers a…

Multiple imputation is a straightforward method for handling missing data in a principled fashion. This paper presents an overview of multiple imputation, including important theoretical results and their practical implications for…

Methodology · Statistics 2018-01-15 Jared S. Murray

The authors derive likelihood-based exact inference methods for the multivariate regression model, for singly imputed synthetic data generated via Posterior Predictive Sampling (PPS) and for multiply imputed synthetic data generated via a…

Statistics Theory · Mathematics 2017-07-26 Ricardo Moura , Martin Klein , Carlos A. Coelho , Bimal Sinha

This article develops a novel data assimilation methodology, addressing challenges that are common in real-world settings, such as severe sparsity of observations, lack of reliable models, and non-stationarity of the system dynamics. These…

Optimization and Control · Mathematics 2024-11-05 David J. Abers , George Hripcsak , Lena Mamykina , Melike Sirlanci , Esteban G. Tabak
‹ Prev 1 2 3 10 Next ›