Related papers: Quantum-Inspired Optimization Process for Data Imp…

Improved clinical data imputation via classical and quantum determinantal point processes

Imputing data is a critical issue for machine learning practitioners, including in the life sciences domain, where missing clinical data is a typical situation and the reliability of the imputation is of great importance. Currently, there…

Quantum Physics · Physics 2023-12-13 Skander Kazdaghli , Iordanis Kerenidis , Jens Kieckbusch , Philip Teare

Data Imputation by Pursuing Better Classification: A Supervised Kernel-Based Method

Data imputation, the process of filling in missing feature elements for incomplete data sets, plays a crucial role in data-driven learning. A fundamental belief is that data imputation is helpful for learning performance, and it follows…

Machine Learning · Computer Science 2025-09-30 Ruikai Yang , Fan He , Mingzhen He , Kaijie Wang , Xiaolin Huang

Personalized Imputation in metric spaces via conformal prediction: Applications in Predicting Diabetes Development with Continuous Glucose Monitoring Information

The challenge of handling missing data is widespread in modern data analysis, particularly during the preprocessing phase and in various inferential modeling tasks. Although numerous algorithms exist for imputing missing data, the…

Methodology · Statistics 2024-03-28 Marcos Matabuena , Carla Díaz-Louzao , Rahul Ghosal , Francisco Gude

Principal Component Analysis based frameworks for efficient missing data imputation algorithms

Missing data is a commonly occurring problem in practice. Many imputation methods have been developed to fill in the missing entries. However, not all of them can scale to high-dimensional data, especially the multiple imputation…

Machine Learning · Computer Science 2023-03-21 Thu Nguyen , Hoang Thien Ly , Michael Alexander Riegler , Pål Halvorsen , Hugo L. Hammer

Quantum-Accelerated Neural Imputation with Large Language Models (LLMs)

Missing data presents a critical challenge in real-world datasets, significantly degrading the performance of machine learning models. While Large Language Models (LLMs) have recently demonstrated remarkable capabilities in tabular data…

Machine Learning · Computer Science 2025-07-14 Hossein Jamali

Imputation techniques on missing values in breast cancer treatment and fertility data

Clinical decision support using data mining techniques offers more intelligent way to reduce the decision error in the last few years. However, clinical datasets often suffer from high missingness, which adversely impacts the quality of…

Machine Learning · Computer Science 2020-11-20 Xuetong Wu , Hadi Akbarzadeh Khorshidi , Uwe Aickelin , Zobaida Edib , Michelle Peate

Efficient Data Reduction Via PCA-Guided Quantile Based Sampling

In large-scale statistical modeling, reducing data size through subsampling is essential for balancing computational efficiency and statistical accuracy. We propose a new method, Principal Component Analysis guided Quantile Sampling…

Computation · Statistics 2026-01-13 Foo Hui-Mean , Yuan-chin Ivan Chang

Imputation of Clinical Covariates in Time Series

Missing data is a common problem in real-world settings and particularly relevant in healthcare applications where researchers use Electronic Health Records (EHR) and results of observational studies to apply analytics methods. This issue…

Machine Learning · Statistics 2018-12-04 Dimitris Bertsimas , Agni Orfanoudaki , Colin Pawlowski

Imputations for High Missing Rate Data in Covariates via Semi-supervised Learning Approach

Advancements in data collection techniques and the heterogeneity of data resources can yield high percentages of missing observations on variables, such as block-wise missing data. Under missing-data scenarios, traditional methods such as…

Methodology · Statistics 2022-05-17 Wei Lan , Xuerong Chen , Tao Zou , Chih-Ling Tsai

Quantum Circuit for Imputation of Missing Data

The imputation of missing data is a common procedure in data analysis that consists in predicting missing values of incomplete data points. In this work we analyse a variational quantum circuit for the imputation of missing data. We…

Quantum Physics · Physics 2024-05-08 Claudio Sanavio , Simone Tibaldi , Edoardo Tignone , Elisa Ercolessi

ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment

Missing values are a major challenge in most data science projects working on real data. To avoid losing valuable information, imputation methods are used to fill in missing values with estimates, allowing the preservation of samples or…

Machine Learning · Computer Science 2024-07-17 Pedro Pons-Suñer , Laura Arnal , J. Ramón Navarro-Cerdán , François Signol

Embedding-Driven Data Distillation for 360-Degree IQA With Residual-Aware Refinement

This article identifies and addresses a fundamental bottleneck in data-driven 360-degree image quality assessment (IQA): the lack of intelligent, sample-level data selection. Hence, we propose a novel framework that introduces a critical…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Abderrezzaq Sendjasni , Seif-Eddine Benkabou , Mohamed-Chaker Larabi

Time-dependent Iterative Imputation for Multivariate Longitudinal Clinical Data

Missing data is a major challenge in clinical research. In electronic medical records, often a large fraction of the values in laboratory tests and vital signs are missing. The missingness can lead to biased estimates and limit our ability…

Machine Learning · Computer Science 2023-04-18 Omer Noy , Ron Shamir

Combining datasets to increase the number of samples and improve model fitting

For many use cases, combining information from different datasets can be of interest to improve a machine learning model's performance, especially when the number of samples from at least one of the datasets is small. However, a potential…

Machine Learning · Statistics 2023-05-17 Thu Nguyen , Rabindra Khadka , Nhan Phan , Anis Yazidi , Pål Halvorsen , Michael A. Riegler

Meta-Imputation Balanced (MIB): An Ensemble Approach for Handling Missing Data in Biomedical Machine Learning

Missing data represents a fundamental challenge in machine learning applications, often reducing model performance and reliability. This problem is particularly acute in fields like bioinformatics and clinical machine learning, where…

Machine Learning · Computer Science 2025-09-04 Fatemeh Azad , Zoran Bosnić , Matjaž Kukar

Adiabatic quantum computing with parameterized quantum circuits

Adiabatic quantum computing is a universal model for quantum computing whose implementation using a gate-based quantum computer requires depths that are unreachable in the early fault-tolerant era. To mitigate the limitations of near-term…

Quantum Physics · Physics 2024-10-18 Ioannis Kolotouros , Ioannis Petrongonas , Miloš Prokop , Petros Wallden

Model-based framework for automated quantification of error sources in quantum state tomography

High-quality quantum state generation is essential for advanced quantum information processing, including quantum communication, quantum sensing, and quantum computing. In practice, various error sources degrade the quality of quantum…

Quantum Physics · Physics 2025-08-08 Junpei Oba , Hsin-Pin Lo , Yasuhiro Yamada , Takayuki Matsui , Takuya Ikuta , Yuya Yonezu , Toshimori Honjo , Seiji Kajita , Hiroki Takesue

Explainable Data Imputation using Constraints

Data values in a dataset can be missing or anomalous due to mishandling or human error. Analysing data with missing values can create bias and affect the inferences. Several analysis methods, such as principle components analysis or…

Artificial Intelligence · Computer Science 2022-05-11 Sandeep Hans , Diptikalyan Saha , Aniya Aggarwal

PCA-Guided Quantile Sampling: Preserving Data Structure in Large-Scale Subsampling

We introduce Principal Component Analysis guided Quantile Sampling (PCA QS), a novel sampling framework designed to preserve both the statistical and geometric structure of large scale datasets. Unlike conventional PCA, which reduces…

Methodology · Statistics 2026-01-13 Foo Hui-Mean , Yuan-chin Ivan Chang

Quantum-enhanced optimization for patient stratification in clinical trials

Clinical trials are notorious for their high failure rates and steep costs, leading to wasted time and resources spend, prolonged development timelines, and delayed patient access to new therapies. A key contributor to these failures is…

Quantum Physics · Physics 2026-01-19 Laia Domingo , Christine Johnson