Related papers: Fairness in Missing Data Imputation

Fairness and Missing Values

The causes underlying unfair decision making are complex, being internalised in different ways by decision makers, other actors dealing with data and models, and ultimately by the individuals being affected by these decisions. One frequent…

Machine Learning · Computer Science 2019-05-31 Fernando Martínez-Plumed , Cèsar Ferri , David Nieves , José Hernández-Orallo

The influence of missing data mechanisms and simple missing data handling techniques on fairness

Machine learning algorithms permeate the day-to-day aspects of our lives and therefore studying the fairness of these algorithms before implementation is crucial. One way in which bias can manifest in a dataset is through missing values.…

Machine Learning · Statistics 2026-02-23 Aeysha Bhatti , Trudie Sandrock , Johane Nienkemper-Swanepoel

Impact Of Missing Data Imputation On The Fairness And Accuracy Of Graph Node Classifiers

Analysis of the fairness of machine learning (ML) algorithms recently attracted many researchers' interest. Most ML methods show bias toward protected groups, which limits the applicability of ML models in many applications like crime rate…

Machine Learning · Computer Science 2022-11-03 Haris Mansoor , Sarwan Ali , Shafiq Alam , Muhammad Asad Khan , Umair ul Hassan , Imdadullah Khan

Assessing Fairness in the Presence of Missing Data

Missing data are prevalent and present daunting challenges in real data analysis. While there is a growing body of literature on fairness in analysis of fully observed data, there has been little theoretical work on investigating fairness…

Machine Learning · Computer Science 2021-12-10 Yiliang Zhang , Qi Long

Adapting Fairness Interventions to Missing Values

Missing values in real-world data pose a significant and unique challenge to algorithmic fairness. Different demographic groups may be unequally affected by missing data, and the standard procedure for handling missing values where first…

Machine Learning · Computer Science 2023-11-13 Raymond Feng , Flavio P. Calmon , Hao Wang

Explainable Data Imputation using Constraints

Data values in a dataset can be missing or anomalous due to mishandling or human error. Analysing data with missing values can create bias and affect the inferences. Several analysis methods, such as principle components analysis or…

Artificial Intelligence · Computer Science 2022-05-11 Sandeep Hans , Diptikalyan Saha , Aniya Aggarwal

Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification

Classification, a heavily-studied data-driven machine learning task, drives an increasing number of prediction systems involving critical human decisions such as loan approval and criminal risk assessment. However, classifiers often…

Machine Learning · Computer Science 2022-04-12 Maliha Tashfia Islam , Anna Fariha , Alexandra Meliou , Babak Salimi

Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values

We investigate the fairness concerns of training a machine learning model using data with missing values. Even though there are a number of fairness intervention methods in the literature, most of them require a complete training set as…

Machine Learning · Computer Science 2022-04-15 Haewon Jeong , Hao Wang , Flavio P. Calmon

Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness

Machine learning risks reinforcing biases present in data and, as we argue in this work, in what is absent from data. In healthcare, societal and decision biases shape patterns in missing data, yet the algorithmic fairness implications of…

Artificial Intelligence · Computer Science 2025-03-19 Vincent Jeanselme , Maria De-Arteaga , Zhe Zhang , Jessica Barrett , Brian Tom

An Interdisciplinary and Cross-Task Review on Missing Data Imputation

Missing data is a fundamental challenge in data science, significantly hindering analysis and decision-making across a wide range of disciplines, including healthcare, bioinformatics, social science, e-commerce, and industrial monitoring.…

Machine Learning · Statistics 2026-05-12 Jicong Fan

Performance comparison of State-of-the-art Missing Value Imputation Algorithms on Some Bench mark Datasets

Decision making from data involves identifying a set of attributes that contribute to effective decision making through computational intelligence. The presence of missing values greatly influences the selection of right set of attributes…

Machine Learning · Computer Science 2013-07-23 M. Naresh Kumar

To Impute or not to Impute? Missing Data in Treatment Effect Estimation

Missing data is a systemic problem in practical scenarios that causes noise and bias when estimating treatment effects. This makes treatment effect estimation from data with missingness a particularly tricky endeavour. A key reason for this…

Machine Learning · Statistics 2023-02-27 Jeroen Berrevoets , Fergus Imrie , Trent Kyono , James Jordon , Mihaela van der Schaar

A Latent Variable Framework for Multiple Imputation with Non-ignorable Missingness: Analyzing Perceptions of Social Justice in Europe

This paper proposes a general multiple imputation approach for analyzing large-scale data with missing values. An imputation model is derived from a joint distribution induced by a latent variable model, which can flexibly capture…

Methodology · Statistics 2025-09-26 Siliang Zhang , Yunxiao Chen , Jouni Kuha

Classification of datasets with imputed missing values: does imputation quality matter?

Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods,…

Machine Learning · Computer Science 2023-12-20 Tolou Shadbahr , Michael Roberts , Jan Stanczuk , Julian Gilbey , Philip Teare , Sören Dittmer , Matthew Thorpe , Ramon Vinas Torne , Evis Sala , Pietro Lio , Mishal Patel , AIX-COVNET Collaboration , James H. F. Rudd , Tuomas Mirtti , Antti Rannikko , John A. D. Aston , Jing Tang , Carola-Bibiane Schönlieb

Iterative missing value imputation based on feature importance

Many datasets suffer from missing values due to various reasons,which not only increases the processing difficulty of related tasks but also reduces the accuracy of classification. To address this problem, the mainstream approach is to use…

Machine Learning · Computer Science 2024-08-14 Cong Guo , Chun Liu , Wei Yang

Imputation of Missing Data Using Linear Gaussian Cluster-Weighted Modeling

Missing data theory deals with the statistical methods in the occurrence of missing data. Missing data occurs when some values are not stored or observed for variables of interest. However, most of the statistical theory assumes that data…

Methodology · Statistics 2021-10-26 Luis Alejandro Masmela-Caita , Thais Paiva Galletti , Marcos Oliveira Prates

Auditing the Imputation Effect on Fairness of Predictive Analytics in Higher Education

Colleges and universities use predictive analytics in a variety of ways to increase student success rates. Despite the potential for predictive analytics, two major barriers exist to their adoption in higher education: (a) the lack of…

Computers and Society · Computer Science 2023-01-02 Hadis Anahideh , Parian Haghighat , Nazanin Nezami , Denisa G`andara

Reliable fairness auditing with semi-supervised inference

Machine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating…

Methodology · Statistics 2026-05-19 Jianhui Gao , Jessica Gronsbell

On the Performance of Imputation Techniques for Missing Values on Healthcare Datasets

Missing values or data is one popular characteristic of real-world datasets, especially healthcare data. This could be frustrating when using machine learning algorithms on such datasets, simply because most machine learning models perform…

Machine Learning · Computer Science 2024-03-25 Luke Oluwaseye Joel , Wesley Doorsamy , Babu Sena Paul

Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking

The imputation of missing values in multivariate time series (MTS) data is critical in ensuring data quality and producing reliable data-driven predictive models. Apart from many statistical approaches, a few recent studies have proposed…

Machine Learning · Computer Science 2023-05-17 Maksims Kazijevs , Manar D. Samad