Related papers: Introducing Partial Matching Approach in Associati…

Using Association Rules for Better Treatment of Missing Values

The quality of training data for knowledge discovery in databases (KDD) and data mining depends upon many factors, but handling missing values is considered to be a crucial factor in overall data quality. Today real-world datasets contain…

Databases · Computer Science 2009-04-22 Shariq Bashir, Saad Razzaq, Umer Maqbool, Sonya Tahir, Abdul Rauf Baig

Algorithm for Missing Values Imputation in Categorical Data with Use of Association Rules

This paper presents an algorithm for missing values imputation in categorical data. The algorithm is based on association rules and is presented in three variants. Experiments show better accuracy of missing values imputation using the…

Machine Learning · Computer Science 2012-11-09 Jiří Kaiser

An Innovative Imputation and Classification Approach for Accurate Disease Prediction

Imputation of missing attribute values in medical datasets, in order to extract hidden knowledge from them, is an interesting and very challenging research topic. One cannot eliminate missing values in medical records.…

Databases · Computer Science 2016-03-11 Yelipe UshaRani, P. Sammulal

A Novel Approach for Imputation of Missing Attribute Values for Efficient Mining of Medical Datasets - Class Based Cluster Approach

Missing attribute values are quite common in the datasets available in the literature. Missing values are also possible because not all attribute values may be recorded, and hence they are unavailable for several practical reasons. For all these…

Information Retrieval · Computer Science 2016-05-04 Yelipe UshaRani, P. Sammulal

Interpretable Almost Matching Exactly for Causal Inference

We aim to create the highest possible quality of treatment-control matches for categorical data in the potential outcomes framework. Matching methods are heavily used in the social sciences due to their interpretability, but most matching…

Machine Learning · Statistics 2019-06-11 Yameng Liu, Aw Dieng, Sudeepa Roy, Cynthia Rudin, Alexander Volfovsky

An approach to dealing with missing values in heterogeneous data using k-nearest neighbors

Techniques such as clusterization, neural networks and decision making usually rely on algorithms that are not well suited to deal with missing values. However, real world data frequently contains such cases. The simplest solution is to…

Machine Learning · Computer Science 2016-08-16 Davi E. N. Frossard, Igor O. Nunes, Renato A. Krohling

Combined Algorithm for Data Mining using Association rules

Association Rule mining is one of the most important fields in data mining and knowledge discovery. This paper proposes an algorithm that combines the simple association rules derived from basic Apriori Algorithm with the multiple minimum…

Databases · Computer Science 2014-10-07 Walaa Medhat, Ahmed Hassan Yousef, Hoda Korashy Mohamed

On the consistency of supervised learning with missing values

In many application settings, the data have missing entries which make analysis challenging. An abundant literature addresses missing values in an inferential framework: estimating parameters and their variance from incomplete tables. Here,…

Machine Learning · Statistics 2024-03-22 Julie Josse, Jacob M. Chen, Nicolas Prost, Erwan Scornet, Gaël Varoquaux

Une nouvelle approche de complétion des valeurs manquantes dans les bases de données

When tackling real-life datasets, it is common to face the existence of scrambled missing values within data. Considered as 'dirty data', usually it is removed during a pre-processing step. Starting from the fact that 'making up this…

Databases · Computer Science 2019-01-04 Leila Ben Othman

Semiparametric fractional imputation using Gaussian mixture models for handling multivariate missing data

Item nonresponse is frequently encountered in practice. Ignoring missing data can lose efficiency and lead to misleading inference. Fractional imputation is a frequentist approach of imputation for handling missing data. However, the…

Methodology · Statistics 2018-09-18 Hejian Sang, Jae Kwang Kim

Iterative missing value imputation based on feature importance

Many datasets suffer from missing values due to various reasons, which not only increases the processing difficulty of related tasks but also reduces the accuracy of classification. To address this problem, the mainstream approach is to use…

Machine Learning · Computer Science 2024-08-14 Cong Guo, Chun Liu, Wei Yang

A Structured Prediction Approach for Missing Value Imputation

Missing value imputation is an important practical problem. There is a large body of work on it, but there does not exist any work that formulates the problem in a structured output setting. Also, most applications have constraints on the…

Machine Learning · Computer Science 2013-11-12 Rahul Kidambi, Vinod Nair, Sundararajan Sellamanickam, S. Sathiya Keerthi

Performance comparison of State-of-the-art Missing Value Imputation Algorithms on Some Bench mark Datasets

Decision making from data involves identifying a set of attributes that contribute to effective decision making through computational intelligence. The presence of missing values greatly influences the selection of the right set of attributes…

Machine Learning · Computer Science 2013-07-23 M. Naresh Kumar

Statistical Matching using Fractional Imputation

Statistical matching is a technique for integrating two or more data sets when information available for matching records for individual participants across data sets is incomplete. Statistical matching can be viewed as a missing data…

Methodology · Statistics 2015-10-14 Jae-kwang Kim, Emily Berg, Taesung Park

Machine learning with incomplete datasets using multi-objective optimization models

Machine learning techniques have been developed to learn from complete data. When missing values exist in a dataset, the incomplete data should be preprocessed separately by removing data points with missing values or imputation. In this…

Machine Learning · Computer Science 2020-12-25 Hadi A. Khorshidi, Michael Kirley, Uwe Aickelin

Adaptive imputation of missing values for incomplete pattern classification

In the classification of incomplete patterns, the missing values can either play a crucial role in the class determination, or have only little influence (or possibly none) on the classification results, according to the context. We propose a…

Artificial Intelligence · Computer Science 2016-02-09 Zhun-Ga Liu, Quan Pan, Jean Dezert, Arnaud Martin

Benchmarking missing-values approaches for predictive models on health databases

BACKGROUND: As databases grow larger, it becomes harder to fully control their collection, and they frequently come with missing values: incomplete observations. These large databases are well suited to train machine-learning models, for…

Machine Learning · Computer Science 2022-02-23 Alexandre Perez-Lebel, Gaël Varoquaux, Marine Le Morvan, Julie Josse, Jean-Baptiste Poline

Imputations for High Missing Rate Data in Covariates via Semi-supervised Learning Approach

Advancements in data collection techniques and the heterogeneity of data resources can yield high percentages of missing observations on variables, such as block-wise missing data. Under missing-data scenarios, traditional methods such as…

Methodology · Statistics 2022-05-17 Wei Lan, Xuerong Chen, Tao Zou, Chih-Ling Tsai

Classification of datasets with imputed missing values: does imputation quality matter?

Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods,…

Machine Learning · Computer Science 2023-12-20 Tolou Shadbahr, Michael Roberts, Jan Stanczuk, Julian Gilbey, Philip Teare, Sören Dittmer, Matthew Thorpe, Ramon Vinas Torne, Evis Sala, Pietro Lio, Mishal Patel, AIX-COVNET Collaboration, James H. F. Rudd, Tuomas Mirtti, Antti Rannikko, John A. D. Aston, Jing Tang, Carola-Bibiane Schönlieb

Algorithmic Recourse with Missing Values

This paper proposes a new framework of algorithmic recourse (AR) that works even in the presence of missing values. AR aims to provide a recourse action for altering the undesired prediction result given by a classifier. Existing AR methods…

Machine Learning · Computer Science 2024-05-24 Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike