Related papers: Handling Missing Data with Graph Representation Le…

GEDI: A Graph-based End-to-end Data Imputation Framework

Data imputation is an effective way to handle missing data, which is common in practical applications. In this study, we propose and test a novel data imputation process that achieves two important goals: (1) preserve the row-wise…

Machine Learning · Computer Science 2023-09-13 Katrina Chen, Xiuqin Liang, Zheng Ma, Zhibin Zhang

Data Imputation by Pursuing Better Classification: A Supervised Kernel-Based Method

Data imputation, the process of filling in missing feature elements for incomplete data sets, plays a crucial role in data-driven learning. A fundamental belief is that data imputation is helpful for learning performance, and it follows…

Machine Learning · Computer Science 2025-09-30 Ruikai Yang, Fan He, Mingzhen He, Kaijie Wang, Xiaolin Huang

Enhancing Missing Data Imputation through Combined Bipartite Graph and Complete Directed Graph

In this paper, we aim to address a significant challenge in the field of missing data imputation: identifying and leveraging the interdependencies among features to enhance missing data imputation for tabular data. We introduce a novel…

Machine Learning · Computer Science 2024-11-08 Zhaoyang Zhang, Hongtu Zhu, Ziqi Chen, Yingjie Zhang, Hai Shu

Learning Graph-based Patch Representations for Identifying and Assessing Silent Vulnerability Fixes

Software projects depend on many third-party libraries, so high-risk vulnerabilities can propagate through the dependency chain to downstream projects. Owing to the subjective nature of patch management, software vendors…

Software Engineering · Computer Science 2024-09-16 Mei Han, Lulu Wang, Jianming Chang, Bixin Li, Chunguang Zhang

Data Imputation with Iterative Graph Reconstruction

Effective data imputation demands rich latent "structure" discovery capabilities from "plain" tabular data. Recent advances in graph neural networks-based data imputation solutions show their strong structure learning potential by…

Machine Learning · Computer Science 2024-04-16 Jiajun Zhong, Weiwei Ye, Ning Gui

Label-Guided Imputation via Forest-Based Proximities for Improved Time Series Classification

Missing data is a common problem in time series data. Most methods for imputation ignore label information pertaining to the time series even if that information exists. In this paper, we provide a framework for missing data imputation in…

Machine Learning · Statistics 2025-09-30 Jake S. Rhodes, Adam G. Rustad, Sofia Pelagalli Maia, Evan Thacker, Hyunmi Choi, Jose Gutierrez, Tatjana Rundek, Ben Shaw

Generative Imputation and Stochastic Prediction

In many machine learning applications, we are faced with incomplete datasets. In the literature, missing data imputation techniques have been mostly concerned with filling missing values. However, the existence of missing values is…

Machine Learning · Computer Science 2020-09-07 Mohammad Kachuee, Kimmo Karkkainen, Orpaz Goldstein, Sajad Darabi, Majid Sarrafzadeh

Label Efficient Semi-Supervised Learning via Graph Filtering

Graph-based methods have been demonstrated as one of the most effective approaches for semi-supervised learning, as they can exploit the connectivity patterns between labeled and unlabeled data samples to improve learning performance.…

Machine Learning · Computer Science 2019-07-01 Qimai Li, Xiao-Ming Wu, Han Liu, Xiaotong Zhang, Zhichao Guan

An End-to-End Model for Time Series Classification In the Presence of Missing Values

Time series classification with missing data is a prevalent issue in time series analysis, as temporal data often contain missing values in practical applications. The traditional two-stage approach, which handles imputation and…

Machine Learning · Computer Science 2024-08-13 Pengshuai Yao, Mengna Liu, Xu Cheng, Fan Shi, Huan Li, Xiufeng Liu, Shengyong Chen

GRAPE for Fast and Scalable Graph Processing and random walk-based Embedding

Graph Representation Learning (GRL) methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are…

Machine Learning · Computer Science 2023-05-09 Luca Cappelletti, Tommaso Fontana, Elena Casiraghi, Vida Ravanmehr, Tiffany J. Callahan, Carlos Cano, Marcin P. Joachimiak, Christopher J. Mungall, Peter N. Robinson, Justin Reese, Giorgio Valentini

Graph Convolutional Networks for Graphs Containing Missing Features

Graph Convolutional Network (GCN) has experienced great success in graph analysis tasks. It works by smoothing the node features across the graph. The current GCN models overwhelmingly assume that the node feature information is complete.…

Machine Learning · Computer Science 2020-12-08 Hibiki Taguchi, Xin Liu, Tsuyoshi Murata

A simple yet effective baseline for non-attributed graph classification

Graphs are complex objects that do not lend themselves easily to typical learning tasks. Recently, a range of approaches based on graph kernels or graph neural networks have been developed for graph classification and for representation…

Machine Learning · Computer Science 2022-05-19 Chen Cai, Yusu Wang

Confidence-Based Feature Imputation for Graphs with Partially Known Features

This paper investigates a missing feature imputation problem for graph learning tasks. Several methods have previously addressed learning tasks on graphs with missing features. However, in cases of high rates of missing features, they were…

Machine Learning · Computer Science 2023-05-30 Daeho Um, Jiwoong Park, Seulki Park, Jin Young Choi

A Flexible Generative Framework for Graph-based Semi-supervised Learning

We consider a family of problems that are concerned with making predictions for the majority of unlabeled, graph-structured data samples based on a small proportion of labeled samples. Relational information among the data samples, often…

Machine Learning · Computer Science 2019-11-05 Jiaqi Ma, Weijing Tang, Ji Zhu, Qiaozhu Mei

Graph Belief Propagation Networks

With the wide-spread availability of complex relational data, semi-supervised node classification in graphs has become a central machine learning problem. Graph neural networks are a recent class of easy-to-train and accurate methods for…

Machine Learning · Computer Science 2021-06-08 Junteng Jia, Cenk Baykal, Vamsi K. Potluru, Austin R. Benson

Novel Representation Learning Technique using Graphs for Performance Analytics

The performance analytics domain in High Performance Computing (HPC) uses tabular data to solve regression problems, such as predicting the execution time. Existing Machine Learning (ML) techniques leverage the correlations among features…

Machine Learning · Computer Science 2024-01-22 Tarek Ramadan, Ankur Lahiry, Tanzima Z. Islam

Graph Representation Learning via Multi-task Knowledge Distillation

Machine learning on graph structured data has attracted much research interest due to its ubiquity in real world data. However, how to efficiently represent graph data in a general way is still an open problem. Traditional methods use…

Machine Learning · Computer Science 2019-11-14 Jiaqi Ma, Qiaozhu Mei

Semi-Supervised Graph Imbalanced Regression

Data imbalance is easily found in annotated data when the observations of certain continuous label values are difficult to collect for regression tasks. When it comes to molecule and polymer property predictions, the annotated graph…

Machine Learning · Computer Science 2023-05-23 Gang Liu, Tong Zhao, Eric Inae, Tengfei Luo, Meng Jiang

Impact Of Missing Data Imputation On The Fairness And Accuracy Of Graph Node Classifiers

Analysis of the fairness of machine learning (ML) algorithms has recently attracted many researchers' interest. Most ML methods show bias toward protected groups, which limits the applicability of ML models in many applications like crime rate…

Machine Learning · Computer Science 2022-11-03 Haris Mansoor, Sarwan Ali, Shafiq Alam, Muhammad Asad Khan, Umair ul Hassan, Imdadullah Khan

Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks

Dealing with missing values and incomplete time series is a labor-intensive, tedious, inevitable task when handling data coming from real-world applications. Effective spatio-temporal representations would allow imputation methods to…

Machine Learning · Computer Science 2022-02-11 Andrea Cini, Ivan Marisca, Cesare Alippi