Causally-Aware Unsupervised Feature Selection Learning

Zongxin Shen; Yanyong Huang; Dongjie Wang; Minbo Ma; Fengmao Lv; Tianrui Li

Machine Learning 2025-01-28 v2 Methodology

Abstract

Unsupervised feature selection (UFS) has recently gained attention for its effectiveness in processing unlabeled high-dimensional data. However, existing methods overlook the intrinsic causal mechanisms within the data, resulting in the selection of irrelevant features and poor interpretability. Additionally, previous graph-based methods fail to account for the differing impacts of non-causal and causal features in constructing the similarity graph, which leads to false links in the generated graph. To address these issues, a novel UFS method, called Causally-Aware UnSupErvised Feature Selection learning (CAUSE-FS), is proposed. CAUSE-FS introduces a novel causal regularizer that reweights samples to balance the confounding distribution of each treatment feature. This regularizer is subsequently integrated into a generalized unsupervised spectral regression model to mitigate spurious associations between features and clustering labels, thus achieving causal feature selection. Furthermore, CAUSE-FS employs causality-guided hierarchical clustering to partition features with varying causal contributions into multiple granularities. By integrating similarity graphs learned adaptively at different granularities, CAUSE-FS increases the importance of causal features when constructing the fused similarity graph to capture the reliable local structure of data. Extensive experimental results demonstrate the superiority of CAUSE-FS over state-of-the-art methods, with its interpretability further validated through feature visualization.
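The sample-reweighting idea behind the causal regularizer can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes each feature in turn is treated as a binary "treatment" (binarized here at its median, an illustrative choice), that all remaining features act as confounders, and that imbalance is measured as the squared gap between the weighted confounder means of the treated and control groups. The function name `confounder_balance_loss` is hypothetical.

```python
import numpy as np


def confounder_balance_loss(X, w):
    """Sum, over treatment features j, of the squared gap between the
    weighted confounder means of the treated and control groups.

    X : (n_samples, n_features) data matrix
    w : (n_samples,) non-negative sample weights
    """
    X = np.asarray(X, dtype=float)
    w = np.asarray(w, dtype=float)
    n, d = X.shape

    # Assumption: binarize each feature at its median to define treatment status.
    T = (X > np.median(X, axis=0)).astype(float)

    loss = 0.0
    for j in range(d):
        treated = T[:, j]
        control = 1.0 - treated
        w_t, w_c = w * treated, w * control
        if w_t.sum() == 0 or w_c.sum() == 0:
            continue  # skip features with an empty (weighted) group

        confounders = np.delete(X, j, axis=1)  # every other feature
        mean_t = confounders.T @ w_t / w_t.sum()
        mean_c = confounders.T @ w_c / w_c.sum()
        loss += np.sum((mean_t - mean_c) ** 2)
    return loss


# Toy data: a full 2x2 factorial design is balanced under uniform weights.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
print(confounder_balance_loss(X, np.ones(4)))                      # balanced
print(confounder_balance_loss(X, np.array([1.0, 0.0, 0.0, 1.0])))  # imbalanced
```

In a full method, a loss of this kind would be minimized over `w` jointly with the feature selection objective, so that the reweighted samples weaken spurious feature-label associations; the sketch only evaluates the imbalance for a given set of weights.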

Keywords

feature selection; causal inference; nonnegative matrix factorization

Cite

@article{arxiv.2410.12224,
  title  = {Causally-Aware Unsupervised Feature Selection Learning},
  author = {Zongxin Shen and Yanyong Huang and Dongjie Wang and Minbo Ma and Fengmao Lv and Tianrui Li},
  journal= {arXiv preprint arXiv:2410.12224},
  year   = {2025}
}
