Related papers: A Supervised Feature Selection Method For Mixed-Ty…

Clustering High-dimensional Data via Feature Selection

High-dimensional clustering analysis is a challenging problem in statistics and machine learning, with broad applications such as the analysis of microarray data and RNA-seq data. In this paper, we propose a new clustering procedure called…

Methodology · Statistics 2022-10-31 Tianqi Liu , Yu Lu , Biqing Zhu , Hongyu Zhao

Unsupervised Feature Selection based on Adaptive Similarity Learning and Subspace Clustering

Feature selection methods have an important role on the readability of data and the reduction of complexity of learning algorithms. In recent years, a variety of efforts are investigated on feature selection problems based on unsupervised…

Machine Learning · Computer Science 2019-12-12 Mohsen Ghassemi Parsa , Hadi Zare , Mehdi Ghatee

Feature selection or extraction decision process for clustering using PCA and FRSD

This paper concerns the critical decision process of extracting or selecting the features before applying a clustering algorithm. It is not obvious to evaluate the importance of the features since the most popular methods to do it are…

Machine Learning · Computer Science 2021-11-23 Jean-Sebastien Dessureault , Daniel Massicotte

A Contrast Based Feature Selection Algorithm for High-dimensional Data set in Machine Learning

Feature selection is an important process in machine learning and knowledge discovery. By selecting the most informative features and eliminating irrelevant ones, the performance of learning algorithms can be improved and the extraction of…

Machine Learning · Computer Science 2024-01-17 Chunxu Cao , Qiang Zhang

Compactness Score: A Fast Filter Method for Unsupervised Feature Selection

Along with the flourish of the information age, massive amounts of data are generated day by day. Due to the large-scale and high-dimensional characteristics of these data, it is often difficult to achieve better decision-making in…

Machine Learning · Computer Science 2023-04-04 Peican Zhu , Xin Hou , Keke Tang , Zhen Wang , Feiping Nie

Deep Feature Screening: Feature Selection for Ultra High-Dimensional Data via Deep Neural Networks

The applications of traditional statistical feature selection methods to high-dimension, low sample-size data often struggle and encounter challenging problems, such as overfitting, curse of dimensionality, computational infeasibility, and…

Machine Learning · Statistics 2023-12-19 Kexuan Li , Fangfang Wang , Lingli Yang , Ruiqi Liu

Combining clustering of variables and feature selection using random forests

Standard approaches to tackle high-dimensional supervised classification problem often include variable selection and dimension reduction procedures. The novel methodology proposed in this paper combines clustering of variables and feature…

Statistics Theory · Mathematics 2018-11-07 Marie Chavent , Robin Genuer , Jerome Saracco

Variable Selection for Clustering and Classification

As data sets continue to grow in size and complexity, effective and efficient techniques are needed to target important features in the variable space. Many of the variable selection techniques that are commonly used alongside clustering…

Computation · Statistics 2013-03-22 Jeffrey L. Andrews , Paul D. McNicholas

Randomized Dimensionality Reduction for k-means Clustering

We study the topic of dimensionality reduction for $k$-means clustering. Dimensionality reduction encompasses the union of two approaches: \emph{feature selection} and \emph{feature extraction}. A feature selection based algorithm for…

Data Structures and Algorithms · Computer Science 2015-03-19 Christos Boutsidis , Anastasios Zouzias , Michael W. Mahoney , Petros Drineas

Multi-view Unsupervised Feature Selection by Cross-diffused Matrix Alignment

Multi-view high-dimensional data become increasingly popular in the big data era. Feature selection is a useful technique for alleviating the curse of dimensionality in multi-view learning. In this paper, we study unsupervised feature…

Machine Learning · Computer Science 2017-05-03 Xiaokai Wei , Bokai Cao , Philip S. Yu

Deep Feature Selection Using a Novel Complementary Feature Mask

Feature selection has drawn much attention over the last decades in machine learning because it can reduce data dimensionality while maintaining the original physical meaning of features, which enables better interpretability than feature…

Machine Learning · Computer Science 2022-09-27 Yiwen Liao , Jochen Rivoir , Raphaël Latty , Bin Yang

Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection

The goal of Feature Selection - comprising filter, wrapper, and embedded approaches - is to find the optimal feature subset for designated downstream tasks. Nevertheless, current feature selection methods are limited by: 1) the selection…

Machine Learning · Computer Science 2023-09-18 Meng Xiao , Dongjie Wang , Min Wu , Pengfei Wang , Yuanchun Zhou , Yanjie Fu

Unsupervised feature selection algorithm framework based on neighborhood interval disturbance fusion

Feature selection technology is a key technology of data dimensionality reduction. Becauseof the lack of label information of collected data samples, unsupervised feature selection has attracted more attention. The universality and…

Machine Learning · Computer Science 2024-10-22 Xiaolin Lv , Liang Du , Peng Zhou , Peng Wu

Feature Selection: A perspective on inter-attribute cooperation

High-dimensional datasets depict a challenge for learning tasks in data mining and machine learning. Feature selection is an effective technique in dealing with dimensionality reduction. It is often an essential data processing step prior…

Machine Learning · Computer Science 2023-09-18 Gustavo Sosa-Cabrera , Santiago Gómez-Guerrero , Miguel García-Torres , Christian E. Schaerer

Hypernetworks for Dynamic Feature Selection

Dynamic feature selection (DFS) is a machine learning framework in which features are acquired sequentially for individual samples under budget constraints. The exponential growth in the number of possible feature acquisition paths forces a…

Machine Learning · Computer Science 2026-05-13 Javier Fumanal-Idocin , Raquel Fernandez-Peralta , Javier Andreu-Perez

Review of Swarm Intelligence-based Feature Selection Methods

In the past decades, the rapid growth of computer and database technologies has led to the rapid growth of large-scale datasets. On the other hand, data mining applications with high dimensional datasets that require high speed and accuracy…

Machine Learning · Computer Science 2020-08-11 Mehrdad Rostami , Kamal Berahmand , Saman Forouzandeh

A new computationally efficient algorithm to solve Feature Selection for Functional Data Classification in high-dimensional spaces

This paper introduces a novel methodology for Feature Selection for Functional Classification, FSFC, that addresses the challenge of jointly performing feature selection and classification of functional data in scenarios with categorical…

Machine Learning · Statistics 2024-10-18 Tobia Boschi , Francesca Bonin , Rodrigo Ordonez-Hurtado , Alessandra Pascale , Jonathan Epperlein

Unsupervised Feature Selection via Robust Autoencoder and Adaptive Graph Learning

Effective feature selection is essential for high-dimensional data analysis and machine learning. Unsupervised feature selection (UFS) aims to simultaneously cluster data and identify the most discriminative features. Most existing UFS…

Machine Learning · Statistics 2026-03-23 Feng Yu , MD Saifur Rahman Mazumder , Ying Su , Oscar Contreras Velasco

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine learning. It is a process of choosing a subset of original features so that the feature space is optimally reduced according to a certain evaluation criterion. The…

Computer Vision and Pattern Recognition · Computer Science 2014-01-07 Vijendra Singh , Shivani Pathak

SACA: Selective Attention-Based Clustering Algorithm

Clustering algorithms are fundamental tools across many fields, with density-based methods offering particular advantages in identifying arbitrarily shaped clusters and handling noise. However, their effectiveness is often limited by the…

Machine Learning · Computer Science 2025-12-01 Meysam Shirdel Bilehsavar , Razieh Ghaedi , Samira Seyed Taheri , Xinqi Fan , Christian O'Reilly