Related papers: Feature Selection On Boolean Symbolic Objects

Feature and Variable Selection in Classification

The amount of information in the form of features and variables avail- able to machine learning algorithms is ever increasing. This can lead to classifiers that are prone to overfitting in high dimensions, high di- mensional models do not…

Machine Learning · Computer Science 2014-02-12 Aaron Karper

Higher Order Mutual Information Approximation for Feature Selection

Feature selection is a process of choosing a subset of relevant features so that the quality of prediction models can be improved. An extensive body of work exists on information-theoretic feature selection, based on maximizing Mutual…

Machine Learning · Computer Science 2016-12-05 Jilin Wu , Soumyajit Gupta , Chandrajit Bajaj

Dynamic Feature Selection from Variable Feature Sets Using Features of Features

Machine learning models usually assume that a set of feature values used to obtain an output is fixed in advance. However, in many real-world problems, a cost is associated with measuring these features. To address the issue of reducing…

Machine Learning · Computer Science 2025-03-13 Katsumi Takahashi , Koh Takeuchi , Hisashi Kashima

Scalable and Accurate Online Feature Selection for Big Data

Feature selection is important in many big data applications. Two critical challenges closely associate with big data. Firstly, in many big data applications, the dimensionality is extremely high, in millions, and keeps growing. Secondly,…

Machine Learning · Computer Science 2016-07-29 Kui Yu , Xindong Wu , Wei Ding , Jian Pei

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine learning. It is a process of choosing a subset of original features so that the feature space is optimally reduced according to a certain evaluation criterion. The…

Computer Vision and Pattern Recognition · Computer Science 2014-01-07 Vijendra Singh , Shivani Pathak

Relevant based structure learning for feature selection

Feature selection is an important task in many problems occurring in pattern recognition, bioinformatics, machine learning and data mining applications. The feature selection approach enables us to reduce the computation burden and the…

Machine Learning · Computer Science 2016-08-30 Hadi Zare , Mojtaba Niazi

Multi-objective Binary Coordinate Search for Feature Selection

A supervised feature selection method selects an appropriate but concise set of features to differentiate classes, which is highly expensive for large-scale datasets. Therefore, feature selection should aim at both minimizing the number of…

Machine Learning · Computer Science 2024-02-21 Sevil Zanjani Miyandoab , Shahryar Rahnamayan , Azam Asilian Bidgoli

Deep Feature Selection Using a Novel Complementary Feature Mask

Feature selection has drawn much attention over the last decades in machine learning because it can reduce data dimensionality while maintaining the original physical meaning of features, which enables better interpretability than feature…

Machine Learning · Computer Science 2022-09-27 Yiwen Liao , Jochen Rivoir , Raphaël Latty , Bin Yang

Feature Selection with Distance Correlation

Choosing which properties of the data to use as input to multivariate decision algorithms -- a.k.a. feature selection -- is an important step in solving any problem with machine learning. While there is a clear trend towards training…

High Energy Physics - Phenomenology · Physics 2022-12-02 Ranit Das , Gregor Kasieczka , David Shih

A data-science pipeline to enable the Interpretability of Many-Objective Feature Selection

Many-Objective Feature Selection (MOFS) approaches use four or more objectives to determine the relevance of a subset of features in a supervised learning task. As a consequence, MOFS typically returns a large set of non-dominated…

Machine Learning · Computer Science 2023-12-01 Uchechukwu F. Njoku , Alberto Abelló , Besim Bilalli , Gianluca Bontempi

Survey on Feature Selection

Feature selection plays an important role in the data mining process. It is needed to deal with the excessive number of features, which can become a computational burden on the learning algorithms. It is also necessary, even when…

Machine Learning · Computer Science 2015-10-13 Tarek Amr Abdallah , Beatriz de La Iglesia

Feature Selection: A Data Perspective

Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data mining and machine learning problems. The objectives of feature…

Machine Learning · Computer Science 2018-08-28 Jundong Li , Kewei Cheng , Suhang Wang , Fred Morstatter , Robert P. Trevino , Jiliang Tang , Huan Liu

Challenges of Feature Selection for Big Data Analytics

We are surrounded by huge amounts of large-scale high dimensional data. It is desirable to reduce the dimensionality of data for many learning tasks due to the curse of dimensionality. Feature selection has shown its effectiveness in many…

Machine Learning · Computer Science 2016-11-08 Jundong Li , Huan Liu

On the (In)Significance of Feature Selection in High-Dimensional Datasets

Feature selection (FS) is assumed to improve predictive performance and identify meaningful features in high-dimensional datasets. Surprisingly, small random subsets of features (0.02-1%) match or outperform the predictive performance of…

Machine Learning · Computer Science 2025-09-22 Bhavesh Neekhra , Debayan Gupta , Partha Pratim Chakrabarti

Large-scale Multi-objective Feature Selection: A Multi-phase Search Space Shrinking Approach

Feature selection is a crucial step in machine learning, especially for high-dimensional datasets, where irrelevant and redundant features can degrade model performance and increase computational costs. This paper proposes a novel…

Neural and Evolutionary Computing · Computer Science 2024-10-30 Azam Asilian Bidgoli , Shahryar Rahnamayan

A Matrix Logic Approach to Efficient Frequent Itemset Discovery in Large Data Sets

This paper proposes a frequent itemset mining algorithm based on the Boolean matrix method, aiming to solve the storage and computational bottlenecks of traditional frequent pattern mining algorithms in high-dimensional and large-scale…

Databases · Computer Science 2024-12-30 Xuan Li , Tingyi Ruan , Yankaiqi Li , Quanchao Lu , Xiaoxuan Sun

Clustering of Modal Valued Symbolic Data

Symbolic Data Analysis is based on special descriptions of data - symbolic objects (SO). Such descriptions preserve more detailed information about units and their clusters than the usual representations with mean values. A special kind of…

Machine Learning · Statistics 2020-10-27 Vladimir Batagelj , Nataša Kejžar , Simona Korenjak-Černe

Permutation-based multi-objective evolutionary feature selection for high-dimensional data

Feature selection is a critical step in the analysis of high-dimensional data, where the number of features often vastly exceeds the number of samples. Effective feature selection not only improves model performance and interpretability but…

Machine Learning · Computer Science 2025-01-27 Raquel Espinosa , Gracia Sánchez , José Palma , Fernando Jiménez

Review of Swarm Intelligence-based Feature Selection Methods

In the past decades, the rapid growth of computer and database technologies has led to the rapid growth of large-scale datasets. On the other hand, data mining applications with high dimensional datasets that require high speed and accuracy…

Machine Learning · Computer Science 2020-08-11 Mehrdad Rostami , Kamal Berahmand , Saman Forouzandeh

MOANOFS: Multi-Objective Automated Negotiation based Online Feature Selection System for Big Data Classification

Feature Selection (FS) plays an important role in learning and classification tasks. The object of FS is to select the relevant and non-redundant features. Considering the huge amount number of features in real-world applications, FS…

Machine Learning · Computer Science 2019-06-19 Fatma BenSaid , Adel M. Alimi