Related papers: Feature selection for high-dimensional integrated …

Feature Selection and Dualities in Maximum Entropy Discrimination

Incorporating feature selection into a classification or regression method often carries a number of advantages. In this paper we formalize feature selection specifically from a discriminative perspective of improving…

Machine Learning · Computer Science 2013-01-18 Tony S. Jebara , Tommi S. Jaakkola

Discovering Support and Affiliated Features from Very High Dimensions

In this paper, a novel learning paradigm is presented to automatically identify groups of informative and correlated features from very high dimensions. Specifically, we explicitly incorporate correlation measures as constraints and then…

Machine Learning · Computer Science 2012-07-03 Yiteng Zhai , Mingkui Tan , Ivor Tsang , Yew Soon Ong

Permutation-based multi-objective evolutionary feature selection for high-dimensional data

Feature selection is a critical step in the analysis of high-dimensional data, where the number of features often vastly exceeds the number of samples. Effective feature selection not only improves model performance and interpretability but…

Machine Learning · Computer Science 2025-01-27 Raquel Espinosa , Gracia Sánchez , José Palma , Fernando Jiménez

Graph Autoencoder-Based Unsupervised Feature Selection with Broad and Local Data Structure Preservation

Feature selection is a dimensionality reduction technique that selects a subset of representative features from high dimensional data by eliminating irrelevant and redundant features. Recently, feature selection combined with sparse…

Computer Vision and Pattern Recognition · Computer Science 2018-04-24 Siwei Feng , Marco F. Duarte

Ultra High-Dimensional Nonlinear Feature Selection for Big Biological Data

Machine learning methods are used to discover complex nonlinear relationships in biological and medical data. However, sophisticated learning models are computationally unfeasible for data with millions of features. Here we introduce the…

Machine Learning · Statistics 2016-08-16 Makoto Yamada , Jiliang Tang , Jose Lugo-Martinez , Ermin Hodzic , Raunak Shrestha , Avishek Saha , Hua Ouyang , Dawei Yin , Hiroshi Mamitsuka , Cenk Sahinalp , Predrag Radivojac , Filippo Menczer , Yi Chang

An Effective Feature Selection Method Based on Pair-Wise Feature Proximity for High Dimensional Low Sample Size Data

Feature selection has been studied widely in the literature. However, the efficacy of the selection criteria for low sample size applications is neglected in most cases. Most of the existing feature selection criteria are based on the…

Computer Vision and Pattern Recognition · Computer Science 2018-07-16 S L Happy , Ramanarayan Mohanty , Aurobinda Routray

Feature Selection Based on Orthogonal Constraints and Polygon Area

The goal of feature selection is to choose the optimal subset of features for a recognition task by evaluating the importance of each feature, thereby achieving effective dimensionality reduction. Currently, proposed feature selection…

Machine Learning · Computer Science 2024-02-27 Zhenxing Zhang , Jun Ge , Zheng Wei , Chunjie Zhou , Yilei Wang

Max-Margin Feature Selection

Many machine learning applications such as in vision, biology and social networking deal with data in high dimensions. Feature selection is typically employed to select a subset of features which im- proves generalization accuracy as well…

Machine Learning · Computer Science 2016-06-15 Yamuna Prasad , Dinesh Khandelwal , K. K. Biswas

Deep Feature Screening: Feature Selection for Ultra High-Dimensional Data via Deep Neural Networks

The applications of traditional statistical feature selection methods to high-dimension, low sample-size data often struggle and encounter challenging problems, such as overfitting, curse of dimensionality, computational infeasibility, and…

Machine Learning · Statistics 2023-12-19 Kexuan Li , Fangfang Wang , Lingli Yang , Ruiqi Liu

Supervised Feature Selection via Dependence Estimation

We introduce a framework for filtering features that employs the Hilbert-Schmidt Independence Criterion (HSIC) as a measure of dependence between the features and the labels. The key idea is that good features should maximise such…

Machine Learning · Computer Science 2007-05-23 Le Song , Alex Smola , Arthur Gretton , Karsten Borgwardt , Justin Bedo

A Cross-Entropy-based Method to Perform Information-based Feature Selection

From a machine learning point of view, identifying a subset of relevant features from a real data set can be useful to improve the results achieved by classification methods and to reduce their time and space complexity. To achieve this…

Machine Learning · Computer Science 2017-05-23 Pietro Cassara , Alessandro Rozza , Mirco Nanni

A Contrast Based Feature Selection Algorithm for High-dimensional Data set in Machine Learning

Feature selection is an important process in machine learning and knowledge discovery. By selecting the most informative features and eliminating irrelevant ones, the performance of learning algorithms can be improved and the extraction of…

Machine Learning · Computer Science 2024-01-17 Chunxu Cao , Qiang Zhang

Towards Ultrahigh Dimensional Feature Selection for Big Data

In this paper, we present a new adaptive feature scaling scheme for ultrahigh-dimensional feature selection on Big Data. To solve this problem effectively, we first reformulate it as a convex semi-infinite programming (SIP) problem and then…

Machine Learning · Computer Science 2019-12-17 Mingkui Tan , Ivor W. Tsang , Li Wang

Feature Selection: A perspective on inter-attribute cooperation

High-dimensional datasets depict a challenge for learning tasks in data mining and machine learning. Feature selection is an effective technique in dealing with dimensionality reduction. It is often an essential data processing step prior…

Machine Learning · Computer Science 2023-09-18 Gustavo Sosa-Cabrera , Santiago Gómez-Guerrero , Miguel García-Torres , Christian E. Schaerer

Embedding Feature Selection for Large-scale Hierarchical Classification

Large-scale Hierarchical Classification (HC) involves datasets consisting of thousands of classes and millions of training instances with high-dimensional features posing several big data challenges. Feature selection that aims to select…

Machine Learning · Computer Science 2017-06-07 Azad Naik , Huzefa Rangwala

A Hybrid Two-layer Feature Selection Method Using GeneticAlgorithm and Elastic Net

Feature selection, as a critical pre-processing step for machine learning, aims at determining representative predictors from a high-dimensional feature space dataset to improve the prediction accuracy. However, the increase in feature…

Machine Learning · Statistics 2020-11-16 Fatemeh Amini , Guiping Hu

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine learning. It is a process of choosing a subset of original features so that the feature space is optimally reduced according to a certain evaluation criterion. The…

Computer Vision and Pattern Recognition · Computer Science 2014-01-07 Vijendra Singh , Shivani Pathak

Recognizing Variables from their Data via Deep Embeddings of Distributions

A key obstacle in automated analytics and meta-learning is the inability to recognize when different datasets contain measurements of the same variable. Because provided attribute labels are often uninformative in practice, this task may be…

Machine Learning · Computer Science 2019-09-12 Jonas Mueller , Alex Smola

A hybrid evolutionary algorithm with importance sampling for multi-dimensional optimization

A hybrid evolutionary algorithm with importance sampling method is proposed for multi-dimensional optimization problems in this paper. In order to make use of the information provided in the search process, a set of visited solutions is…

Neural and Evolutionary Computing · Computer Science 2013-08-26 Guanghui Huang , Zhifeng Pan

Ultrahigh dimensional variable selection: beyond the linear model

Variable selection in high-dimensional space characterizes many contemporary problems in scientific discovery and decision making. Many frequently-used techniques are based on independence screening; examples include correlation ranking…

Methodology · Statistics 2008-12-18 Jianqing Fan , Richard Samworth , Yichao Wu