Related papers: PPFS: Predictive Permutation Feature Selection

Mutual Information-Based Unsupervised Feature Transformation for Heterogeneous Feature Subset Selection

Conventional mutual information (MI) based feature selection (FS) methods are unable to handle heterogeneous feature subset selection properly because of data format differences or estimation methods of MI between feature subset and class…

Machine Learning · Statistics 2015-03-31 Min Wei, Tommy W. S. Chow, Rosa H. M. Chan

Unsupervised Feature Selection via Multi-step Markov Transition Probability

Feature selection is a widely used dimension reduction technique to select feature subsets because of its interpretability. Many methods have been proposed and achieved good results, in which the relationships between adjacent data points…

Machine Learning · Computer Science 2020-06-01 Yan Min, Mao Ye, Liang Tian, Yulin Jian, Ce Zhu, Shangming Yang

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression analysis when the number of predictors is much larger than the sample size. Factor analysis is used to eliminate multicollinearity among predictors, which improves the…

Methodology · Statistics 2025-10-28 Shuntaro Tanaka, Hidetoshi Matsui

Feature Selection via Maximizing Distances between Class Conditional Distributions

For many data-intensive tasks, feature selection is an important preprocessing step. However, most existing methods do not directly and intuitively explore the intrinsic discriminative information of features. We propose a novel feature…

Machine Learning · Computer Science 2024-01-17 Chunxu Cao, Qiang Zhang

Nonparametric IPSS: Fast, flexible feature selection with false discovery control

Feature selection is a critical task in machine learning and statistics. However, existing feature selection methods either (i) rely on parametric methods such as linear or generalized linear models, (ii) lack theoretical false discovery…

Machine Learning · Statistics 2025-07-18 Omar Melikechi, David B. Dunson, Jeffrey W. Miller

Supervised Infinite Feature Selection

In this paper, we present a new feature selection method that is suitable for both unsupervised and supervised problems. We build upon the recently proposed Infinite Feature Selection (IFS) method where feature subsets of all sizes…

Machine Learning · Computer Science 2017-08-22 Sadegh Eskandari, Emre Akbas

Model-Augmented Estimation of Conditional Mutual Information for Feature Selection

Markov blanket feature selection, while theoretically optimal, is generally challenging to implement. This is due to the shortcomings of existing approaches to conditional independence (CI) testing, which tend to struggle either with the…

Machine Learning · Computer Science 2020-06-23 Alan Yang, AmirEmad Ghassami, Maxim Raginsky, Negar Kiyavash, Elyse Rosenbaum

Markov Blanket Ranking using Kernel-based Conditional Dependence Measures

Developing feature selection algorithms that move beyond a pure correlational to a more causal analysis of observational data is an important problem in the sciences. Several algorithms attempt to do so by discovering the Markov blanket of…

Machine Learning · Statistics 2014-05-06 Eric V. Strobl, Shyam Visweswaran

AutoNFS: Automatic Neural Feature Selection

Feature selection (FS) is a fundamental challenge in machine learning, particularly for high-dimensional tabular data, where interpretability and computational efficiency are critical. Existing FS methods often cannot automatically detect…

Machine Learning · Computer Science 2026-04-22 Witold Wydmański, Marek Śmieja

A Contrast Based Feature Selection Algorithm for High-dimensional Data set in Machine Learning

Feature selection is an important process in machine learning and knowledge discovery. By selecting the most informative features and eliminating irrelevant ones, the performance of learning algorithms can be improved and the extraction of…

Machine Learning · Computer Science 2024-01-17 Chunxu Cao, Qiang Zhang

Embedded methods for feature selection in neural networks

The representational capacity of modern neural network architectures has made them a default choice in various applications with high dimensional feature sets. But these high dimensional and potentially noisy features combined with the…

Machine Learning · Computer Science 2020-10-13 Vinay Varma K

Testing Conditional Independence in Supervised Learning Algorithms

We propose the conditional predictive impact (CPI), a consistent and unbiased estimator of the association between one or several features and a given outcome, conditional on a reduced feature set. Building on the knockoff framework of…

Methodology · Statistics 2021-05-14 David S. Watson, Marvin N. Wright

SuRF: a New Method for Sparse Variable Selection, with Application in Microbiome Data Analysis

In this paper, we present a new variable selection method for regression and classification purposes. Our method, called Subsampling Ranking Forward selection (SuRF), is based on LASSO penalised regression, subsampling and forward-selection…

Methodology · Statistics 2021-05-25 Lihui Liu, Hong Gu, Johan Van Limbergen, Toby Kenney

Inference for Multivariate Regression Model based on synthetic data generated under Fixed-Posterior Predictive Sampling: comparison with Plug-in Sampling

The authors derive likelihood-based exact inference methods for the multivariate regression model, for singly imputed synthetic data generated via Posterior Predictive Sampling (PPS) and for multiply imputed synthetic data generated via a…

Statistics Theory · Mathematics 2017-07-26 Ricardo Moura, Martin Klein, Carlos A. Coelho, Bimal Sinha

TRUST-FS: Tensorized Reliable Unsupervised Multi-View Feature Selection for Incomplete Data

Multi-view unsupervised feature selection (MUFS), which selects informative features from multi-view unlabeled data, has attracted increasing research interest in recent years. Although great efforts have been devoted to MUFS, several…

Machine Learning · Computer Science 2025-11-12 Minghui Lu, Yanyong Huang, Minbo Ma, Jinyuan Chang, Dongjie Wang, Xiuwen Yi, Tianrui Li

MH-FSF: A Unified Framework for Overcoming Benchmarking and Reproducibility Limitations in Feature Selection Evaluation

Feature selection is vital for building effective predictive models, as it reduces dimensionality and emphasizes key features. However, current research often suffers from limited benchmarking and reliance on proprietary datasets. This…

Machine Learning · Computer Science 2025-07-16 Vanderson Rocha, Diego Kreutz, Gabriel Canto, Hendrio Bragança, Eduardo Feitosa

Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

Feature selection eliminates redundancy among features to improve downstream task performance while reducing computational overhead. Existing methods often struggle to capture intricate feature interactions and adapt across diverse…

Machine Learning · Computer Science 2026-03-02 Rui Liu, Tao Zhe, Yanjie Fu, Feng Xia, Ted Senator, Dongjie Wang

Piecewise Deterministic Markov Processes for Bayesian Neural Networks

Inference on modern Bayesian Neural Networks (BNNs) often relies on a variational inference treatment, imposing violated assumptions of independence and the form of the posterior. Traditional MCMC approaches avoid these assumptions at the…

Machine Learning · Statistics 2026-04-07 Ethan Goan, Dimitri Perrin, Kerrie Mengersen, Clinton Fookes

Gradient Boosted Feature Selection

A feature selection algorithm should ideally satisfy four conditions: reliably extract relevant features; be able to identify non-linear feature interactions; scale linearly with the number of features and dimensions; allow the…

Machine Learning · Computer Science 2019-01-15 Zhixiang Eddie Xu, Gao Huang, Kilian Q. Weinberger, Alice X. Zheng

Feature Selection via Binary Simultaneous Perturbation Stochastic Approximation

Feature selection (FS) has become an indispensable task in dealing with today's highly complex pattern recognition problems with massive number of features. In this study, we propose a new wrapper approach for FS based on binary…

Machine Learning · Statistics 2016-03-08 Vural Aksakalli, Milad Malekipirbazari