Related papers: Gradient Boosted Feature Selection

Scalable Feature Selection for (Multitask) Gradient Boosted Trees

Gradient Boosted Decision Trees (GBDTs) are widely used for building ranking and relevance models in search and recommendation. Considerations such as latency and interpretability dictate the use of as few features as possible to train…

Machine Learning · Statistics 2021-09-07 Cuize Han , Nikhil Rao , Daria Sorokina , Karthik Subbian

Gradient-based Laplacian Feature Selection

Analysis of high dimensional noisy data is of essence across a variety of research fields. Feature selection techniques are designed to find the relevant feature subset that can facilitate classification or pattern detection. Traditional…

Machine Learning · Computer Science 2014-04-14 Bo Wang , Anna Goldenberg

Feature Gradients: Scalable Feature Selection via Discrete Relaxation

In this paper we introduce Feature Gradients, a gradient-based search algorithm for feature selection. Our approach extends a recent result on the estimation of learnability in the sublinear data regime by showing that the calculation can…

Machine Learning · Statistics 2019-08-29 Rishit Sheth , Nicolo Fusi

Relevant based structure learning for feature selection

Feature selection is an important task in many problems occurring in pattern recognition, bioinformatics, machine learning and data mining applications. The feature selection approach enables us to reduce the computation burden and the…

Machine Learning · Computer Science 2016-08-30 Hadi Zare , Mojtaba Niazi

Unbiased Gradient Boosting Decision Tree with Unbiased Feature Importance

Gradient Boosting Decision Tree (GBDT) has achieved remarkable success in a wide variety of applications. The split finding algorithm, which determines the tree construction process, is one of the most crucial components of GBDT. However,…

Machine Learning · Computer Science 2023-05-19 Zheyu Zhang , Tianping Zhang , Jian Li

Block-distributed Gradient Boosted Trees

The Gradient Boosted Tree (GBT) algorithm is one of the most popular machine learning algorithms used in production, for tasks that include Click-Through Rate (CTR) prediction and learning-to-rank. To deal with the massive datasets…

Machine Learning · Computer Science 2019-05-30 Theodore Vasiloudis , Hyunsu Cho , Henrik Boström

FairFS: Addressing Deep Feature Selection Biases for Recommender System

Large-scale online marketplaces and recommender systems serve as critical technological support for e-commerce development. In industrial recommender systems, features play vital roles as they carry information for downstream models.…

Information Retrieval · Computer Science 2026-02-24 Xianquan Wang , Zhaocheng Du , Jieming Zhu , Qinglin Jia , Zhenhua Dong , Kai Zhang

Financial fraud detection system based on improved random forest and gradient boosting machine (GBM)

This paper proposes a financial fraud detection system based on improved Random Forest (RF) and Gradient Boosting Machine (GBM). Specifically, the system introduces a novel model architecture called GBM-SSRF (Gradient Boosting Machine with…

Statistical Finance · Quantitative Finance 2025-02-25 Tianzuo Hu

Gradient Boosting for Linear Mixed Models

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current…

Methodology · Statistics 2020-11-03 Colin Griesbach , Benjamin Säfken , Elisabeth Waldmann

IVFS: Simple and Efficient Feature Selection for High Dimensional Topology Preservation

Feature selection is an important tool to deal with high dimensional data. In unsupervised case, many popular algorithms aim at maintaining the structure of the original data. In this paper, we propose a simple and effective feature…

Machine Learning · Statistics 2020-04-06 Xiaoyun Li , Chengxi Wu , Ping Li

GPMFS: Global Foundation and Personalized Optimization for Multi-Label Feature Selection

As artificial intelligence methods are increasingly applied to complex task scenarios, high dimensional multi-label learning has emerged as a prominent research focus. At present, the curse of dimensionality remains one of the major…

Machine Learning · Computer Science 2025-04-18 Yifan Cao , Zhilong Mi , Ziqiao Yin , Binghui Guo , Jin Dong

Unsupervised Feature Selection Through Group Discovery

Unsupervised feature selection (FS) is essential for high-dimensional learning tasks where labels are not available. It helps reduce noise, improve generalization, and enhance interpretability. However, most existing unsupervised FS methods…

Machine Learning · Computer Science 2025-11-13 Shira Lifshitz , Ofir Lindenbaum , Gal Mishne , Ron Meir , Hadas Benisty

GOLFS: Feature Selection via Combining Both Global and Local Information for High Dimensional Clustering

It is important to identify the discriminative features for high dimensional clustering. However, due to the lack of cluster labels, the regularization methods developed for supervised feature selection can not be directly applied. To learn…

Machine Learning · Statistics 2025-07-16 Zhaoyu Xing , Yang Wan , Juan Wen , Wei Zhong

Class-Level Feature Selection Method Using Feature Weighted Growing Self-Organising Maps

There have been several attempts to develop Feature Selection (FS) algorithms capable of identifying features that are relevant in a dataset. Although in certain applications the FS algorithms can be seen to be successful, they have similar…

Machine Learning · Computer Science 2025-03-18 Andrew Starkey , Uduak Idio Akpan , Omaimah AL Hosni , Yaseen Pullissery

Feature Selection Based on Sparse Neural Network Layer with Normalizing Constraints

Feature selection is important step in machine learning since it has shown to improve prediction accuracy while depressing the curse of dimensionality of high dimensional data. The neural networks have experienced tremendous success in…

Machine Learning · Computer Science 2021-07-13 Peter Bugata , Peter Drotar

Large-scale Multi-objective Feature Selection: A Multi-phase Search Space Shrinking Approach

Feature selection is a crucial step in machine learning, especially for high-dimensional datasets, where irrelevant and redundant features can degrade model performance and increase computational costs. This paper proposes a novel…

Neural and Evolutionary Computing · Computer Science 2024-10-30 Azam Asilian Bidgoli , Shahryar Rahnamayan

Performance analysis of unsupervised feature selection methods

Feature selection (FS) is a process which attempts to select more informative features. In some cases, too many redundant or irrelevant features may overpower main features for classification. Feature selection can remedy this problem and…

Machine Learning · Computer Science 2013-06-07 A. Nisthana Parveen , H. Hannah Inbarani , E. N. Sathishkumar

A Contrast Based Feature Selection Algorithm for High-dimensional Data set in Machine Learning

Feature selection is an important process in machine learning and knowledge discovery. By selecting the most informative features and eliminating irrelevant ones, the performance of learning algorithms can be improved and the extraction of…

Machine Learning · Computer Science 2024-01-17 Chunxu Cao , Qiang Zhang

Supervised Feature Selection with Neuron Evolution in Sparse Neural Networks

Feature selection that selects an informative subset of variables from data not only enhances the model interpretability and performance but also alleviates the resource demands. Recently, there has been growing attention on feature…

Neural and Evolutionary Computing · Computer Science 2023-03-15 Zahra Atashgahi , Xuhao Zhang , Neil Kichler , Shiwei Liu , Lu Yin , Mykola Pechenizkiy , Raymond Veldhuis , Decebal Constantin Mocanu

Margin-aware Fuzzy Rough Feature Selection: Bridging Uncertainty Characterization and Pattern Classification

Fuzzy rough feature selection (FRFS) is an effective means of addressing the curse of dimensionality in high-dimensional data. By removing redundant and irrelevant features, FRFS helps mitigate classifier overfitting, enhance generalization…

Machine Learning · Computer Science 2025-05-22 Suping Xu , Lin Shang , Keyu Liu , Hengrong Ju , Xibei Yang , Witold Pedrycz