Related papers: Single Sample Feature Importance: An Interpretable…

Provable Recovery of Locally Important Signed Features and Interactions from Random Forest

Feature and Interaction Importance (FII) methods are essential in supervised learning for assessing the relevance of input variables and their interactions in complex prediction models. In many domains, such as personalized medicine, local…

Machine Learning · Statistics 2025-12-15 Kata Vuk, Nicolas Alexander Ihlo, Merle Behr

Feature Importance Measurement based on Decision Tree Sampling

Random forest is effective for prediction tasks but the randomness of tree generation hinders interpretability in feature importance analysis. To address this, we proposed DT-Sampler, a SAT-based method for measuring feature importance in…

Machine Learning · Computer Science 2023-07-26 Chao Huang, Diptesh Das, Koji Tsuda

Relative Feature Importance

Interpretable Machine Learning (IML) methods are used to gain insight into the relevance of a feature of interest for the performance of a model. Commonly used IML methods differ in whether they consider features of interest in isolation,…

Machine Learning · Statistics 2021-04-23 Gunnar König, Christoph Molnar, Bernd Bischl, Moritz Grosse-Wentrup

On the (In)Significance of Feature Selection in High-Dimensional Datasets

Feature selection (FS) is assumed to improve predictive performance and identify meaningful features in high-dimensional datasets. Surprisingly, small random subsets of features (0.02-1%) match or outperform the predictive performance of…

Machine Learning · Computer Science 2025-09-22 Bhavesh Neekhra, Debayan Gupta, Partha Pratim Chakrabarti

Interpreting Deep Forest through Feature Contribution and MDI Feature Importance

Deep forest is a non-differentiable deep model which has achieved impressive empirical success across a wide variety of applications, especially on categorical/symbolic or mixed modeling tasks. Many of the application fields prefer…

Machine Learning · Computer Science 2023-05-02 Yi-Xiao He, Shen-Huan Lyu, Yuan Jiang

Better Model Selection with a new Definition of Feature Importance

Feature importance aims at measuring how crucial each input feature is for model prediction. It is widely used in feature engineering, model selection and explainable artificial intelligence (XAI). In this paper, we propose a new tree-model…

Machine Learning · Statistics 2020-09-17 Fan Fang, Carmine Ventre, Lingbo Li, Leslie Kanthan, Fan Wu, Michail Basios

Feature Importance Measure for Non-linear Learning Algorithms

Complex problems may require sophisticated, non-linear learning methods such as kernel machines or deep neural networks to achieve state of the art prediction accuracies. However, high prediction accuracies are not the only objective to…

Artificial Intelligence · Computer Science 2016-11-24 Marina M.-C. Vidovic, Nico Görnitz, Klaus-Robert Müller, Marius Kloft

Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models

In order to ensure the reliability of the explanations of machine learning models, it is crucial to establish their advantages and limits and in which case each of these methods outperform. However, the current understanding of when and how…

Machine Learning · Computer Science 2025-02-12 Célia Wafa Ayad, Thomas Bonnier, Benjamin Bosch, Sonali Parbhoo, Jesse Read

Inferring feature importance with uncertainties in high-dimensional data

Estimating feature importance is a significant aspect of explaining data-based models. Besides explaining the model itself, an equally relevant question is which features are important in the underlying data generating process. We present a…

Machine Learning · Computer Science 2021-09-21 Pål Vegard Johnsen, Inga Strümke, Signe Riemer-Sørensen, Andrew Thomas DeWan, Mette Langaas

SHAFF: Fast and consistent SHApley eFfect estimates via random Forests

Interpretability of learning algorithms is crucial for applications involving critical decisions, and variable importance is one of the main interpretation tools. Shapley effects are now widely used to interpret both tree ensembles and…

Machine Learning · Statistics 2022-02-03 Clément Bénard, Gérard Biau, Sébastien da Veiga, Erwan Scornet

Interpretable Network-assisted Random Forest+

Machine learning algorithms often assume that training samples are independent. When data points are connected by a network, the induced dependency between samples is both a challenge, reducing effective sample size, and an opportunity to…

Machine Learning · Statistics 2025-09-22 Tiffany M. Tang, Elizaveta Levina, Ji Zhu

Inherent Inconsistencies of Feature Importance

The rapid advancement and widespread adoption of machine learning-driven technologies have underscored the practical and ethical need for creating interpretable artificial intelligence systems. Feature importance, a method that assigns…

Machine Learning · Computer Science 2023-12-07 Nimrod Harel, Uri Obolski, Ran Gilad-Bachrach

Interpreting random forest classification models using a feature contribution method

Model interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the…

Machine Learning · Computer Science 2013-12-05 Anna Palczewska, Jan Palczewski, Richard Marchese Robinson, Daniel Neagu

Statistical Significance of Feature Importance Rankings

Feature importance scores are ubiquitous tools for understanding the predictions of machine learning models. However, many popular attribution methods suffer from high instability due to random sampling. Leveraging novel ideas from…

Machine Learning · Statistics 2025-07-08 Jeremy Goldwasser, Giles Hooker

Feature Importance Guided Random Forest Learning with Simulated Annealing Based Hyperparameter Tuning

This paper introduces a novel framework for enhancing Random Forest classifiers by integrating probabilistic feature sampling and hyperparameter tuning via Simulated Annealing. The proposed framework exhibits substantial advancements in…

Machine Learning · Computer Science 2025-11-12 Kowshik Balasubramanian, Andre Williams, Ismail Butun

Efficient nonparametric statistical inference on population feature importance using Shapley values

The true population-level importance of a variable in a prediction task provides useful knowledge about the underlying data-generating mechanism and can help in deciding which measurements to collect in subsequent experiments. Valid…

Methodology · Statistics 2025-10-23 Brian D. Williamson, Jean Feng

A scalable saliency-based Feature selection method with instance level information

Classic feature selection techniques remove those features that are either irrelevant or redundant, achieving a subset of relevant features that help to provide a better knowledge extraction. This allows the creation of compact models that…

Machine Learning · Computer Science 2020-12-16 Brais Cancela, Verónica Bolón-Canedo, Amparo Alonso-Betanzos, João Gama

The Feature Importance Ranking Measure

Most accurate predictions are typically obtained by learning machines with complex feature spaces (as e.g. induced by kernels). Unfortunately, such decision rules are hardly accessible to humans and cannot easily be used to gain insights…

Machine Learning · Statistics 2010-08-13 Alexander Zien, Nicole Kraemer, Soeren Sonnenburg, Gunnar Raetsch

The Berkelmans-Pries Feature Importance Method: A Generic Measure of Informativeness of Features

Over the past few years, the use of machine learning models has emerged as a generic and powerful means for prediction purposes. At the same time, there is a growing demand for interpretability of prediction models. To determine which…

Machine Learning · Computer Science 2023-01-13 Joris Pries, Guus Berkelmans, Sandjai Bhulai, Rob van der Mei

Bayesian Importance of Features (BIF)

We introduce a simple and intuitive framework that provides quantitative explanations of statistical models through the probabilistic assessment of input feature importance. The core idea comes from utilizing the Dirichlet distribution to…

Machine Learning · Statistics 2022-09-20 Kamil Adamczewski, Frederik Harder, Mijung Park