Related papers: Statistically Valid Variable Importance Assessment…

Variable Importance in High-Dimensional Settings Requires Grouping

Explaining the decision process of machine learning algorithms is nowadays crucial for both model's performance enhancement and human comprehension. This can be achieved by assessing the variable importance of single variables, even for…

Machine Learning · Computer Science 2023-12-19 Ahmad Chamma , Bertrand Thirion , Denis A. Engemann

Measuring Variable Importance in Heterogeneous Treatment Effects with Confidence

Causal machine learning holds promise for estimating individual treatment effects from complex data. For successful real-world applications of machine learning methods, it is of paramount importance to obtain reliable insights into which…

Machine Learning · Computer Science 2026-05-22 Joseph Paillard , Angel Reyero Lobo , Vitaliy Kolodyazhniy , Bertrand Thirion , Denis A. Engemann

Challenges in Variable Importance Ranking Under Correlation

Variable importance plays a pivotal role in interpretable machine learning as it helps measure the impact of factors on the output of the prediction model. Model agnostic methods based on the generation of "null" features via permutation…

Machine Learning · Statistics 2024-02-07 Annie Liang , Thomas Jemielita , Andy Liaw , Vladimir Svetnik , Lingkang Huang , Richard Baumgartner , Jason M. Klusowski

Conditional Feature Importance revisited: Double Robustness, Efficiency and Inference

Conditional Feature Importance (CFI) is a classical variable importance measure that accounts for the relationship between the studied feature and the others. However, CFI has not yet been studied from a theoretical perspective because the…

Statistics Theory · Mathematics 2026-02-03 Angel Reyero-Lobo , Pierre Neuvial , Bertrand Thirion

Hierarchical Variable Importance with Statistical Control for Medical Data-Based Prediction

Recent advances in machine learning have greatly expanded the repertoire of predictive methods for medical imaging. However, the interpretability of complex models remains a challenge, which limits their utility in medical applications.…

Machine Learning · Statistics 2025-08-13 Joseph Paillard , Antoine Collas , Denis A. Engemann , Bertrand Thirion

Testing Conditional Independence in Supervised Learning Algorithms

We propose the conditional predictive impact (CPI), a consistent and unbiased estimator of the association between one or several features and a given outcome, conditional on a reduced feature set. Building on the knockoff framework of…

Methodology · Statistics 2021-05-14 David S. Watson , Marvin N. Wright

Conditional Feature Importance for Mixed Data

Despite the popularity of feature importance (FI) measures in interpretable machine learning, the statistical adequacy of these methods is rarely discussed. From a statistical perspective, a major distinction is between analyzing a…

Machine Learning · Statistics 2023-05-03 Kristin Blesch , David S. Watson , Marvin N. Wright

One Permutation Is All You Need: Fast, Reliable Variable Importance and Model Stress-Testing

Reliable estimation of feature contributions in machine learning models is essential for trust, transparency and regulatory compliance, especially when models are proprietary or otherwise operate as black boxes. While permutation-based…

Machine Learning · Statistics 2025-12-24 Albert Dorador

Variable Importance Assessments and Backward Variable Selection for High-Dimensional Data

Variable selection in high-dimensional scenarios is of great interested in statistics. One application involves identifying differentially expressed genes in genomic analysis. Existing methods for addressing this problem have some limits or…

Methodology · Statistics 2018-06-19 Liuhua Peng , Long Qu , Dan Nettleton

Generalized Permutation Framework for Testing Model Variable Significance

A common problem in machine learning is determining if a variable significantly contributes to a model's prediction performance. This problem is aggravated for datasets, such as gene expression datasets, that suffer the worst case of…

Methodology · Statistics 2023-10-13 Yue Wu , Ted Spaide , Kenji Nakamichi , Russell Van Gelder , Aaron Lee

Targeted Learning for Variable Importance

Variable importance is one of the most widely used measures for interpreting machine learning with significant interest from both statistics and machine learning communities. Recently, increasing attention has been directed toward…

Machine Learning · Statistics 2025-12-22 Xiaohan Wang , Yunzhe Zhou , Giles Hooker

Asymptotic Unbiasedness of the Permutation Importance Measure in Random Forest Models

Variable selection in sparse regression models is an important task as applications ranging from biomedical research to econometrics have shown. Especially for higher dimensional regression problems, for which the link function between…

Machine Learning · Statistics 2019-12-10 Burim Ramosaj , Markus Pauly

Relative Feature Importance

Interpretable Machine Learning (IML) methods are used to gain insight into the relevance of a feature of interest for the performance of a model. Commonly used IML methods differ in whether they consider features of interest in isolation,…

Machine Learning · Statistics 2021-04-23 Gunnar König , Christoph Molnar , Bernd Bischl , Moritz Grosse-Wentrup

TRIP: A Nonparametric Test to Diagnose Biased Feature Importance Scores

Along with accurate prediction, understanding the contribution of each feature to the making of the prediction, i.e., the importance of the feature, is a desirable and arguably necessary component of a machine learning model. For a complex…

Machine Learning · Computer Science 2025-07-11 Aaron Foote , Danny Krizanc

Model-agnostic Feature Importance and Effects with Dependent Features -- A Conditional Subgroup Approach

The interpretation of feature importance in machine learning models is challenging when features are dependent. Permutation feature importance (PFI) ignores such dependencies, which can cause misleading interpretations due to extrapolation.…

Machine Learning · Statistics 2023-11-09 Christoph Molnar , Gunnar König , Bernd Bischl , Giuseppe Casalicchio

Incremental Permutation Feature Importance (iPFI): Towards Online Explanations on Data Streams

Explainable Artificial Intelligence (XAI) has mainly focused on static learning scenarios so far. We are interested in dynamic scenarios where data is sampled progressively, and learning is done in an incremental rather than a batch mode.…

Machine Learning · Computer Science 2023-10-31 Fabian Fumagalli , Maximilian Muschalik , Eyke Hüllermeier , Barbara Hammer

Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning

Supervised Fine-Tuning (SFT) of large language models often suffers from task interference and catastrophic forgetting. Recent approaches alleviate this issue by isolating task-critical parameters during training. However, these methods…

Machine Learning · Computer Science 2026-04-16 Zekai Lin , Chao Xue , Di Liang , Xingsheng Han , Peiyang Liu , Xianjie Wu , Lei Jiang , Yu Lu , Haibo Shi , Shuang Liang , Minlong Peng

All Models are Wrong, but Many are Useful: Learning a Variable's Importance by Studying an Entire Class of Prediction Models Simultaneously

Variable importance (VI) tools describe how much covariates contribute to a prediction model's accuracy. However, important variables for one well-performing model (for example, a linear model $f(\mathbf{x})=\mathbf{x}^{T}\beta$ with a…

Methodology · Statistics 2019-12-24 Aaron Fisher , Cynthia Rudin , Francesca Dominici

Model-independent variable selection via the rule-based variable priority

While achieving high prediction accuracy is a fundamental goal in machine learning, an equally important task is finding a small number of features with high explanatory power. One popular selection technique is permutation importance,…

Machine Learning · Statistics 2024-10-02 Min Lu , Hemant Ishwaran

Unrestricted Permutation forces Extrapolation: Variable Importance Requires at least One More Model, or There Is No Free Variable Importance

This paper reviews and advocates against the use of permute-and-predict (PaP) methods for interpreting black box functions. Methods such as the variable importance measures proposed for random forests, partial dependence plots, and…

Methodology · Statistics 2021-10-11 Giles Hooker , Lucas Mentch , Siyu Zhou