Related papers: Selective Explanations

An Additive Instance-Wise Approach to Multi-class Model Interpretation

Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main…

Machine Learning · Computer Science · 2023-06-02 · Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung

Streamlining models with explanations in the learning loop

Several explainable AI methods allow a Machine Learning user to get insights on the classification process of a black-box model in the form of local linear explanations. With such information, the user can judge which features are locally…

Machine Learning · Computer Science · 2023-02-16 · Francesco Lomuscio, Paolo Bajardi, Alan Perotti, Elvio G. Amparore

Lifting Interpretability-Performance Trade-off via Automated Feature Engineering

Complex black-box predictive models may have high performance, but lack of interpretability causes problems like lack of trust, lack of stability, sensitivity to concept drift. On the other hand, achieving satisfactory accuracy of…

Machine Learning · Computer Science · 2020-02-12 · Alicja Gosiewska, Przemyslaw Biecek

Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

As deep vision models' popularity rapidly increases, there is a growing emphasis on explanations for model predictions. The inherently explainable attribution method aims to enhance the understanding of model behavior by identifying the…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-07 · Xianren Zhang, Dongwon Lee, Suhang Wang

Biased Models Have Biased Explanations

We study fairness in Machine Learning (FairML) through the lens of attribute-based explanations generated for machine learning models. Our hypothesis is: Biased Models have Biased Explanations. To establish that, we first translate existing…

Machine Learning · Computer Science · 2020-12-22 · Aditya Jain, Manish Ravula, Joydeep Ghosh

An Information-Theoretic Approach to Personalized Explainable Machine Learning

Automated decision making is used routinely throughout our everyday life. Recommender systems decide which jobs, movies, or other user profiles might be interesting to us. Spell checkers help us to make good use of language. Fraud detection…

Machine Learning · Computer Science · 2020-07-15 · Alexander Jung, Pedro H. J. Nardelli

Altruist: Argumentative Explanations through Local Interpretations of Predictive Models

Explainable AI is an emerging field providing solutions for acquiring insights into automated systems' rationale. It has been put on the AI map by suggesting ways to tackle key ethical and societal issues. Existing explanation techniques…

Machine Learning · Computer Science · 2022-05-02 · Ioannis Mollas, Nick Bassiliades, Grigorios Tsoumakas

Have We Learned to Explain?: How Interpretability Methods Can Learn to Encode Predictions in their Interpretations

While the need for interpretable machine learning has been established, many common approaches are slow, lack fidelity, or are hard to evaluate. Amortized explanation methods reduce the cost of providing interpretations by learning a global…

Machine Learning · Statistics · 2021-03-03 · Neil Jethani, Mukund Sudarshan, Yindalon Aphinyanaphongs, Rajesh Ranganath

Achieving interpretable machine learning by functional decomposition of black-box models into explainable predictor effects

Machine learning (ML) has seen significant growth in both popularity and importance. The high prediction accuracy of ML models is often achieved through complex black-box architectures that are difficult to interpret. This interpretability…

Machine Learning · Statistics · 2024-07-29 · David Köhler, David Rügamer, Matthias Schmid

Sound Explanation for Trustworthy Machine Learning

We take a formal approach to the explainability problem of machine learning systems. We argue against the practice of interpreting black-box models via attributing scores to input components due to inherently conflicting goals of…

Machine Learning · Computer Science · 2023-06-13 · Kai Jia, Pasapol Saowakon, Limor Appelbaum, Martin Rinard

Higher-Order Feature Attribution: Bridging Statistics, Explainable AI, and Topological Signal Processing

Feature attributions are post-training analysis methods that assess how various input features of a machine learning model contribute to an output prediction. Their interpretation is straightforward when features act independently, but it…

Machine Learning · Computer Science · 2026-01-29 · Kurt Butler, Guanchao Feng, Petar Djuric

T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

The development of machine learning applications has increased significantly in recent years, motivated by the remarkable ability of learning-powered systems to discover and generalize intricate patterns hidden in massive datasets. Modern…

Machine Learning · Computer Science · 2025-04-25 · Evandro S. Ortigossa, Fábio F. Dias, Brian Barr, Claudio T. Silva, Luis Gustavo Nonato

SynthTree: Co-supervised Local Model Synthesis for Explainable Prediction

Explainable machine learning (XML) has emerged as a major challenge in artificial intelligence (AI). Although black-box models such as Deep Neural Networks and Gradient Boosting often exhibit exceptional predictive accuracy, their lack of…

Methodology · Statistics · 2024-06-18 · Evgenii Kuriabov, Jia Li

Efficient Explanations from Empirical Explainers

Amid a discussion about Green AI in which we see explainability neglected, we explore the possibility to efficiently approximate computationally expensive explainers. To this end, we propose feature attribution modelling with Empirical…

Machine Learning · Computer Science · 2021-09-16 · Robert Schwarzenberg, Nils Feldhus, Sebastian Möller

llmSHAP: A Principled Approach to LLM Explainability

Feature attribution methods help make machine learning-based inference explainable by determining how much one or several features have contributed to a model's output. A particularly popular attribution method is based on the Shapley value…

Artificial Intelligence · Computer Science · 2025-11-04 · Filip Naudot, Tobias Sundqvist, Timotheus Kampik

Probabilistic Sufficient Explanations

Understanding the behavior of learned classifiers is an important task, and various black-box explanations, logical reasoning approaches, and model-specific methods have been proposed. In this paper, we introduce probabilistic sufficient…

Machine Learning · Computer Science · 2021-05-24 · Eric Wang, Pasha Khosravi, Guy Van den Broeck

Distribution-Based Feature Attribution for Explaining the Predictions of Any Classifier

The proliferation of complex, black-box AI models has intensified the need for techniques that can explain their decisions. Feature attribution methods have become a popular solution for providing post-hoc explanations, yet the field has…

Machine Learning · Computer Science · 2025-11-13 · Xinpeng Li, Kai Ming Ting

On Interpretability and Similarity in Concept-Based Machine Learning

Machine Learning (ML) provides important techniques for classification and predictions. Most of these are black-box models for users and do not provide decision-makers with an explanation. For the sake of transparency or more validity of…

Machine Learning · Computer Science · 2021-02-26 · Léonard Kwuida, Dmitry I. Ignatov

LEAFAGE: Example-based and Feature importance-based Explanations for Black-box ML models

As machine learning models become more accurate, they typically become more complex and uninterpretable by humans. The black-box character of these models holds back their acceptance in practice, especially in high-risk domains where the…

Artificial Intelligence · Computer Science · 2019-08-19 · Ajaya Adhikari, D. M. J Tax, Riccardo Satta, Matthias Fath

Toward Machine-Guided, Human-Initiated Explanatory Interactive Learning

Recent work has demonstrated the promise of combining local explanations with active learning for understanding and supervising black-box models. Here we show that, under specific conditions, these algorithms may misrepresent the quality of…

Artificial Intelligence · Computer Science · 2020-07-21 · Teodora Popordanoska, Mohit Kumar, Stefano Teso