Related papers: Evaluating Local Explanations using White-box Mode…

Local Rule-Based Explanations of Black Box Decision Systems

The recent years have witnessed the rise of accurate but obscure decision systems which hide the logic of their internal decision processes to the users. The lack of explanations for the decisions of black box systems is a key ethical…

Artificial Intelligence · Computer Science 2018-05-29 Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Dino Pedreschi, Franco Turini, Fosca Giannotti

Evaluating Local Model-Agnostic Explanations of Learning to Rank Models with Decision Paths

Local explanations of learning-to-rank (LTR) models are thought to extract the most important features that contribute to the ranking predicted by the LTR model for a single data point. Evaluating the accuracy of such explanations is…

Machine Learning · Statistics 2022-03-17 Amir Hossein Akhavan Rahnama, Judith Butepage

Evaluating Local Explainability Metrics for Machine Learning Models on Tabular Data

Despite the wide use of explainability techniques to attempt to understand the behavior of Artificial Intelligence (AI), the generated explanations may not always be reliable. An explanation can appear plausible to humans but fail to…

Machine Learning · Computer Science 2026-05-28 Tomás Pereira, João Vitorino, Eva Maia, Isabel Praça

Local Explanations and Self-Explanations for Assessing Faithfulness in black-box LLMs

This paper introduces a novel task to assess the faithfulness of large language models (LLMs) using local perturbations and self-explanations. Many LLMs often require additional context to answer certain questions correctly. For this…

Computation and Language · Computer Science 2024-09-24 Christos Fragkathoulas, Odysseas S. Chlapanis

Local Interpretation Methods to Machine Learning Using the Domain of the Feature Space

As machine learning becomes an important part of many real world applications affecting human lives, new requirements, besides high predictive accuracy, become important. One important requirement is transparency, which has been associated…

Machine Learning · Computer Science 2019-08-01 Tiago Botari, Rafael Izbicki, Andre C. P. L. F. de Carvalho

On the overlooked issue of defining explanation objectives for local-surrogate explainers

Local surrogate approaches for explaining machine learning model predictions have appealing properties, such as being model-agnostic and flexible in their modelling. Several methods exist that fit this description and share this goal.…

Machine Learning · Computer Science 2021-06-11 Rafael Poyiadzi, Xavier Renard, Thibault Laugel, Raul Santos-Rodriguez, Marcin Detyniecki

Framework for Evaluating Faithfulness of Local Explanations

We study the faithfulness of an explanation system to the underlying prediction model. We show that this can be captured by two properties, consistency and sufficiency, and introduce quantitative measures of the extent to which these hold.…

Machine Learning · Computer Science 2022-02-03 Sanjoy Dasgupta, Nave Frost, Michal Moshkovitz

Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models

In order to ensure the reliability of the explanations of machine learning models, it is crucial to establish their advantages and limits and in which case each of these methods outperform. However, the current understanding of when and how…

Machine Learning · Computer Science 2025-02-12 Célia Wafa Ayad, Thomas Bonnier, Benjamin Bosch, Sonali Parbhoo, Jesse Read

Streamlining models with explanations in the learning loop

Several explainable AI methods allow a Machine Learning user to get insights on the classification process of a black-box model in the form of local linear explanations. With such information, the user can judge which features are locally…

Machine Learning · Computer Science 2023-02-16 Francesco Lomuscio, Paolo Bajardi, Alan Perotti, Elvio G. Amparore

An Additive Instance-Wise Approach to Multi-class Model Interpretation

Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main…

Machine Learning · Computer Science 2023-06-02 Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung

Explainability's Gain is Optimality's Loss? -- How Explanations Bias Decision-making

Decisions in organizations are about evaluating alternatives and choosing the one that would best serve organizational goals. To the extent that the evaluation of alternatives could be formulated as a predictive task with appropriate…

Human-Computer Interaction · Computer Science 2022-06-30 Charles Wan, Rodrigo Belo, Leid Zejnilović

Considerations When Learning Additive Explanations for Black-Box Models

Many methods to explain black-box models, whether local or global, are additive. In this paper, we study global additive explanations for non-additive models, focusing on four explanation methods: partial dependence, Shapley explanations…

Machine Learning · Statistics 2023-08-02 Sarah Tan, Giles Hooker, Paul Koch, Albert Gordo, Rich Caruana

Approximating Score-based Explanation Techniques Using Conformal Regression

Score-based explainable machine-learning techniques are often used to understand the logic behind black-box models. However, such explanation techniques are often computationally expensive, which limits their application in time-critical…

Machine Learning · Computer Science 2023-08-24 Amr Alkhatib, Henrik Boström, Sofiane Ennadir, Ulf Johansson

Evaluating Explanation Methods for Neural Machine Translation

Recently many efforts have been devoted to interpreting the black-box NMT models, but little progress has been made on metrics to evaluate explanation methods. Word Alignment Error Rate can be used as such a metric that matches human…

Computation and Language · Computer Science 2020-05-05 Jierui Li, Lemao Liu, Huayang Li, Guanlin Li, Guoping Huang, Shuming Shi

Local Score Dependent Model Explanation for Time Dependent Covariates

The use of deep neural networks to make high risk decisions creates a need for global and local explanations so that users and experts have confidence in the modeling algorithms. We introduce a novel technique to find global and local…

Machine Learning · Computer Science 2019-08-15 Xochitl Watts, Freddy Lecue

Benchmarking and Survey of Explanation Methods for Black Box Models

The widespread adoption of black-box models in Artificial Intelligence has enhanced the need for explanation methods to reveal how these obscure models reach specific decisions. Retrieving explanations is fundamental to unveil possible…

Artificial Intelligence · Computer Science 2021-02-26 Francesco Bodria, Fosca Giannotti, Riccardo Guidotti, Francesca Naretto, Dino Pedreschi, Salvatore Rinzivillo

Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning

Locally interpretable model agnostic explanations (LIME) method is one of the most popular methods used to explain black-box models at a per example level. Although many variants have been proposed, few provide a simple way to produce high…

Machine Learning · Computer Science 2023-10-04 Amit Dhurandhar, Karthikeyan Ramamurthy, Kartik Ahuja, Vijay Arya

Guarantee Regions for Local Explanations

Interpretability methods that utilise local surrogate models (e.g. LIME) are very good at describing the behaviour of the predictive model at a point of interest, but they are not guaranteed to extrapolate to the local region surrounding…

Machine Learning · Computer Science 2024-02-21 Marton Havasi, Sonali Parbhoo, Finale Doshi-Velez

Sound Explanation for Trustworthy Machine Learning

We take a formal approach to the explainability problem of machine learning systems. We argue against the practice of interpreting black-box models via attributing scores to input components due to inherently conflicting goals of…

Machine Learning · Computer Science 2023-06-13 Kai Jia, Pasapol Saowakon, Limor Appelbaum, Martin Rinard

Valid Explanations for Learning to Rank Models

Learning-to-rank (LTR) is a class of supervised learning techniques that apply to ranking problems dealing with a large number of features. The popularity and widespread application of LTR models in prioritizing information in a variety of…

Machine Learning · Computer Science 2020-05-19 Jaspreet Singh, Zhenye Wang, Megha Khosla, Avishek Anand