Related papers: Explainability for Machine Learning Models: From D…

Leveraging Explanations in Interactive Machine Learning: An Overview

Explanations have gained an increasing level of interest in the AI and Machine Learning (ML) communities in order to improve model transparency and allow users to form a mental model of a trained ML model. However, explanations can go…

Machine Learning · Computer Science 2022-10-11 Stefano Teso , Öznur Alkan , Wolfang Stammer , Elizabeth Daly

From Human Explanation to Model Interpretability: A Framework Based on Weight of Evidence

We take inspiration from the study of human explanation to inform the design and evaluation of interpretability methods in machine learning. First, we survey the literature on human explanation in philosophy, cognitive science, and the…

Artificial Intelligence · Computer Science 2021-09-21 David Alvarez-Melis , Harmanpreet Kaur , Hal Daumé , Hanna Wallach , Jennifer Wortman Vaughan

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack , Sorelle A. Friedler , Carlos Scheidegger , Chitradeep Dutta Roy

Explanation Methods in Deep Learning: Users, Values, Concerns and Challenges

Issues regarding explainable AI involve four components: users, laws & regulations, explanations and algorithms. Together these components provide a context in which explanation methods can be evaluated regarding their adequacy. The goal of…

Artificial Intelligence · Computer Science 2018-03-30 Gabrielle Ras , Marcel van Gerven , Pim Haselager

Explaining Predictions from Machine Learning Models: Algorithms, Users, and Pedagogy

Model explainability has become an important problem in machine learning (ML) due to the increased effect that algorithmic predictions have on humans. Explanations can help users understand not only why ML models make certain predictions,…

Machine Learning · Computer Science 2022-09-13 Ana Lucic

An AI Architecture with the Capability to Explain Recognition Results

Explainability is needed to establish confidence in machine learning results. Some explainable methods take a post hoc approach to explain the weights of machine learning models, others highlight areas of the input contributing to…

Machine Learning · Computer Science 2024-07-15 Paul Whitten , Francis Wolff , Chris Papachristou

A Learning Theoretic Perspective on Local Explainability

In this paper, we explore connections between interpretable machine learning and learning theory through the lens of local approximation explanations. First, we tackle the traditional problem of performance generalization and bound the…

Machine Learning · Computer Science 2020-11-03 Jeffrey Li , Vaishnavh Nagarajan , Gregory Plumb , Ameet Talwalkar

Altruist: Argumentative Explanations through Local Interpretations of Predictive Models

Explainable AI is an emerging field providing solutions for acquiring insights into automated systems' rationale. It has been put on the AI map by suggesting ways to tackle key ethical and societal issues. Existing explanation techniques…

Machine Learning · Computer Science 2022-05-02 Ioannis Mollas , Nick Bassiliades , Grigorios Tsoumakas

Learning to Explain: Answering Why-Questions via Rephrasing

Providing plausible responses to why questions is a challenging but critical goal for language based human-machine interaction. Explanations are challenging in that they require many different forms of abstract knowledge and reasoning.…

Computation and Language · Computer Science 2019-06-05 Allen Nie , Erin D. Bennett , Noah D. Goodman

Online Explanation Generation for Human-Robot Teaming

As AI becomes an integral part of our lives, the development of explainable AI, embodied in the decision-making process of an AI or robotic agent, becomes imperative. For a robotic teammate, the ability to generate explanations to justify…

Artificial Intelligence · Computer Science 2020-09-01 Mehrdad Zakershahrak , Ze Gong , Nikhillesh Sadassivam , Yu Zhang

An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Machine Learning · Computer Science 2019-08-30 Isaac Lage , Emily Chen , Jeffrey He , Menaka Narayanan , Been Kim , Sam Gershman , Finale Doshi-Velez

Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

With the broader and highly successful usage of machine learning in industry and the sciences, there has been a growing demand for Explainable AI. Interpretability and explanation methods for gaining a better understanding about the problem…

Machine Learning · Computer Science 2021-02-26 Wojciech Samek , Grégoire Montavon , Sebastian Lapuschkin , Christopher J. Anders , Klaus-Robert Müller

Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review

Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders. Explaining, in a human-understandable way, the relationship between the input and output of…

Machine Learning · Computer Science 2022-11-17 Sahil Verma , Varich Boonsanong , Minh Hoang , Keegan E. Hines , John P. Dickerson , Chirag Shah

Interpretability and Explainability: A Machine Learning Zoo Mini-tour

In this review, we examine the problem of designing interpretable and explainable machine learning models. Interpretability and explainability lie at the core of many machine learning and statistical applications in medicine, economics,…

Machine Learning · Computer Science 2023-03-02 Ričards Marcinkevičs , Julia E. Vogt

Evaluating Local Explainability Metrics for Machine Learning Models on Tabular Data

Despite the wide use of explainability techniques to attempt to understand the behavior of Artificial Intelligence (AI), the generated explanations may not always be reliable. An explanation can appear plausible to humans but fail to…

Machine Learning · Computer Science 2026-05-28 Tomás Pereira , João Vitorino , Eva Maia , Isabel Praça

Directions for Explainable Knowledge-Enabled Systems

Interest in the field of Explainable Artificial Intelligence has been growing for decades and has accelerated recently. As Artificial Intelligence models have become more complex, and often more opaque, with the incorporation of complex…

Artificial Intelligence · Computer Science 2020-03-18 Shruthi Chari , Daniel M. Gruen , Oshani Seneviratne , Deborah L. McGuinness

Explainable Machine Learning in Deployment

Explainable machine learning offers the potential to provide stakeholders with insights into model behavior by using various methods such as feature importance scores, counterfactual explanations, or influential training data. Yet there is…

Machine Learning · Computer Science 2020-07-13 Umang Bhatt , Alice Xiang , Shubham Sharma , Adrian Weller , Ankur Taly , Yunhan Jia , Joydeep Ghosh , Ruchir Puri , José M. F. Moura , Peter Eckersley

Personalized explanation in machine learning: A conceptualization

Explanation in machine learning and related fields such as artificial intelligence aims at making machine learning models and their decisions understandable to humans. Existing work suggests that personalizing explanations might help to…

Machine Learning · Computer Science 2019-04-29 Johanes Schneider , Joshua Handali

Model Learning with Personalized Interpretability Estimation (ML-PIE)

High-stakes applications require AI-generated models to be interpretable. Current algorithms for the synthesis of potentially interpretable models rely on objectives or regularization terms that represent interpretability only coarsely…

Machine Learning · Computer Science 2021-04-28 Marco Virgolin , Andrea De Lorenzo , Francesca Randone , Eric Medvet , Mattias Wahde

Semantic Explanations of Predictions

The main objective of explanations is to transmit knowledge to humans. This work proposes to construct informative explanations for predictions made from machine learning models. Motivated by the observations from social sciences, our…

Artificial Intelligence · Computer Science 2018-05-29 Freddy Lecue , Jiewen Wu