Related papers: Identifying the Most Explainable Classifier

Probabilistic Sufficient Explanations

Understanding the behavior of learned classifiers is an important task, and various black-box explanations, logical reasoning approaches, and model-specific methods have been proposed. In this paper, we introduce probabilistic sufficient…

Machine Learning · Computer Science 2021-05-24 Eric Wang, Pasha Khosravi, Guy Van den Broeck

Explainable AI for Classification using Probabilistic Logic Inference

The overarching goal of Explainable AI is to develop systems that not only exhibit intelligent behaviours, but also are able to explain their rationale and reveal insights. In explainable machine learning, methods that produce a high level…

Artificial Intelligence · Computer Science 2020-05-06 Xiuyi Fan, Siyuan Liu, Thomas C. Henderson

How to Explain Individual Classification Decisions

After building a classifier with modern tools of machine learning we typically have a black box at hand that is able to predict well for unseen data. Thus, we get an answer to the question what is the most likely label of a given unseen…

Machine Learning · Statistics 2009-12-08 David Baehrens, Timon Schroeter, Stefan Harmeling, Motoaki Kawanabe, Katja Hansen, Klaus-Robert Mueller

The Shape of Explanations: A Topological Account of Rule-Based Explanations in Machine Learning

Rule-based explanations provide simple reasons explaining the behavior of machine learning classifiers at given points in the feature space. Several recent methods (Anchors, LORE, etc.) purport to generate rule-based explanations for…

Machine Learning · Computer Science 2023-01-24 Brett Mullins

Optimal Explanations of Linear Models

When predictive models are used to support complex and important decisions, the ability to explain a model's reasoning can increase trust, expose hidden biases, and reduce vulnerability to adversarial attacks. However, attempts at…

Machine Learning · Computer Science 2019-07-11 Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

Feature Necessity & Relevancy in ML Classifier Explanations

Given a machine learning (ML) model and a prediction, explanations can be defined as sets of features which are sufficient for the prediction. In some applications, and besides asking for an explanation, it is also critical to understand…

Machine Learning · Computer Science 2023-02-08 Xuanxiang Huang, Martin C. Cooper, Antonio Morgado, Jordi Planes, Joao Marques-Silva

An Information-Theoretic Approach to Personalized Explainable Machine Learning

Automated decision making is used routinely throughout our everyday life. Recommender systems decide which jobs, movies, or other user profiles might be interesting to us. Spell checkers help us to make good use of language. Fraud detection…

Machine Learning · Computer Science 2020-07-15 Alexander Jung, Pedro H. J. Nardelli

An AI Architecture with the Capability to Explain Recognition Results

Explainability is needed to establish confidence in machine learning results. Some explainable methods take a post hoc approach to explain the weights of machine learning models, others highlight areas of the input contributing to…

Machine Learning · Computer Science 2024-07-15 Paul Whitten, Francis Wolff, Chris Papachristou

Explainability Is in the Mind of the Beholder: Establishing the Foundations of Explainable Artificial Intelligence

Explainable artificial intelligence and interpretable machine learning are research domains growing in importance. Yet, the underlying concepts remain somewhat elusive and lack generally agreed definitions. While recent inspiration from…

Artificial Intelligence · Computer Science 2022-09-12 Kacper Sokol, Peter Flach

Explainable Deep Classification Models for Domain Generalization

Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Andrea Zunino, Sarah Adel Bargal, Riccardo Volpi, Mehrnoosh Sameki, Jianming Zhang, Stan Sclaroff, Vittorio Murino, Kate Saenko

Explaining the Explainer: A First Theoretical Analysis of LIME

Machine learning is used more and more often for sensitive applications, sometimes replacing humans in critical decision-making processes. As such, interpretability of these algorithms is a pressing need. One popular algorithm to provide…

Machine Learning · Computer Science 2020-01-14 Damien Garreau, Ulrike von Luxburg

On the Relationship Between Interpretability and Explainability in Machine Learning

Interpretability and explainability have gained more and more attention in the field of machine learning as they are crucial when it comes to high-stakes decisions and troubleshooting. Since both provide information about predictors and…

Machine Learning · Computer Science 2024-04-26 Benjamin Leblanc, Pascal Germain

Logic Explanation of AI Classifiers by Categorical Explaining Functors

The most common methods in explainable artificial intelligence are post-hoc techniques which identify the most relevant features used by pretrained opaque models. Some of the most advanced post hoc methods can generate explanations that…

Artificial Intelligence · Computer Science 2026-03-11 Stefano Fioravanti, Francesco Giannini, Paolo Frazzetto, Fabio Zanasi, Pietro Barbiero

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a…

Machine Learning · Computer Science 2016-08-10 Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

Towards Explainable NLP: A Generative Explanation Framework for Text Classification

Building explainable systems is a critical problem in the field of Natural Language Processing (NLP), since most machine learning models provide no explanations for the predictions. Existing approaches for explainable machine learning…

Computation and Language · Computer Science 2019-06-12 Hui Liu, Qingyu Yin, William Yang Wang

Interpretable Explanations of Black Boxes by Meaningful Perturbation

As machine learning algorithms are increasingly applied to high impact yet high risk tasks, such as medical diagnosis or autonomous driving, it is critical that researchers can explain how such algorithms arrived at their predictions. In…

Computer Vision and Pattern Recognition · Computer Science 2021-12-06 Ruth Fong, Andrea Vedaldi

Personalized explanation in machine learning: A conceptualization

Explanation in machine learning and related fields such as artificial intelligence aims at making machine learning models and their decisions understandable to humans. Existing work suggests that personalizing explanations might help to…

Machine Learning · Computer Science 2019-04-29 Johannes Schneider, Joshua Handali

An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Machine Learning · Computer Science 2019-08-30 Isaac Lage, Emily Chen, Jeffrey He, Menaka Narayanan, Been Kim, Sam Gershman, Finale Doshi-Velez

Computing Rule-Based Explanations of Machine Learning Classifiers using Knowledge Graphs

The use of symbolic knowledge representation and reasoning as a way to resolve the lack of transparency of machine learning classifiers is a research area that lately attracts many researchers. In this work, we use knowledge graphs as the…

Artificial Intelligence · Computer Science 2022-02-09 Edmund Dervakos, Orfeas Menis-Mastromichalakis, Alexandros Chortaras, Giorgos Stamou

An Additive Instance-Wise Approach to Multi-class Model Interpretation

Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main…

Machine Learning · Computer Science 2023-06-02 Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung