Related papers: Evaluating Explainable AI: Which Algorithmic Expla…

Evaluating Local Explainability Metrics for Machine Learning Models on Tabular Data

Despite the wide use of explainability techniques to attempt to understand the behavior of Artificial Intelligence (AI), the generated explanations may not always be reliable. An explanation can appear plausible to humans but fail to…

Machine Learning · Computer Science 2026-05-28 Tomás Pereira , João Vitorino , Eva Maia , Isabel Praça

OptiLIME: Optimized LIME Explanations for Diagnostic Computer Algorithms

Local Interpretable Model-Agnostic Explanations (LIME) is a popular method to perform interpretability of any kind of Machine Learning (ML) model. It explains one ML prediction at a time, by learning a simple linear model around the…

Machine Learning · Computer Science 2022-02-09 Giorgio Visani , Enrico Bagli , Federico Chesani

Explaining the Explainer: A First Theoretical Analysis of LIME

Machine learning is used more and more often for sensitive applications, sometimes replacing humans in critical decision-making processes. As such, interpretability of these algorithms is a pressing need. One popular algorithm to provide…

Machine Learning · Computer Science 2020-01-14 Damien Garreau , Ulrike von Luxburg

Green LIME: Improving AI Explainability through Design of Experiments

In artificial intelligence (AI), the complexity of many models and processes surpasses human understanding, making it challenging to determine why a specific prediction is made. This lack of transparency is particularly problematic in…

Machine Learning · Statistics 2025-06-30 Alexandra Stadler , Werner G. Müller , Radoslav Harman

"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

Methods for interpreting machine learning black-box models increase the outcomes' transparency and in turn generates insight into the reliability and fairness of the algorithms. However, the interpretations themselves could contain…

Machine Learning · Computer Science 2019-06-05 Yujia Zhang , Kuangyan Song , Yiming Sun , Sarah Tan , Madeleine Udell

ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Concept-based explanations work by mapping complex model computations to human-understandable concepts. Evaluating such explanations is very difficult, as it includes not only the quality of the induced space of possible concepts but also…

Computation and Language · Computer Science 2025-06-05 Antonin Poché , Alon Jacovi , Agustin Martin Picard , Victor Boutin , Fanny Jourdan

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a…

Machine Learning · Computer Science 2016-08-10 Marco Tulio Ribeiro , Sameer Singh , Carlos Guestrin

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this…

Artificial Intelligence · Computer Science 2017-08-29 Wojciech Samek , Thomas Wiegand , Klaus-Robert Müller

Explainability for Machine Learning Models: From Data Adaptability to User Perception

This thesis explores the generation of local explanations for already deployed machine learning models, aiming to identify optimal conditions for producing meaningful explanations considering both data and user requirements. The primary…

Artificial Intelligence · Computer Science 2024-02-19 julien Delaunay

Explainability in Neural Networks for Natural Language Processing Tasks

Neural networks are widely regarded as black-box models, creating significant challenges in understanding their inner workings, especially in natural language processing (NLP) applications. To address this opacity, model explanation…

Computation and Language · Computer Science 2025-01-10 Melkamu Mersha , Mingiziem Bitewa , Tsion Abay , Jugal Kalita

Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations

Human-annotated labels and explanations are critical for training explainable NLP models. However, unlike human-annotated labels whose quality is easier to calibrate (e.g., with a majority vote), human-crafted free-form explanations can be…

Computation and Language · Computer Science 2023-05-23 Bingsheng Yao , Prithviraj Sen , Lucian Popa , James Hendler , Dakuo Wang

Altruist: Argumentative Explanations through Local Interpretations of Predictive Models

Explainable AI is an emerging field providing solutions for acquiring insights into automated systems' rationale. It has been put on the AI map by suggesting ways to tackle key ethical and societal issues. Existing explanation techniques…

Machine Learning · Computer Science 2022-05-02 Ioannis Mollas , Nick Bassiliades , Grigorios Tsoumakas

Model-Agnostic Interpretability of Machine Learning

Understanding why machine learning models behave the way they do empowers both system designers and end-users in many ways: in model selection, feature engineering, in order to trust and act upon the predictions, and in more intuitive user…

Machine Learning · Statistics 2016-06-20 Marco Tulio Ribeiro , Sameer Singh , Carlos Guestrin

An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Machine Learning · Computer Science 2019-08-30 Isaac Lage , Emily Chen , Jeffrey He , Menaka Narayanan , Been Kim , Sam Gershman , Finale Doshi-Velez

Explainable AI Enabled Inspection of Business Process Prediction Models

Modern data analytics underpinned by machine learning techniques has become a key enabler to the automation of data-led decision making. As an important branch of state-of-the-art data analytics, business process predictions are also faced…

Artificial Intelligence · Computer Science 2021-07-22 Chun Ouyang , Renuka Sindhgatta , Catarina Moreira

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack , Sorelle A. Friedler , Carlos Scheidegger , Chitradeep Dutta Roy

Can Explanations Improve Recommendations? Evidence from Prediction-Informed Explanations

Recommender systems are central to digital platforms, yet they face a fundamental trade-off between accuracy and explainability. Black-box models achieve strong performance but lack interpretability needed for trust and adoption. Existing…

Information Retrieval · Computer Science 2026-05-05 Yuyan Wang , Pan Li , Minmin Chen

Robustness and Usefulness in AI Explanation Methods

Explainability in machine learning has become incredibly important as machine learning-powered systems become ubiquitous and both regulation and public sentiment begin to demand an understanding of how these systems make decisions. As a…

Machine Learning · Computer Science 2022-03-09 Erick Galinkin

DLIME: A Deterministic Local Interpretable Model-Agnostic Explanations Approach for Computer-Aided Diagnosis Systems

Local Interpretable Model-Agnostic Explanations (LIME) is a popular technique used to increase the interpretability and explainability of black box Machine Learning (ML) algorithms. LIME typically generates an explanation for a single…

Machine Learning · Computer Science 2019-06-26 Muhammad Rehman Zafar , Naimul Mefraz Khan

Do LLM Self-Explanations Help Users Predict Model Behavior? Evaluating Counterfactual Simulatability with Pragmatic Perturbations

Large Language Models (LLMs) can produce verbalized self-explanations, yet prior studies suggest that such rationales may not reliably reflect the model's true decision process. We ask whether these explanations nevertheless help users…

Computation and Language · Computer Science 2026-01-08 Pingjun Hong , Benjamin Roth