Related papers: Aggregating explanation methods for stable and rob…

Robustness of Explanation Methods for NLP Models

Explanation methods have emerged as an important tool to highlight the features responsible for the predictions of neural networks. There is mounting evidence that many explanation methods are rather unreliable and susceptible to malicious…

Computation and Language · Computer Science 2022-06-27 Shriya Atmakuri , Tejas Chheda , Dinesh Kandula , Nishant Yadav , Taesung Lee , Hessel Tuinhof

Do Explanations Explain? Model Knows Best

It is a mystery which input features contribute to a neural network's output. Various explanation (feature attribution) methods are proposed in the literature to shed light on the problem. One peculiar observation is that these explanations…

Machine Learning · Computer Science 2022-03-07 Ashkan Khakzar , Pedram Khorsandi , Rozhin Nobahari , Nassir Navab

Fidelity of Ensemble Aggregation for Saliency Map Explanations using Bayesian Optimization Techniques

In recent years, an abundance of feature attribution methods for explaining neural networks have been developed. Especially in the field of computer vision, many methods for generating saliency maps providing pixel attributions exist.…

Computer Vision and Pattern Recognition · Computer Science 2022-07-06 Yannik Mahlau , Christian Nolde

On the Robustness of Explanations of Deep Neural Network Models: A Survey

Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains,…

Machine Learning · Computer Science 2022-11-10 Amlan Jyoti , Karthik Balaji Ganesh , Manoj Gayala , Nandita Lakshmi Tunuguntla , Sandesh Kamath , Vineeth N Balasubramanian

Accelerating the Global Aggregation of Local Explanations

Local explanation methods highlight the input tokens that have a considerable impact on the outcome of classifying the document at hand. For example, the Anchor algorithm applies a statistical analysis of the sensitivity of the classifier…

Machine Learning · Computer Science 2024-01-15 Alon Mor , Yonatan Belinkov , Benny Kimelfeld

Generative Explanations for Graph Neural Network: Methods and Evaluations

Graph Neural Networks (GNNs) achieve state-of-the-art performance in various graph-related tasks. However, the black-box nature often limits their interpretability and trustworthiness. Numerous explainability methods have been proposed to…

Machine Learning · Computer Science 2023-11-13 Jialin Chen , Kenza Amara , Junchi Yu , Rex Ying

Axiomatic Aggregations of Abductive Explanations

The recent criticisms of the robustness of post hoc model approximation explanation methods (like LIME and SHAP) have led to the rise of model-precise abductive explanations. For each data point, abductive explanations provide a minimal…

Artificial Intelligence · Computer Science 2023-10-13 Gagan Biradar , Yacine Izza , Elita Lobo , Vignesh Viswanathan , Yair Zick

Neural network ensembles: Evaluation of aggregation algorithms

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto , P. F. Verdes , H. A. Ceccatto

An AI Architecture with the Capability to Explain Recognition Results

Explainability is needed to establish confidence in machine learning results. Some explainable methods take a post hoc approach to explain the weights of machine learning models, others highlight areas of the input contributing to…

Machine Learning · Computer Science 2024-07-15 Paul Whitten , Francis Wolff , Chris Papachristou

Provably Better Explanations with Optimized Aggregation of Feature Attributions

Using feature attributions for post-hoc explanations is a common practice to understand and verify the predictions of opaque machine learning models. Despite the numerous techniques available, individual methods often produce inconsistent…

Machine Learning · Computer Science 2024-06-10 Thomas Decker , Ananta R. Bhattarai , Jindong Gu , Volker Tresp , Florian Buettner

Explanations can be manipulated and geometry is to blame

Explanation methods aim to make neural networks more trustworthy and interpretable. In this paper, we demonstrate a property of explanation methods which is disconcerting for both of these purposes. Namely, we show that explanations can be…

Machine Learning · Statistics 2019-09-26 Ann-Kathrin Dombrowski , Maximilian Alber , Christopher J. Anders , Marcel Ackermann , Klaus-Robert Müller , Pan Kessel

Evaluating Explanation Methods for Deep Learning in Security

Deep learning is increasingly used as a building block of security systems. Unfortunately, neural networks are hard to interpret and typically opaque to the practitioner. The machine learning community has started to address this problem by…

Machine Learning · Computer Science 2020-04-28 Alexander Warnecke , Daniel Arp , Christian Wressnegger , Konrad Rieck

Robust Explainability: A Tutorial on Gradient-Based Attribution Methods for Deep Neural Networks

With the rise of deep neural networks, the challenge of explaining the predictions of these networks has become increasingly recognized. While many methods for explaining the decisions of deep neural networks exist, there is currently no…

Machine Learning · Computer Science 2022-07-13 Ian E. Nielsen , Dimah Dera , Ghulam Rasool , Nidhal Bouaynaya , Ravi P. Ramachandran

When Can You Trust Your Explanations? A Robustness Analysis on Feature Importances

Recent legislative regulations have underlined the need for accountable and transparent artificial intelligence systems and have contributed to a growing interest in the Explainable Artificial Intelligence (XAI) field. Nonetheless, the lack…

Machine Learning · Computer Science 2025-10-14 Ilaria Vascotto , Alex Rodriguez , Alessandro Bonaita , Luca Bortolussi

Robust Explanation Constraints for Neural Networks

Post-hoc explanation methods are used with the intent of providing insights about neural networks and are sometimes said to help engender trust in their outputs. However, popular explanations methods have been found to be fragile to minor…

Machine Learning · Computer Science 2022-12-19 Matthew Wicker , Juyeon Heo , Luca Costabello , Adrian Weller

Measuring and improving the quality of visual explanations

The ability of to explain neural network decisions goes hand in hand with their safe deployment. Several methods have been proposed to highlight features important for a given network decision. However, there is no consensus on how to…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Agnieszka Grabska-Barwińska

Ensemble Interpretation: A Unified Method for Interpretable Machine Learning

To address the issues of stability and fidelity in interpretable learning, a novel interpretable methodology, ensemble interpretation, is presented in this paper which integrates multi-perspective explanation of various interpretation…

Machine Learning · Computer Science 2023-12-12 Chao Min , Guoyong Liao , Guoquan Wen , Yingjun Li , Xing Guo

A survey on improving NLP models with human explanations

Training a model with access to human explanations can improve data efficiency and model performance on in- and out-of-domain data. Adding to these empirical findings, similarity with the process of human learning makes learning from…

Computation and Language · Computer Science 2022-04-20 Mareike Hartmann , Daniel Sonntag

Explaining, Evaluating and Enhancing Neural Networks' Learned Representations

Most efforts in interpretability in deep learning have focused on (1) extracting explanations of a specific downstream task in relation to the input features and (2) imposing constraints on the model, often at the expense of predictive…

Machine Learning · Computer Science 2022-02-22 Marco Bertolini , Djork-Arné Clevert , Floriane Montanari

Experimental Assessment of Aggregation Principles in Argumentation-enabled Collective Intelligence

On the Web, there is always a need to aggregate opinions from the crowd (as in posts, social networks, forums, etc.). Different mechanisms have been implemented to capture these opinions such as "Like" in Facebook, "Favorite" in Twitter,…

Artificial Intelligence · Computer Science 2017-06-20 Edmond Awad , Jean-François Bonnefon , Martin Caminada , Thomas Malone , Iyad Rahwan