Contrastive Explanations for Explaining Model Adaptations

André Artelt; Fabian Hinder; Valerie Vaquet; Robert Feldhans; Barbara Hammer

Contrastive Explanations for Explaining Model Adaptations

Machine Learning 2021-04-08 v2 Artificial Intelligence

Authors: André Artelt , Fabian Hinder , Valerie Vaquet , Robert Feldhans , Barbara Hammer

Abstract

Many decision making systems deployed in the real world are not static - a phenomenon known as model adaptation takes place over time. The need for transparency and interpretability of AI-based decision models is widely accepted and thus have been worked on extensively. Usually, explanation methods assume a static system that has to be explained. Explaining non-static systems is still an open research question, which poses the challenge how to explain model adaptations. In this contribution, we propose and (empirically) evaluate a framework for explaining model adaptations by contrastive explanations. We also propose a method for automatically finding regions in data space that are affected by a given model adaptation and thus should be explained.

Keywords

explainable artificial intelligence counterfactual explanation interpretable machine learning

Cite

@article{arxiv.2104.02459,
  title  = {Contrastive Explanations for Explaining Model Adaptations},
  author = {André Artelt and Fabian Hinder and Valerie Vaquet and Robert Feldhans and Barbara Hammer},
  journal= {arXiv preprint arXiv:2104.02459},
  year   = {2021}
}

Comments

Fix some typos

Contrastive Explanations for Explaining Model Adaptations

Abstract

Keywords

Cite

Comments

Related papers