Explainable Deep Classification Models for Domain Generalization

Andrea Zunino; Sarah Adel Bargal; Riccardo Volpi; Mehrnoosh Sameki; Jianming Zhang; Stan Sclaroff; Vittorio Murino; Kate Saenko

Explainable Deep Classification Models for Domain Generalization

Computer Vision and Pattern Recognition 2020-03-17 v1

Authors: Andrea Zunino , Sarah Adel Bargal , Riccardo Volpi , Mehrnoosh Sameki , Jianming Zhang , Stan Sclaroff , Vittorio Murino , Kate Saenko

View on arXiv ↗ PDF ↗

Abstract

Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible accuracy degradation. Explanations are defined as regions of visual evidence upon which a deep classification network makes a decision. This is represented in the form of a saliency map conveying how much each pixel contributed to the network's decision. Our training strategy enforces a periodic saliency-based feedback to encourage the model to focus on the image regions that directly correspond to the ground-truth object. We quantify explainability using an automated metric, and using human judgement. We propose explainability as a means for bridging the visual-semantic gap between different domains where model explanations are used as a means of disentagling domain specific information from otherwise relevant features. We demonstrate that this leads to improved generalization to new domains without hindering performance on the original domain.

Keywords

explainable artificial intelligence interpretable machine learning interpretability

Cite

@article{arxiv.2003.06498,
  title  = {Explainable Deep Classification Models for Domain Generalization},
  author = {Andrea Zunino and Sarah Adel Bargal and Riccardo Volpi and Mehrnoosh Sameki and Jianming Zhang and Stan Sclaroff and Vittorio Murino and Kate Saenko},
  journal= {arXiv preprint arXiv:2003.06498},
  year   = {2020}
}

Explainable Deep Classification Models for Domain Generalization

Abstract

Keywords

Cite

Related papers