English

Pattern-Guided Integrated Gradients

Machine Learning 2020-09-02 v2 Machine Learning

Abstract

Integrated Gradients (IG) and PatternAttribution (PA) are two established explainability methods for neural networks. Both methods are theoretically well-founded. However, they were designed to overcome different challenges. In this work, we combine the two methods into a new method, Pattern-Guided Integrated Gradients (PGIG). PGIG inherits important properties from both parent methods and passes stress tests that the originals fail. In addition, we benchmark PGIG against nine alternative explainability approaches (including its parent methods) in a large-scale image degradation experiment and find that it outperforms all of them.

Cite

@article{arxiv.2007.10685,
  title  = {Pattern-Guided Integrated Gradients},
  author = {Robert Schwarzenberg and Steffen Castle},
  journal= {arXiv preprint arXiv:2007.10685},
  year   = {2020}
}

Comments

Presented at the ICML 2020 Workshop on Human Interpretability in Machine Learning (WHI)

R2 v1 2026-06-23T17:16:29.788Z