Amid a discussion about Green AI in which we see explainability neglected, we explore the possibility to efficiently approximate computationally expensive explainers. To this end, we propose feature attribution modelling with Empirical Explainers. Empirical Explainers learn from data to predict the attribution maps of expensive explainers. We train and test Empirical Explainers in the language domain and find that they model their expensive counterparts surprisingly well, at a fraction of the cost. They could thus mitigate the computational burden of neural explanations significantly, in applications that tolerate an approximation error.
@article{arxiv.2103.15429,
title = {Efficient Explanations from Empirical Explainers},
author = {Robert Schwarzenberg and Nils Feldhus and Sebastian Möller},
journal= {arXiv preprint arXiv:2103.15429},
year = {2021}
}
Comments
Accepted to the EMNLP 2021 Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP)