English

Learning Explanations from Language Data

Computation and Language 2018-08-14 v1 Machine Learning

Abstract

PatternAttribution is a recent method, introduced in the vision domain, that explains classifications of deep neural networks. We demonstrate that it also generates meaningful interpretations in the language domain.

Keywords

Cite

@article{arxiv.1808.04127,
  title  = {Learning Explanations from Language Data},
  author = {David Harbecke and Robert Schwarzenberg and Christoph Alt},
  journal= {arXiv preprint arXiv:1808.04127},
  year   = {2018}
}

Comments

Appears in 2018 EMNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP)

R2 v1 2026-06-23T03:31:49.230Z