Improving Interpretability via Explicit Word Interaction Graph Layer

Arshdeep Sekhon; Hanjie Chen; Aman Shrivastava; Zhe Wang; Yangfeng Ji; Yanjun Qi

Improving Interpretability via Explicit Word Interaction Graph Layer

Computation and Language 2023-02-07 v1 Artificial Intelligence

Authors: Arshdeep Sekhon , Hanjie Chen , Aman Shrivastava , Zhe Wang , Yangfeng Ji , Yanjun Qi

Abstract

Recent NLP literature has seen growing interest in improving model interpretability. Along this direction, we propose a trainable neural network layer that learns a global interaction graph between words and then selects more informative words using the learned word interactions. Our layer, we call WIGRAPH, can plug into any neural network-based NLP text classifiers right after its word embedding layer. Across multiple SOTA NLP models and various NLP datasets, we demonstrate that adding the WIGRAPH layer substantially improves NLP models' interpretability and enhances models' prediction performance at the same time.

Keywords

graph representation learning knowledge graph interpretable machine learning

Cite

@article{arxiv.2302.02016,
  title  = {Improving Interpretability via Explicit Word Interaction Graph Layer},
  author = {Arshdeep Sekhon and Hanjie Chen and Aman Shrivastava and Zhe Wang and Yangfeng Ji and Yanjun Qi},
  journal= {arXiv preprint arXiv:2302.02016},
  year   = {2023}
}

Comments

15 pages, AAAI 2023

Improving Interpretability via Explicit Word Interaction Graph Layer

Abstract

Keywords

Cite

Comments

Related papers