Related papers: SPECTRA: Sparse Structured Text Rationalization

Rationalizing Neural Predictions

Prediction without justification has limited applicability. As a remedy, we learn to extract pieces of input text as justifications -- rationales -- that are tailored to be short and coherent, yet sufficient for making the same prediction.…

Computation and Language · Computer Science 2016-11-04 Tao Lei , Regina Barzilay , Tommi Jaakkola

Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery

The remarkable success in neural networks provokes the selective rationalization. It explains the prediction results by identifying a small subset of the inputs sufficient to support them. Since existing methods still suffer from adopting…

Machine Learning · Computer Science 2024-07-22 Linan Yue , Qi Liu , Yichao Du , Li Wang , Weibo Gao , Yanqing An

Conducting sparse feature selection on arbitrarily long phrases in text corpora with a focus on interpretability

We propose a general framework for topic-specific summarization of large text corpora, and illustrate how it can be used for analysis in two quite different contexts: an OSHA database of fatality and catastrophe reports (to facilitate…

Computation and Language · Computer Science 2016-07-26 Luke Miratrix , Robin Ackerman

Towards Trustworthy Explanation: On Causal Rationalization

With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction. Yet,…

Machine Learning · Computer Science 2023-09-12 Wenbo Zhang , Tong Wu , Yunlong Wang , Yong Cai , Hengrui Cai

Model Interpretability and Rationale Extraction by Input Mask Optimization

Concurrent to the rapid progress in the development of neural-network based models in areas like natural language processing and computer vision, the need for creating explanations for the predictions of these black-box models has risen…

Computation and Language · Computer Science 2025-08-18 Marc Brinner , Sina Zarriess

Flexible Instance-Specific Rationalization of NLP Models

Recent research on model interpretability in natural language processing extensively uses feature scoring methods for identifying which parts of the input are the most important for a model to make a prediction (i.e. explanation or…

Computation and Language · Computer Science 2021-12-07 George Chrysostomou , Nikolaos Aletras

Selective Factor Extraction in High Dimensions

This paper studies simultaneous feature selection and extraction in supervised and unsupervised learning. We propose and investigate selective reduced rank regression for constructing optimal explanatory factors from a parsimonious subset…

Methodology · Statistics 2016-10-27 Yiyuan She

Rationalization through Concepts

Automated predictions require explanations to be interpretable by humans. One type of explanation is a rationale, i.e., a selection of input features such as relevant text snippets from which the model computes the outcome. However, a…

Computation and Language · Computer Science 2021-05-12 Diego Antognini , Boi Faltings

Interpretable Neural Predictions with Differentiable Binary Variables

The success of neural networks comes hand in hand with a desire for more interpretability. We focus on text classifiers and make them more interpretable by having them provide a justification, a rationale, for their predictions. We approach…

Computation and Language · Computer Science 2020-06-22 Jasmijn Bastings , Wilker Aziz , Ivan Titov

Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

Selective rationalization has become a common mechanism to ensure that predictive models reveal how they use any available features. The selection may be soft or hard, and identifies a subset of input features relevant for prediction. The…

Computation and Language · Computer Science 2019-12-17 Mo Yu , Shiyu Chang , Yang Zhang , Tommi S. Jaakkola

REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization

Human-annotated textual explanations are becoming increasingly important in Explainable Natural Language Processing. Rationale extraction aims to provide faithful (i.e., reflective of the behavior of the model) and plausible (i.e.,…

Computation and Language · Computer Science 2023-10-24 Mohammad Reza Ghasemi Madani , Pasquale Minervini

Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases

Interpretability or explainability is an emerging research field in NLP. From a user-centric point of view, the goal is to build models that provide proper justification for their decisions, similar to those of humans, by requiring the…

Computation and Language · Computer Science 2021-03-25 Ilias Chalkidis , Manos Fergadiotis , Dimitrios Tsarapatsanis , Nikolaos Aletras , Ion Androutsopoulos , Prodromos Malakasiotis

Distribution Matching for Rationalization

The task of rationalization aims to extract pieces of input text as rationales to justify neural network predictions on text classification tasks. By definition, rationales represent key text pieces used for prediction and thus should have…

Computation and Language · Computer Science 2021-06-02 Yongfeng Huang , Yujun Chen , Yulun Du , Zhilin Yang

Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport

Selecting input features of top relevance has become a popular method for building self-explaining models. In this work, we extend this selective rationalization approach to text matching, where the goal is to jointly select and align text…

Machine Learning · Computer Science 2020-05-28 Kyle Swanson , Lili Yu , Tao Lei

Neural Document Summarization by Jointly Learning to Score and Select Sentences

Sentence scoring and sentence selection are two main steps in extractive document summarization systems. However, previous works treat them as two separated subtasks. In this paper, we present a novel end-to-end neural network framework for…

Computation and Language · Computer Science 2018-07-09 Qingyu Zhou , Nan Yang , Furu Wei , Shaohan Huang , Ming Zhou , Tiejun Zhao

Binary Stochastic Filtering: feature selection and beyond

Feature selection is one of the most decisive tools in understanding data and machine learning models. Among other methods, sparsity induced by $L^{1}$ penalty is one of the simplest and best studied approaches to this problem. Although…

Machine Learning · Computer Science 2020-07-09 Andrii Trelin , Aleš Procházka

UNIREX: A Unified Learning Framework for Language Model Rationale Extraction

An extractive rationale explains a language model's (LM's) prediction on a given task instance by highlighting the text inputs that most influenced the prediction. Ideally, rationale extraction should be faithful (reflective of LM's actual…

Computation and Language · Computer Science 2023-02-28 Aaron Chan , Maziar Sanjabi , Lambert Mathias , Liang Tan , Shaoliang Nie , Xiaochang Peng , Xiang Ren , Hamed Firooz

Unsupervised Selective Rationalization with Noise Injection

A major issue with using deep learning models in sensitive applications is that they provide no explanation for their output. To address this problem, unsupervised selective rationalization produces rationales alongside predictions by…

Computation and Language · Computer Science 2023-05-30 Adam Storek , Melanie Subbiah , Kathleen McKeown

Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers

Long-sequence transformers are designed to improve the representation of longer texts by language models and their performance on downstream document-level tasks. However, not much is understood about the quality of token-level predictions…

Computation and Language · Computer Science 2023-03-15 Kamil Bujel , Andrew Caines , Helen Yannakoudakis , Marek Rei

Explainable Neural Networks with Guarantees: A Sparse Estimation Approach

Balancing predictive power and interpretability has long been a challenging research area, particularly in powerful yet complex models like neural networks, where nonlinearity obstructs direct interpretation. This paper introduces a novel…

Machine Learning · Computer Science 2025-02-20 Antoine Ledent , Peng Liu