Related papers: Rationalization through Concepts

Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

Selective rationalization has become a common mechanism to ensure that predictive models reveal how they use any available features. The selection may be soft or hard, and identifies a subset of input features relevant for prediction. The…

Computation and Language · Computer Science 2019-12-17 Mo Yu , Shiyu Chang , Yang Zhang , Tommi S. Jaakkola

Rationalizing Neural Predictions

Prediction without justification has limited applicability. As a remedy, we learn to extract pieces of input text as justifications -- rationales -- that are tailored to be short and coherent, yet sufficient for making the same prediction.…

Computation and Language · Computer Science 2016-11-04 Tao Lei , Regina Barzilay , Tommi Jaakkola

Towards Trustworthy Explanation: On Causal Rationalization

With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction. Yet,…

Machine Learning · Computer Science 2023-09-12 Wenbo Zhang , Tong Wu , Yunlong Wang , Yong Cai , Hengrui Cai

Learnable Game-theoretic Policy Optimization for Data-centric Self-explanation Rationalization

Rationalization, a data-centric framework, aims to build self-explanatory models to explain the prediction outcome by generating a subset of human-intelligible pieces of the input data. It involves a cooperative game model where a generator…

Artificial Intelligence · Computer Science 2025-10-16 Yunxiao Zhao , Zhiqiang Wang , Xingtong Yu , Xiaoli Li , Jiye Liang , Ru Li

Semantic Explanations of Predictions

The main objective of explanations is to transmit knowledge to humans. This work proposes to construct informative explanations for predictions made from machine learning models. Motivated by the observations from social sciences, our…

Artificial Intelligence · Computer Science 2018-05-29 Freddy Lecue , Jiewen Wu

How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Artificial Intelligence · Computer Science 2018-02-05 Menaka Narayanan , Emily Chen , Jeffrey He , Been Kim , Sam Gershman , Finale Doshi-Velez

The Irrationality of Neural Rationale Models

Neural rationale models are popular for interpretable predictions of NLP tasks. In these, a selector extracts segments of the input text, called rationales, and passes these segments to a classifier for prediction. Since the rationale is…

Computation and Language · Computer Science 2022-07-26 Yiming Zheng , Serena Booth , Julie Shah , Yilun Zhou

An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Machine Learning · Computer Science 2019-08-30 Isaac Lage , Emily Chen , Jeffrey He , Menaka Narayanan , Been Kim , Sam Gershman , Finale Doshi-Velez

EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence…

Machine Learning · Computer Science 2019-09-30 Diane Bouchacourt , Ludovic Denoyer

Automated Rationale Generation: A Technique for Explainable AI and its Effects on Human Perceptions

Automated rationale generation is an approach for real-time explanation generation whereby a computational model learns to translate an autonomous agent's internal state and action data representations into natural language. Training on…

Artificial Intelligence · Computer Science 2019-01-15 Upol Ehsan , Pradyumna Tambwekar , Larry Chan , Brent Harrison , Mark Riedl

SPECTRA: Sparse Structured Text Rationalization

Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient…

Computation and Language · Computer Science 2021-09-13 Nuno Miguel Guerreiro , André F. T. Martins

Model Interpretability and Rationale Extraction by Input Mask Optimization

Concurrent to the rapid progress in the development of neural-network based models in areas like natural language processing and computer vision, the need for creating explanations for the predictions of these black-box models has risen…

Computation and Language · Computer Science 2025-08-18 Marc Brinner , Sina Zarriess

Learning from the Best: Rationalizing Prediction by Adversarial Information Calibration

Explaining the predictions of AI models is paramount in safety-critical applications, such as in legal or medical domains. One form of explanation for a prediction is an extractive rationale, i.e., a subset of features of an instance that…

Computation and Language · Computer Science 2020-12-21 Lei Sha , Oana-Maria Camburu , Thomas Lukasiewicz

Do Human Rationales Improve Machine Explanations?

Work on "learning with rationales" shows that humans providing explanations to a machine learning system can improve the system's predictive accuracy. However, this work has not been connected to work in "explainable AI" which concerns…

Computation and Language · Computer Science 2019-06-03 Julia Strout , Ye Zhang , Raymond J. Mooney

Interlock-Free Multi-Aspect Rationalization for Text Classification

Explanation is important for text classification tasks. One prevalent type of explanation is rationales, which are text snippets of input text that suffice to yield the prediction and are meaningful to humans. A lot of research on…

Computation and Language · Computer Science 2022-05-16 Shuangqi Li , Diego Antognini , Boi Faltings

Relative rationality: Is machine rationality subjective?

Rational decision making in its linguistic description means making logical decisions. In essence, a rational agent optimally processes all relevant information to achieve its goal. Rationality has two elements and these are the use of…

Artificial Intelligence · Computer Science 2019-02-14 Tshilidzi Marwala

What to Learn, and How: Toward Effective Learning from Rationales

Learning from rationales seeks to augment model prediction accuracy using human-annotated rationales (i.e. subsets of input tokens) that justify their chosen labels, often in the form of intermediate or multitask supervision. While…

Machine Learning · Computer Science 2022-03-30 Samuel Carton , Surya Kanoria , Chenhao Tan

Enhancing the Rationale-Input Alignment for Self-explaining Rationalization

Rationalization empowers deep learning models with self-explaining capabilities through a cooperative game, where a generator selects a semantically consistent subset of the input as a rationale, and a subsequent predictor makes predictions…

Artificial Intelligence · Computer Science 2023-12-18 Wei Liu , Haozhao Wang , Jun Wang , Zhiying Deng , YuanKai Zhang , Cheng Wang , Ruixuan Li

Distribution Matching for Rationalization

The task of rationalization aims to extract pieces of input text as rationales to justify neural network predictions on text classification tasks. By definition, rationales represent key text pieces used for prediction and thus should have…

Computation and Language · Computer Science 2021-06-02 Yongfeng Huang , Yujun Chen , Yulun Du , Zhilin Yang

ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Concept-based explanations work by mapping complex model computations to human-understandable concepts. Evaluating such explanations is very difficult, as it includes not only the quality of the induced space of possible concepts but also…

Computation and Language · Computer Science 2025-06-05 Antonin Poché , Alon Jacovi , Agustin Martin Picard , Victor Boutin , Fanny Jourdan