Related papers: StyLEx: Explaining Style Using Human Lexical Annot…

LIREx: Augmenting Language Inference with Relevant Explanation

Natural language explanations (NLEs) are a special form of data annotation in which annotators identify rationales (most significant text tokens) when assigning labels to data instances, and write out explanations for the labels in natural…

Computation and Language · Computer Science 2020-12-17 Xinyan Zhao, V. G. Vinod Vydiswaran

Refining Language Models with Compositional Explanations

Pre-trained language models have been successful on text classification tasks, but are prone to learning spurious correlations from biased datasets, and are thus vulnerable when making inferences in a new domain. Prior work reveals such…

Computation and Language · Computer Science 2022-01-03 Huihan Yao, Ying Chen, Qinyuan Ye, Xisen Jin, Xiang Ren

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

In the rapidly evolving field of Explainable Natural Language Processing (NLP), textual explanations, i.e., human-like rationales, are pivotal for explaining model predictions and enriching datasets with interpretable labels. Traditional…

Computation and Language · Computer Science 2025-11-12 Mahdi Dhaini, Juraj Vladika, Ege Erdogan, Zineb Attaoui, Gjergji Kasneci

Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations

Human-annotated labels and explanations are critical for training explainable NLP models. However, unlike human-annotated labels whose quality is easier to calibrate (e.g., with a majority vote), human-crafted free-form explanations can be…

Computation and Language · Computer Science 2023-05-23 Bingsheng Yao, Prithviraj Sen, Lucian Popa, James Hendler, Dakuo Wang

When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data

Many methods now exist for conditioning model outputs on task instructions, retrieved documents, and user-provided explanations and feedback. Rather than relying solely on examples of task inputs and outputs, these approaches use valuable…

Computation and Language · Computer Science 2021-02-12 Peter Hase, Mohit Bansal

Explaining in Style: Training a GAN to explain a classifier in StyleSpace

Image classification models can depend on multiple different semantic attributes of the image. An explanation of the decision of the classifier needs to both discover and visualize these properties. Here we present StylEx, a method for…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri

A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models

There is growing interest in incorporating eye-tracking data and other implicit measures of human language processing into natural language processing (NLP) pipelines. The data from human language processing contain unique insight into…

Computation and Language · Computer Science 2023-10-24 Karin de Langis, Dongyeop Kang

A Lexical, Syntactic, and Semantic Perspective for Understanding Style in Text

With a growing interest in modeling inherent subjectivity in natural language, we present a linguistically-motivated process to understand and analyze the writing style of individuals from three perspectives: lexical, syntactic, and…

Computation and Language · Computer Science 2019-09-19 Gaurav Verma, Balaji Vasan Srinivasan

Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations

Natural Language Inference (NLI) datasets often exhibit human label variation. To better understand these variations, explanation-based approaches analyze the underlying reasoning behind annotators' decisions. One such approach is the LiTEx…

Computation and Language · Computer Science 2026-04-21 Pingjun Hong, Beiduo Chen, Siyao Peng, Marie-Catherine de Marneffe, Benjamin Roth, Barbara Plank

A Diagnostic Study of Explainability Techniques for Text Classification

Recent developments in machine learning have introduced models that approach human performance at the cost of increased architectural complexity. Efforts to make the rationales behind the models' predictions transparent have inspired an…

Computation and Language · Computer Science 2020-09-29 Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

e-SNLI: Natural Language Inference with Natural Language Explanations

In order for machine learning to garner widespread public adoption, models must be able to provide interpretable and robust explanations for their decisions, as well as learn from human-provided explanations at train time. In this work, we…

Computation and Language · Computer Science 2018-12-07 Oana-Maria Camburu, Tim Rocktäschel, Thomas Lukasiewicz, Phil Blunsom

Supervising Model Attention with Human Explanations for Robust Natural Language Inference

Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well they generalise to other unseen datasets. Existing de-biasing approaches focus on preventing the models from…

Computation and Language · Computer Science 2022-05-03 Joe Stacey, Yonatan Belinkov, Marek Rei

Using Natural Language Explanations to Rescale Human Judgments

The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled data, particularly for processes like human feedback and evaluation. A common practice is to label data via consensus annotation over human…

Computation and Language · Computer Science 2025-06-23 Manya Wadhwa, Jifan Chen, Junyi Jessy Li, Greg Durrett

A survey on improving NLP models with human explanations

Training a model with access to human explanations can improve data efficiency and model performance on in- and out-of-domain data. Adding to these empirical findings, similarity with the process of human learning makes learning from…

Computation and Language · Computer Science 2022-04-20 Mareike Hartmann, Daniel Sonntag

LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference

There is increasing evidence of Human Label Variation (HLV) in Natural Language Inference (NLI), where annotators assign different labels to the same premise-hypothesis pair. However, within-label variation--cases where annotators agree on…

Computation and Language · Computer Science 2025-10-09 Pingjun Hong, Beiduo Chen, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Large-scale pretrained language models are the major driving force behind recent improvements in performance on the Winograd Schema Challenge, a widely employed test of common sense reasoning ability. We show, however, with a new diagnostic…

Computation and Language · Computer Science 2020-05-08 Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott, Anders Søgaard

Modeling Semantic Expectation: Using Script Knowledge for Referent Prediction

Recent research in psycholinguistics has provided increasing evidence that humans predict upcoming content. Prediction also affects perception and might be a key to robustness in human language processing. In this paper, we investigate the…

Computation and Language · Computer Science 2017-02-13 Ashutosh Modi, Ivan Titov, Vera Demberg, Asad Sayeed, Manfred Pinkal

Are Large Language Models Reliable Argument Quality Annotators?

Evaluating the quality of arguments is a crucial aspect of any system leveraging argument mining. However, it is a challenge to obtain reliable and consistent annotations regarding argument quality, as this usually requires domain-specific…

Computation and Language · Computer Science 2024-04-16 Nailia Mirzakhmedova, Marcel Gohsen, Chia Hao Chang, Benno Stein

Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

Current Natural Language Inference (NLI) models achieve impressive results, sometimes outperforming humans when evaluating on in-distribution test sets. However, as these models are known to learn from annotation artefacts and dataset…

Computation and Language · Computer Science 2022-10-24 Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Marek Rei

Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement

Generative language models (LMs) are increasingly used for document class-prediction tasks and promise enormous improvements in cost and efficiency. Existing research often examines simple classification tasks, but the capability of LMs to…

Computation and Language · Computer Science 2023-10-31 Rosamond Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens