Related papers: Evaluating and Characterizing Human Rationales

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales

Among the remarkable emergent capabilities of large language models (LMs) is free-text rationalization; beyond a certain scale, large LMs are capable of generating seemingly useful rationalizations, which in turn, can dramatically enhance…

Computation and Language · Computer Science 2023-05-15 Brihi Joshi , Ziyi Liu , Sahana Ramnath , Aaron Chan , Zhewei Tong , Shaoliang Nie , Qifan Wang , Yejin Choi , Xiang Ren

Rethinking Human Preference Evaluation of LLM Rationales

Large language models (LLMs) often generate natural language rationales -- free-form explanations that help improve performance on complex reasoning tasks and enhance interpretability for human users. However, evaluating these rationales…

Artificial Intelligence · Computer Science 2025-09-16 Ziang Li , Manasi Ganti , Zixian Ma , Helena Vasconcelos , Qijia He , Ranjay Krishna

How Ambiguous Are the Rationales for Natural Language Reasoning? A Simple Approach to Handling Rationale Uncertainty

The quality of rationales is essential in the reasoning capabilities of language models. Rationales not only enhance reasoning performance in complex natural language tasks but also justify model decisions. However, obtaining impeccable…

Computation and Language · Computer Science 2025-03-05 Hazel H. Kim

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability

Training language models with rationales augmentation has been shown to be beneficial in many existing works. In this paper, we identify that such a prevailing view does not hold consistently. We conduct comprehensive investigations to…

Computation and Language · Computer Science 2025-06-02 Chiwei Zhu , Benfeng Xu , An Yang , Junyang Lin , Quan Wang , Chang Zhou , Zhendong Mao

Do Human Rationales Improve Machine Explanations?

Work on "learning with rationales" shows that humans providing explanations to a machine learning system can improve the system's predictive accuracy. However, this work has not been connected to work in "explainable AI" which concerns…

Computation and Language · Computer Science 2019-06-03 Julia Strout , Ye Zhang , Raymond J. Mooney

What to Learn, and How: Toward Effective Learning from Rationales

Learning from rationales seeks to augment model prediction accuracy using human-annotated rationales (i.e. subsets of input tokens) that justify their chosen labels, often in the form of intermediate or multitask supervision. While…

Machine Learning · Computer Science 2022-03-30 Samuel Carton , Surya Kanoria , Chenhao Tan

Relative rationality: Is machine rationality subjective?

Rational decision making in its linguistic description means making logical decisions. In essence, a rational agent optimally processes all relevant information to achieve its goal. Rationality has two elements and these are the use of…

Artificial Intelligence · Computer Science 2019-02-14 Tshilidzi Marwala

The limit of artificial intelligence: Can machines be rational?

This paper studies the question on whether machines can be rational. It observes the existing reasons why humans are not rational which is due to imperfect and limited information, limited and inconsistent processing power through the brain…

Artificial Intelligence · Computer Science 2018-12-18 Tshilidzi Marwala

Reasoning Elicitation in Language Models via Counterfactual Feedback

Despite the increasing effectiveness of language models, their reasoning capabilities remain underdeveloped. In particular, causal reasoning through counterfactual question answering is lacking. This work aims to bridge this gap. We first…

Computation and Language · Computer Science 2025-03-18 Alihan Hüyük , Xinnuo Xu , Jacqueline Maasch , Aditya V. Nori , Javier González

Studying and improving reasoning in humans and machines

In the present study, we investigate and compare reasoning in large language models (LLM) and humans using a selection of cognitive psychology tools traditionally dedicated to the study of (bounded) rationality. To do so, we presented to…

Computation and Language · Computer Science 2023-09-25 Nicolas Yax , Hernan Anlló , Stefano Palminteri

Can rationality be measured?

This paper studies whether rationality can be computed. Rationality is defined as the use of complete information, which is processed with a perfect biological or physical brain, in an optimized fashion. To compute rationality one needs to…

Artificial Intelligence · Computer Science 2018-12-27 Tshilidzi Marwala

Learning from Sufficient Rationales: Analysing the Relationship Between Explanation Faithfulness and Token-level Regularisation Strategies

Human explanations of natural language, rationales, form a tool to assess whether models learn a label for the right reasons or rely on dataset-specific shortcuts. Sufficiency is a common metric for estimating the informativeness of…

Computation and Language · Computer Science 2025-11-21 Jonathan Kamp , Lisa Beinborn , Antske Fokkens

Characterizing, Evaluating, and Optimizing Complex Reasoning

Large Reasoning Models (LRMs) increasingly rely on reasoning traces with complex internal structures. However, existing work lacks a unified answer to three fundamental questions: (1) what defines high-quality reasoning, (2) how to reliably…

Computation and Language · Computer Science 2026-02-10 Haoran Zhang , Yafu Li , Zhi Wang , Zhilin Wang , Shunkai Zhang , Xiaoye Qu , Yu Cheng

Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations

Human-annotated labels and explanations are critical for training explainable NLP models. However, unlike human-annotated labels whose quality is easier to calibrate (e.g., with a majority vote), human-crafted free-form explanations can be…

Computation and Language · Computer Science 2023-05-23 Bingsheng Yao , Prithviraj Sen , Lucian Popa , James Hendler , Dakuo Wang

Human irrationality: both bad and good for reward inference

Assuming humans are (approximately) rational enables robots to infer reward functions by observing human behavior. But people exhibit a wide array of irrationalities, and our goal with this work is to better understand the effect they can…

Machine Learning · Computer Science 2021-11-16 Lawrence Chan , Andrew Critch , Anca Dragan

Rationale awareness for quality assurance in iterative human computation processes

Human computation refers to the outsourcing of computation tasks to human workers. It offers a new direction for solving a variety of problems and calls for innovative ways of managing human computation processes. The majority of human…

Human-Computer Interaction · Computer Science 2012-04-17 Lu Xiao

Self-rationalization improves LLM as a fine-grained judge

LLM-as-a-judge models have been used for evaluating both human and AI generated content, specifically by providing scores and rationales. Rationales, in addition to increasing transparency, help models learn to calibrate its judgments.…

Computation and Language · Computer Science 2024-10-10 Prapti Trivedi , Aditya Gulati , Oliver Molenschot , Meghana Arakkal Rajeev , Rajkumar Ramamurthy , Keith Stevens , Tanveesh Singh Chaudhery , Jahnavi Jambholkar , James Zou , Nazneen Rajani

Estimating Subjective Crowd-Evaluations as an Additional Objective to Improve Natural Language Generation

Human ratings are one of the most prevalent methods to evaluate the performance of natural language processing algorithms. Similarly, it is common to measure the quality of sentences generated by a natural language generation model using…

Computation and Language · Computer Science 2021-04-13 Jakob Nyberg , Ramesh Manuvinakurike , Maike Paetzel-Prüsmann

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Large language models show improved downstream task performance when prompted to generate step-by-step reasoning to justify their final answers. These reasoning steps greatly improve model interpretability and verification, but objectively…

Computation and Language · Computer Science 2023-09-13 Olga Golovneva , Moya Chen , Spencer Poff , Martin Corredor , Luke Zettlemoyer , Maryam Fazel-Zarandi , Asli Celikyilmaz

Rationalization through Concepts

Automated predictions require explanations to be interpretable by humans. One type of explanation is a rationale, i.e., a selection of input features such as relevant text snippets from which the model computes the outcome. However, a…

Computation and Language · Computer Science 2021-05-12 Diego Antognini , Boi Faltings