Related papers: Explaining Question Answering Models through Text …

Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks

Large language models (LLMs) are proficient at generating fluent text with minimal task-specific supervision. Yet, their ability to provide well-grounded rationalizations for knowledge-intensive tasks remains under-explored. Such tasks,…

Computation and Language · Computer Science 2024-02-02 Aditi Mishra , Sajjadur Rahman , Hannah Kim , Kushan Mitra , Estevam Hruschka

Machine Comprehension by Text-to-Text Neural Question Generation

We propose a recurrent neural model that generates natural-language questions from documents, conditioned on answers. We show how to train the model using a combination of supervised and reinforcement learning. After teacher forcing for…

Computation and Language · Computer Science 2017-05-16 Xingdi Yuan , Tong Wang , Caglar Gulcehre , Alessandro Sordoni , Philip Bachman , Sandeep Subramanian , Saizheng Zhang , Adam Trischler

Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation

Large Language Models (LLMs) have in recent years demonstrated impressive prowess in natural language generation. A common practice to improve generation diversity is to sample multiple outputs from the model. However, there lacks a simple…

Computation and Language · Computer Science 2022-09-23 Xingdi Yuan , Tong Wang , Yen-Hsiang Wang , Emery Fine , Rania Abdelghani , Pauline Lucas , Hélène Sauzéon , Pierre-Yves Oudeyer

Deep Latent-Variable Models for Text Generation

Text generation aims to produce human-like natural language output for down-stream tasks. It covers a wide range of applications like machine translation, document summarization, dialogue generation and so on. Recently deep neural…

Computation and Language · Computer Science 2022-03-07 Xiaoyu Shen

Hypothesis generation and updating in large language models

Large language models (LLMs) increasingly help people solve problems, from debugging code to repairing machinery. This process requires generating plausible hypotheses from partial descriptions, then updating them as more information…

Machine Learning · Computer Science 2026-05-08 Hua-Dong Xiong

Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification

The need for interpretability in deep learning has driven interest in counterfactual explanations, which identify minimal changes to an instance that change a model's prediction. Current counterfactual (CF) generation methods require…

Computation and Language · Computer Science 2025-12-11 Van Bach Nguyen , Christin Seifert , Jörg Schlötterer

Explanations from Large Language Models Make Small Reasoners Better

Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the…

Computation and Language · Computer Science 2022-10-14 Shiyang Li , Jianshu Chen , Yelong Shen , Zhiyu Chen , Xinlu Zhang , Zekun Li , Hong Wang , Jing Qian , Baolin Peng , Yi Mao , Wenhu Chen , Xifeng Yan

QURIOUS: Question Generation Pretraining for Text Generation

Recent trends in natural language processing using pretraining have shifted focus towards pretraining and fine-tuning approaches for text generation. Often the focus has been on task-agnostic approaches that generalize the language modeling…

Computation and Language · Computer Science 2020-04-24 Shashi Narayan , Gonçalo Simoes , Ji Ma , Hannah Craighead , Ryan Mcdonald

Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations

Large Language Models (LLMs) are often asked to explain their outputs to enhance accuracy and transparency. However, evidence suggests that these explanations can misrepresent the models' true reasoning processes. One effective way to…

Computation and Language · Computer Science 2025-03-13 Danielle Villa , Maria Chang , Keerthiram Murugesan , Rosario Uceda-Sosa , Karthikeyan Natesan Ramamurthy

Explaining How Transformers Use Context to Build Predictions

Language Generation Models produce words based on the previous context. Although existing methods offer input attributions as explanations for a model's prediction, it is still unclear how prior words affect the model's decision throughout…

Computation and Language · Computer Science 2023-05-23 Javier Ferrando , Gerard I. Gállego , Ioannis Tsiamas , Marta R. Costa-jussà

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

In the rapidly evolving field of Explainable Natural Language Processing (NLP), textual explanations, i.e., human-like rationales, are pivotal for explaining model predictions and enriching datasets with interpretable labels. Traditional…

Computation and Language · Computer Science 2025-11-12 Mahdi Dhaini , Juraj Vladika , Ege Erdogan , Zineb Attaoui , Gjergji Kasneci

Exploration of Masked and Causal Language Modelling for Text Generation

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti , Samuel Belkadi , Lifeng Han , Goran Nenadic

Pretrained Language Models for Text Generation: A Survey

Text Generation aims to produce plausible and readable text in a human language from input data. The resurgence of deep learning has greatly advanced this field, in particular, with the help of neural generation models based on pre-trained…

Computation and Language · Computer Science 2022-05-17 Junyi Li , Tianyi Tang , Wayne Xin Zhao , Jian-Yun Nie , Ji-Rong Wen

Elaboration-Generating Commonsense Question Answering at Scale

In question answering requiring common sense, language models (e.g., GPT-3) have been used to generate text expressing background knowledge that helps improve performance. Yet the cost of working with such models is very high; in this work,…

Computation and Language · Computer Science 2023-07-18 Wenya Wang , Vivek Srikumar , Hanna Hajishirzi , Noah A. Smith

Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding

Large Language Models (LLMs) have demonstrated a powerful ability for text generation. However, achieving optimal results with a given prompt or instruction can be challenging, especially for billion-sized models. Additionally, undesired…

Computation and Language · Computer Science 2024-10-07 Lifu Tu , Semih Yavuz , Jin Qu , Jiacheng Xu , Rui Meng , Caiming Xiong , Yingbo Zhou

Finding Answers in Thought Matters: Revisiting Evaluation on Large Language Models with Reasoning

Evaluating generative models, such as large language models (LLMs), commonly involves question-answering tasks where the final answer is selected based on probability of answer choices. On the other hand, for models requiring reasoning, the…

Computation and Language · Computer Science 2025-10-17 Hwiyeol Jo , Joosung Lee , Jaehone Lee , Sang-Woo Lee , Joonsuk Park , Kang Min Yoo

Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books. Previous efforts to construct such datasets relied on crowd-sourcing, but the emergence of…

Computation and Language · Computer Science 2024-06-04 Bernd Bohnet , Kevin Swersky , Rosanne Liu , Pranjal Awasthi , Azade Nova , Javier Snaider , Hanie Sedghi , Aaron T Parisi , Michael Collins , Angeliki Lazaridou , Orhan Firat , Noah Fiedel

Hypothesis Generation with Large Language Models

Effective generation of novel hypotheses is instrumental to scientific progress. So far, researchers have been the main powerhouse behind hypothesis generation by painstaking data analysis and thinking (also known as the Eureka moment). In…

Artificial Intelligence · Computer Science 2024-12-20 Yangqiaoyu Zhou , Haokun Liu , Tejes Srivastava , Hongyuan Mei , Chenhao Tan

Critical Questions Generation: Motivation and Challenges

The development of Large Language Models (LLMs) has brought impressive performances on mitigation strategies against misinformation, such as counterargument generation. However, LLMs are still seriously hindered by outdated knowledge and by…

Computation and Language · Computer Science 2024-10-21 Blanca Calvo Figueras , Rodrigo Agerri

A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling

Question answer generation using Natural Language Processing models is ubiquitous in the world around us. It is used in many use cases such as the building of chat bots, suggestive prompts in google search and also as a way of navigating…

Computation and Language · Computer Science 2023-11-28 Shashidhar Reddy Javaji , Haoran Hu , Sai Sameer Vennam , Vijaya Gajanan Buddhavarapu