Related papers: STARC: Structured Annotations for Reading Comprehe…

Teaching Machine Comprehension with Compositional Explanations

Advances in machine reading comprehension (MRC) rely heavily on the collection of large scale human-annotated examples in the form of (question, paragraph, answer) triples. In contrast, humans are typically able to generalize with only a…

Computation and Language · Computer Science 2020-10-15 Qinyuan Ye , Xiao Huang , Elizabeth Boschee , Xiang Ren

Stochastic Answer Networks for Machine Reading Comprehension

We propose a simple yet robust stochastic answer network (SAN) that simulates multi-step reasoning in machine reading comprehension. Compared to previous work such as ReasoNet which used reinforcement learning to determine the number of…

Computation and Language · Computer Science 2018-05-16 Xiaodong Liu , Yelong Shen , Kevin Duh , Jianfeng Gao

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

Existing question answering systems can only predict answers without explicit reasoning processes, which hinder their explainability and make us overestimate their ability of understanding and reasoning over natural language. In this work,…

Computation and Language · Computer Science 2020-04-06 Ran Wang , Kun Tao , Dingjie Song , Zhilong Zhang , Xiao Ma , Xi'ao Su , Xinyu Dai

Reasoning Aware Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling

Self-Consistency mitigates hallucinations in Large Language Models (LLMs) by sampling multiple reasoning paths,but it lacks a systematic approach to determine the optimal number of samples or select the most faithful rationale. To address…

Computation and Language · Computer Science 2025-02-05 Guangya Wan , Yuqi Wu , Jie Chen , Sheng Li

R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason

Recent studies have revealed that reading comprehension (RC) systems learn to exploit annotation artifacts and other biases in current datasets. This prevents the community from reliably measuring the progress of RC systems. To address this…

Computation and Language · Computer Science 2020-05-05 Naoya Inoue , Pontus Stenetorp , Kentaro Inui

Structured Learning from Partial Annotations

Structured learning is appropriate when predicting structured outputs such as trees, graphs, or sequences. Most prior work requires the training set to consist of complete trees, graphs or sequences. Specifying such detailed ground truth…

Machine Learning · Computer Science 2012-07-03 Xinghua Lou , Fred Hamprecht

Reliable Natural Language Understanding with Large Language Models and Answer Set Programming

Humans understand language by extracting information (meaning) from sentences, combining it with existing commonsense knowledge, and then performing reasoning to draw conclusions. While large language models (LLMs) such as GPT-3 and ChatGPT…

Computation and Language · Computer Science 2023-08-31 Abhiramon Rajasekharan , Yankai Zeng , Parth Padalkar , Gopal Gupta

A Framework for Evaluation of Machine Reading Comprehension Gold Standards

Machine Reading Comprehension (MRC) is the task of answering a question over a paragraph of text. While neural MRC systems gain popularity and achieve noticeable performance, issues are being raised with the methodology used to establish…

Computation and Language · Computer Science 2020-03-11 Viktor Schlegel , Marco Valentino , André Freitas , Goran Nenadic , Riza Batista-Navarro

Automated Scoring for Reading Comprehension via In-context BERT Tuning

Automated scoring of open-ended student responses has the potential to significantly reduce human grader effort. Recent advances in automated scoring often leverage textual representations based on pre-trained language models such as BERT…

Machine Learning · Computer Science 2023-06-16 Nigel Fernandez , Aritra Ghosh , Naiming Liu , Zichao Wang , Benoît Choffin , Richard Baraniuk , Andrew Lan

ET5: A Novel End-to-end Framework for Conversational Machine Reading Comprehension

Conversational machine reading comprehension (CMRC) aims to assist computers to understand an natural language text and thereafter engage in a multi-turn conversation to answer questions related to the text. Existing methods typically…

Computation and Language · Computer Science 2022-09-26 Xiao Zhang , Heyan Huang , Zewen Chi , Xian-Ling Mao

KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding

Deep text understanding, which requires the connections between a given document and prior knowledge beyond its text, has been highlighted by many benchmarks in recent years. However, these benchmarks have encountered two major limitations.…

Computation and Language · Computer Science 2023-07-07 Zijun Yao , Yantao Liu , Xin Lv , Shulin Cao , Jifan Yu , Lei Hou , Juanzi Li

PALRACE: Reading Comprehension Dataset with Human Data and Labeled Rationales

Pre-trained language models achieves high performance on machine reading comprehension (MRC) tasks but the results are hard to explain. An appealing approach to make models explainable is to provide rationales for its decision. To…

Computation and Language · Computer Science 2022-03-25 Jiajie Zou , Yuran Zhang , Peiqing Jin , Cheng Luo , Xunyi Pan , Nai Ding

A Span-Extraction Dataset for Chinese Machine Reading Comprehension

Machine Reading Comprehension (MRC) has become enormously popular recently and has attracted a lot of attention. However, the existing reading comprehension datasets are mostly in English. In this paper, we introduce a Span-Extraction…

Computation and Language · Computer Science 2019-11-05 Yiming Cui , Ting Liu , Wanxiang Che , Li Xiao , Zhipeng Chen , Wentao Ma , Shijin Wang , Guoping Hu

DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs

Reading comprehension has recently seen rapid progress, with systems matching humans on the most popular datasets for the task. However, a large body of work has highlighted the brittleness of these systems, showing that there is much work…

Computation and Language · Computer Science 2019-04-18 Dheeru Dua , Yizhong Wang , Pradeep Dasigi , Gabriel Stanovsky , Sameer Singh , Matt Gardner

STAR: Mitigating Cascading Errors in Spatial Reasoning via Turn-point Alignment and Segment-level DPO

Structured spatial navigation is a core benchmark for Large Language Models (LLMs) spatial reasoning. Existing paradigms like Visualization-of-Thought (VoT) are prone to cascading errors in complex topologies. To solve this, we propose…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Pukun Zhao , Longxiang Wang , Chen Chen , Peicheng Wang , Fanqing Zhou , Runze Li , Haojian Huang

Multi Document Reading Comprehension

Reading Comprehension (RC) is a task of answering a question from a given passage or a set of passages. In the case of multiple passages, the task is to find the best possible answer to the question. Recent trials and experiments in the…

Computation and Language · Computer Science 2022-01-06 Avi Chawla

Assessing the Benchmarking Capacity of Machine Reading Comprehension Datasets

Existing analysis work in machine reading comprehension (MRC) is largely concerned with evaluating the capabilities of systems. However, the capabilities of datasets are not assessed for benchmarking language understanding precisely. We…

Computation and Language · Computer Science 2019-11-22 Saku Sugawara , Pontus Stenetorp , Kentaro Inui , Akiko Aizawa

ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning

Understanding how events are semantically related to each other is the essence of reading comprehension. Recent event-centric reading comprehension datasets focus mostly on event arguments or temporal relations. While these tasks partially…

Computation and Language · Computer Science 2021-09-14 Rujun Han , I-Hung Hsu , Jiao Sun , Julia Baylon , Qiang Ning , Dan Roth , Nanyun Peng

Integrating Vision-Centric Text Understanding for Conversational Recommender Systems

Conversational Recommender Systems (CRSs) have attracted growing attention for their ability to deliver personalized recommendations through natural language interactions. To more accurately infer user preferences from multi-turn…

Information Retrieval · Computer Science 2026-01-21 Wei Yuan , Shutong Qiao , Tong Chen , Quoc Viet Hung Nguyen , Zi Huang , Hongzhi Yin

MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics

Posing reading comprehension as a generation problem provides a great deal of flexibility, allowing for open-ended questions with few restrictions on possible answers. However, progress is impeded by existing generation metrics, which rely…

Computation and Language · Computer Science 2021-06-08 Anthony Chen , Gabriel Stanovsky , Sameer Singh , Matt Gardner