Related papers: Learning to Search in Long Documents Using Documen…

Hierarchical Question Answering for Long Documents

We present a framework for question answering that can efficiently scale to longer documents while maintaining or even improving performance of state-of-the-art models. While most successful approaches for reading comprehension rely on…

Computation and Language · Computer Science 2017-02-09 Eunsol Choi , Daniel Hewlett , Alexandre Lacoste , Illia Polosukhin , Jakob Uszkoreit , Jonathan Berant

Unfolding the Structure of a Document using Deep Learning

Understanding and extracting of information from large documents, such as business opportunities, academic articles, medical documents and technical reports, poses challenges not present in short documents. Such large documents may be…

Computation and Language · Computer Science 2019-10-10 Muhammad Mahbubur Rahman , Tim Finin

Sentence Ordering and Coherence Modeling using Recurrent Neural Networks

Modeling the structure of coherent texts is a key NLP problem. The task of coherently organizing a given set of sentences has been commonly used to build and evaluate models that understand such structure. We propose an end-to-end…

Computation and Language · Computer Science 2017-12-25 Lajanugen Logeswaran , Honglak Lee , Dragomir Radev

A Deep Cascade Model for Multi-Document Reading Comprehension

A fundamental trade-off between effectiveness and efficiency needs to be balanced when designing an online question answering system. Effectiveness comes from sophisticated functions such as extractive machine reading comprehension (MRC),…

Computation and Language · Computer Science 2019-08-14 Ming Yan , Jiangnan Xia , Chen Wu , Bin Bi , Zhongzhou Zhao , Ji Zhang , Luo Si , Rui Wang , Wei Wang , Haiqing Chen

Fast Reading Comprehension with ConvNets

State-of-the-art deep reading comprehension models are dominated by recurrent neural nets. Their sequential nature is a natural fit for language, but it also precludes parallelization within an instances and often becomes the bottleneck for…

Computation and Language · Computer Science 2017-11-15 Felix Wu , Ni Lao , John Blitzer , Guandao Yang , Kilian Weinberger

Teaching Machines to Read and Comprehend

Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large scale…

Computation and Language · Computer Science 2015-11-20 Karl Moritz Hermann , Tomáš Kočiský , Edward Grefenstette , Lasse Espeholt , Will Kay , Mustafa Suleyman , Phil Blunsom

Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering

Existing long-document question answering systems typically process texts as flat sequences or use heuristic chunking, which overlook the discourse structures that naturally guide human comprehension. We present a discourse-aware…

Information Retrieval · Computer Science 2026-05-08 Huiyao Chen , Yi Yang , Yinghui Li , Meishan Zhang , Baotian Hu , Min Zhang

Question Answering by Reasoning Across Documents with Graph Convolutional Networks

Most research in reading comprehension has focused on answering questions based on individual documents or even single paragraphs. We introduce a neural model which integrates and reasons relying on information spread within documents and…

Computation and Language · Computer Science 2022-09-28 Nicola De Cao , Wilker Aziz , Ivan Titov

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension

In this paper, we study machine reading comprehension (MRC) on long texts, where a model takes as inputs a lengthy document and a question and then extracts a text span from the document as an answer. State-of-the-art models tend to use a…

Computation and Language · Computer Science 2020-05-20 Hongyu Gong , Yelong Shen , Dian Yu , Jianshu Chen , Dong Yu

Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Text-based Question Answering (QA) is a challenging task which aims at finding short concrete answers for users' questions. This line of research has been widely studied with information retrieval techniques and has received increasing…

Information Retrieval · Computer Science 2020-05-28 Zahra Abbasiantaeb , Saeedeh Momtazi

ReasoNet: Learning to Stop Reading in Machine Comprehension

Teaching a computer to read and answer general questions pertaining to a document is a challenging yet unsolved problem. In this paper, we describe a novel neural network architecture called the Reasoning Network (ReasoNet) for machine…

Machine Learning · Computer Science 2017-06-21 Yelong Shen , Po-Sen Huang , Jianfeng Gao , Weizhu Chen

Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal

Reading Comprehension has received significant attention in recent years as high quality Question Answering (QA) datasets have become available. Despite state-of-the-art methods achieving strong overall accuracy, Multi-Hop (MH) reasoning…

Computation and Language · Computer Science 2019-05-24 Alex Long , Joel Mason , Alan Blair , Wei Wang

A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections

Question answering (QA) systems for large document collections typically use pipelines that (i) retrieve possibly relevant documents, (ii) re-rank them, (iii) rank paragraphs or other snippets of the top-ranked documents, and (iv) select…

Information Retrieval · Computer Science 2021-06-17 Dimitris Pappas , Ion Androutsopoulos

Structure and Semantics Preserving Document Representations

Retrieving relevant documents from a corpus is typically based on the semantic similarity between the document content and query text. The inclusion of structural relationship between documents can benefit the retrieval mechanism by…

Information Retrieval · Computer Science 2022-04-05 Natraj Raman , Sameena Shah , Manuela Veloso

A Survey on Neural Machine Reading Comprehension

Enabling a machine to read and comprehend the natural language documents so that it can answer some questions remains an elusive challenge. In recent years, the popularity of deep learning and the establishment of large-scale datasets have…

Computation and Language · Computer Science 2019-06-11 Boyu Qiu , Xu Chen , Jungang Xu , Yingfei Sun

Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension

Natural Questions is a new challenging machine reading comprehension benchmark with two-grained answers, which are a long answer (typically a paragraph) and a short answer (one or more entities inside the long answer). Despite the…

Computation and Language · Computer Science 2020-05-14 Bo Zheng , Haoyang Wen , Yaobo Liang , Nan Duan , Wanxiang Che , Daxin Jiang , Ming Zhou , Ting Liu

Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension

This paper considers the reading comprehension task in which multiple documents are given as input. Prior work has shown that a pipeline of retriever, reader, and reranker can improve the overall performance. However, the pipeline system is…

Computation and Language · Computer Science 2019-06-12 Minghao Hu , Yuxing Peng , Zhen Huang , Dongsheng Li

PDFTriage: Question Answering over Long, Structured Documents

Large Language Models (LLMs) have issues with document question answering (QA) in situations where the document is unable to fit in the small context length of an LLM. To overcome this issue, most existing works focus on retrieving the…

Computation and Language · Computer Science 2023-11-09 Jon Saad-Falcon , Joe Barrow , Alexa Siu , Ani Nenkova , David Seunghyun Yoon , Ryan A. Rossi , Franck Dernoncourt

Distraction-Based Neural Networks for Document Summarization

Distributed representation learned with neural networks has recently shown to be effective in modeling natural languages at fine granularities such as words, phrases, and even sentences. Whether and how such an approach can be extended to…

Computation and Language · Computer Science 2016-10-27 Qian Chen , Xiaodan Zhu , Zhenhua Ling , Si Wei , Hui Jiang

Bidirectional Recursive Neural Networks for Token-Level Labeling with Structure

Recently, deep architectures, such as recurrent and recursive neural networks have been successfully applied to various natural language processing tasks. Inspired by bidirectional recurrent neural networks which use representations that…

Machine Learning · Computer Science 2013-12-03 Ozan İrsoy , Claire Cardie