English
Related papers

Related papers: Context-based Transformer Models for Answer Senten…

200 papers

Answer sentence selection (AS2) in open-domain question answering finds answer for a question by ranking candidate sentences extracted from web documents. Recent work exploits answer context, i.e., sentences around a candidate, by…

Computation and Language · Computer Science 2023-06-06 Minh Van Nguyen , Kishan KC , Toan Nguyen , Thien Huu Nguyen , Ankit Chadha , Thuy Vu

Transformer-based language models benefit from conditioning on contexts of hundreds to thousands of previous tokens. What aspects of these contexts contribute to accurate model prediction? We describe a series of experiments that measure…

Computation and Language · Computer Science 2021-06-17 Joe O'Connor , Jacob Andreas

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

Answer Sentence Selection (AS2) is a core component for building an accurate Question Answering pipeline. AS2 models rank a set of candidate sentences based on how likely they answer a given question. The state of the art in AS2 exploits…

Computation and Language · Computer Science 2023-05-25 Luca Di Liello , Siddhant Garg , Alessandro Moschitti

An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question from a set of retrieved relevant documents. In this paper, we propose three novel…

Computation and Language · Computer Science 2022-10-21 Luca Di Liello , Siddhant Garg , Luca Soldaini , Alessandro Moschitti

Language Generation Models produce words based on the previous context. Although existing methods offer input attributions as explanations for a model's prediction, it is still unclear how prior words affect the model's decision throughout…

Computation and Language · Computer Science 2023-05-23 Javier Ferrando , Gerard I. Gállego , Ioannis Tsiamas , Marta R. Costa-jussà

Both humans and machines learn the meaning of unknown words through contextual information in a sentence, but not all contexts are equally helpful for learning. We introduce an effective method for capturing the level of contextual…

Computation and Language · Computer Science 2023-11-10 Sungjin Nam , David Jurgens , Gwen Frishkoff , Kevyn Collins-Thompson

Transformer-based models have demonstrated their effectiveness in automatic speech recognition (ASR) tasks and even shown superior performance over the conventional hybrid framework. The main idea of Transformers is to capture the…

Sound · Computer Science 2022-07-05 Kun Wei , Pengcheng Guo , Ning Jiang

Argument mining systems often consider contextual information, i.e. information outside of an argumentative discourse unit, when trained to accomplish tasks such as argument component identification, classification, and relation extraction.…

Computation and Language · Computer Science 2021-02-23 Luca Lugini , Diane Litman

Modern sequential recommender systems commonly use transformer-based models for next-item prediction. While these models demonstrate a strong balance between efficiency and quality, integrating interleaving features - such as the query…

Information Retrieval · Computer Science 2025-08-13 Andrii Dzhoha , Alisa Mironenko , Evgeny Labzin , Vladimir Vlasov , Maarten Versteegh , Marjan Celikik

Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency, traditional AS2 models score question-answer pairs individually, ignoring any…

Computation and Language · Computer Science 2021-02-05 Rujun Han , Luca Soldaini , Alessandro Moschitti

Reading a document and extracting an answer to a question about its content has attracted substantial attention recently. While most work has focused on the interaction between the question and the document, in this work we evaluate the…

Computation and Language · Computer Science 2018-09-05 Shimi Salant , Jonathan Berant

Context modeling is essential to generate coherent and consistent translation for Document-level Neural Machine Translations. The widely used method for document-level translation usually compresses the context information into a…

Computation and Language · Computer Science 2019-11-22 Zhengxin Yang , Jinchao Zhang , Fandong Meng , Shuhao Gu , Yang Feng , Jie Zhou

Coherence is an important aspect of text quality and is crucial for ensuring its readability. It is essential desirable for outputs from text generation systems like summarization, question answering, machine translation, question…

Computation and Language · Computer Science 2022-02-24 Tushar Abhishek , Daksh Rawat , Manish Gupta , Vasudeva Varma

Document-level neural machine translation has yielded attractive improvements. However, majority of existing methods roughly use all context sentences in a fixed scope. They neglect the fact that different source sentences need different…

Computation and Language · Computer Science 2020-10-12 Xiaomian Kang , Yang Zhao , Jiajun Zhang , Chengqing Zong

The Transformer model is widely used in natural language processing for sentence representation. However, the previous Transformer-based models focus on function words that have limited meaning in most cases and could merely extract…

Computation and Language · Computer Science 2021-07-05 Yu Shi

We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pretrained document context signals and assess the impact on translation performance of (1) different pretraining…

Computation and Language · Computer Science 2021-08-02 Domenic Donato , Lei Yu , Chris Dyer

Large transformer-based language models have been shown to be very effective in many classification tasks. However, their computational complexity prevents their use in applications requiring the classification of a large set of candidates.…

Computation and Language · Computer Science 2020-05-08 Luca Soldaini , Alessandro Moschitti

When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing…

Computation and Language · Computer Science 2020-05-12 Fabio Petroni , Patrick Lewis , Aleksandra Piktus , Tim Rocktäschel , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

Document Grounded Conversations is a task to generate dialogue responses when chatting about the content of a given document. Obviously, document knowledge plays a critical role in Document Grounded Conversations, while existing dialogue…

Computation and Language · Computer Science 2019-08-02 Zekang Li , Cheng Niu , Fandong Meng , Yang Feng , Qian Li , Jie Zhou
‹ Prev 1 2 3 10 Next ›