Related papers: Context-based Transformer Models for Answer Senten…

Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection

Answer sentence selection (AS2) in open-domain question answering finds answer for a question by ranking candidate sentences extracted from web documents. Recent work exploits answer context, i.e., sentences around a candidate, by…

Computation and Language · Computer Science 2023-06-06 Minh Van Nguyen , Kishan KC , Toan Nguyen , Thien Huu Nguyen , Ankit Chadha , Thuy Vu

What Context Features Can Transformer Language Models Use?

Transformer-based language models benefit from conditioning on contexts of hundreds to thousands of previous tokens. What aspects of these contexts contribute to accurate model prediction? We describe a series of experiments that measure…

Computation and Language · Computer Science 2021-06-17 Joe O'Connor , Jacob Andreas

Contextualized word senses: from attention to compositionality

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

Context-Aware Transformer Pre-Training for Answer Sentence Selection

Answer Sentence Selection (AS2) is a core component for building an accurate Question Answering pipeline. AS2 models rank a set of candidate sentences based on how likely they answer a given question. The state of the art in AS2 exploits…

Computation and Language · Computer Science 2023-05-25 Luca Di Liello , Siddhant Garg , Alessandro Moschitti

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question from a set of retrieved relevant documents. In this paper, we propose three novel…

Computation and Language · Computer Science 2022-10-21 Luca Di Liello , Siddhant Garg , Luca Soldaini , Alessandro Moschitti

Explaining How Transformers Use Context to Build Predictions

Language Generation Models produce words based on the previous context. Although existing methods offer input attributions as explanations for a model's prediction, it is still unclear how prior words affect the model's decision throughout…

Computation and Language · Computer Science 2023-05-23 Javier Ferrando , Gerard I. Gállego , Ioannis Tsiamas , Marta R. Costa-jussà

An Attention-Based Model for Predicting Contextual Informativeness and Curriculum Learning Applications

Both humans and machines learn the meaning of unknown words through contextual information in a sentence, but not all contexts are equally helpful for learning. We introduce an effective method for capturing the level of contextual…

Computation and Language · Computer Science 2023-11-10 Sungjin Nam , David Jurgens , Gwen Frishkoff , Kevyn Collins-Thompson

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism

Transformer-based models have demonstrated their effectiveness in automatic speech recognition (ASR) tasks and even shown superior performance over the conventional hybrid framework. The main idea of Transformers is to capture the…

Sound · Computer Science 2022-07-05 Kun Wei , Pengcheng Guo , Ning Jiang

Contextual Argument Component Classification for Class Discussions

Argument mining systems often consider contextual information, i.e. information outside of an argumentative discourse unit, when trained to accomplish tasks such as argument component identification, classification, and relation extraction.…

Computation and Language · Computer Science 2021-02-23 Luca Lugini , Diane Litman

Efficient and Effective Query Context-Aware Learning-to-Rank Model for Sequential Recommendation

Modern sequential recommender systems commonly use transformer-based models for next-item prediction. While these models demonstrate a strong balance between efficiency and quality, integrating interleaving features - such as the query…

Information Retrieval · Computer Science 2025-08-13 Andrii Dzhoha , Alisa Mironenko , Evgeny Labzin , Vladimir Vlasov , Maarten Versteegh , Marjan Celikik

Modeling Context in Answer Sentence Selection Systems on a Latency Budget

Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency, traditional AS2 models score question-answer pairs individually, ignoring any…

Computation and Language · Computer Science 2021-02-05 Rujun Han , Luca Soldaini , Alessandro Moschitti

Contextualized Word Representations for Reading Comprehension

Reading a document and extracting an answer to a question about its content has attracted substantial attention recently. While most work has focused on the interaction between the question and the document, in this work we evaluate the…

Computation and Language · Computer Science 2018-09-05 Shimi Salant , Jonathan Berant

Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation

Context modeling is essential to generate coherent and consistent translation for Document-level Neural Machine Translations. The widely used method for document-level translation usually compresses the context information into a…

Computation and Language · Computer Science 2019-11-22 Zhengxin Yang , Jinchao Zhang , Fandong Meng , Shuhao Gu , Yang Feng , Jie Zhou

Transformer Models for Text Coherence Assessment

Coherence is an important aspect of text quality and is crucial for ensuring its readability. It is essential desirable for outputs from text generation systems like summarization, question answering, machine translation, question…

Computation and Language · Computer Science 2022-02-24 Tushar Abhishek , Daksh Rawat , Manish Gupta , Vasudeva Varma

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Document-level neural machine translation has yielded attractive improvements. However, majority of existing methods roughly use all context sentences in a fixed scope. They neglect the fact that different source sentences need different…

Computation and Language · Computer Science 2020-10-12 Xiaomian Kang , Yang Zhao , Jiajun Zhang , Chengqing Zong

Transformer-F: A Transformer network with effective methods for learning universal sentence representation

The Transformer model is widely used in natural language processing for sentence representation. However, the previous Transformer-based models focus on function words that have limited meaning in most cases and could merely extract…

Computation and Language · Computer Science 2021-07-05 Yu Shi

Diverse Pretrained Context Encodings Improve Document Translation

We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pretrained document context signals and assess the impact on translation performance of (1) different pretraining…

Computation and Language · Computer Science 2021-08-02 Domenic Donato , Lei Yu , Chris Dyer

The Cascade Transformer: an Application for Efficient Answer Sentence Selection

Large transformer-based language models have been shown to be very effective in many classification tasks. However, their computational complexity prevents their use in applications requiring the classification of a large set of candidates.…

Computation and Language · Computer Science 2020-05-08 Luca Soldaini , Alessandro Moschitti

How Context Affects Language Models' Factual Predictions

When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing…

Computation and Language · Computer Science 2020-05-12 Fabio Petroni , Patrick Lewis , Aleksandra Piktus , Tim Rocktäschel , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

Incremental Transformer with Deliberation Decoder for Document Grounded Conversations

Document Grounded Conversations is a task to generate dialogue responses when chatting about the content of a given document. Obviously, document knowledge plays a critical role in Document Grounded Conversations, while existing dialogue…

Computation and Language · Computer Science 2019-08-02 Zekang Li , Cheng Niu , Fandong Meng , Yang Feng , Qian Li , Jie Zhou