Related papers: Improving Conditioning in Context-Aware Sequence t…

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals

Neural sequence-to-sequence models are well established for applications which can be cast as mapping a single input sequence into a single output sequence. In this work, we focus on one-to-many sequence transduction problems, such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-26 Jing Shi , Xuankai Chang , Pengcheng Guo , Shinji Watanabe , Yusuke Fujita , Jiaming Xu , Bo Xu , Lei Xie

Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model

A sequence-to-sequence model is a neural network module for mapping two sequences of different lengths. The sequence-to-sequence model has three core modules: encoder, decoder, and attention. Attention is the bridge that connects the…

Computation and Language · Computer Science 2018-07-24 Andros Tjandra , Sakriani Sakti , Satoshi Nakamura

Context-Aware Sequence-to-Sequence Models for Conversational Systems

This work proposes a novel approach based on sequence-to-sequence (seq2seq) models for context-aware conversational systems. Exist- ing seq2seq models have been shown to be good for generating natural responses in a data-driven…

Computation and Language · Computer Science 2018-05-23 Silje Christensen , Simen Johnsrud , Massimiliano Ruocco , Heri Ramampiaro

Neural Contextual Conversation Learning with Labeled Question-Answering Pairs

Neural conversational models tend to produce generic or safe responses in different contexts, e.g., reply \textit{"Of course"} to narrative statements or \textit{"I don't know"} to questions. In this paper, we propose an end-to-end approach…

Computation and Language · Computer Science 2016-07-21 Kun Xiong , Anqi Cui , Zefeng Zhang , Ming Li

A Context-aware Attention Network for Interactive Question Answering

Neural network based sequence-to-sequence models in an encoder-decoder framework have been successfully applied to solve Question Answering (QA) problems, predicting answers from statements and questions. However, almost all previous models…

Computation and Language · Computer Science 2017-09-05 Huayu Li , Martin Renqiang Min , Yong Ge , Asim Kadav

Sequence-to-Sequence Models with Attention Mechanistically Map to the Architecture of Human Memory Search

Past work has long recognized the important role of context in guiding how humans search their memory. While context-based memory models can explain many memory phenomena, it remains unclear why humans develop such architectures over…

Neurons and Cognition · Quantitative Biology 2025-06-24 Nikolaus Salvatore , Qiong Zhang

Efficient and Effective Query Context-Aware Learning-to-Rank Model for Sequential Recommendation

Modern sequential recommender systems commonly use transformer-based models for next-item prediction. While these models demonstrate a strong balance between efficiency and quality, integrating interleaving features - such as the query…

Information Retrieval · Computer Science 2025-08-13 Andrii Dzhoha , Alisa Mironenko , Evgeny Labzin , Vladimir Vlasov , Maarten Versteegh , Marjan Celikik

Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion

Sequence-to-sequence translation methods based on generation with a side-conditioned language model have recently shown promising results in several tasks. In machine translation, models conditioned on source side words have been used to…

Computation and Language · Computer Science 2015-08-21 Kaisheng Yao , Geoffrey Zweig

Attention Forcing for Sequence-to-sequence Model Training

Auto-regressive sequence-to-sequence models with attention mechanism have achieved state-of-the-art performance in many tasks such as machine translation and speech synthesis. These models can be difficult to train. The standard approach,…

Machine Learning · Computer Science 2019-10-04 Qingyun Dou , Yiting Lu , Joshua Efiong , Mark J. F. Gales

Diverse Pretrained Context Encodings Improve Document Translation

We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pretrained document context signals and assess the impact on translation performance of (1) different pretraining…

Computation and Language · Computer Science 2021-08-02 Domenic Donato , Lei Yu , Chris Dyer

Capturing Greater Context for Question Generation

Automatic question generation can benefit many applications ranging from dialogue systems to reading comprehension. While questions are often asked with respect to long documents, there are many challenges with modeling such long documents.…

Computation and Language · Computer Science 2019-10-24 Luu Anh Tuan , Darsh J Shah , Regina Barzilay

Sequence-to-Sequence Learning with Latent Neural Grammars

Sequence-to-sequence learning with neural networks has become the de facto standard for sequence prediction tasks. This approach typically models the local distribution over the next word with a powerful neural network that can condition on…

Computation and Language · Computer Science 2021-11-17 Yoon Kim

Good-Enough Compositional Data Augmentation

We propose a simple data augmentation protocol aimed at providing a compositional inductive bias in conditional and unconditional sequence models. Under this protocol, synthetic training examples are constructed by taking real training…

Computation and Language · Computer Science 2020-05-20 Jacob Andreas

Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation.…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-16 Devang S Ram Mohan , Raphael Lenain , Lorenzo Foglianti , Tian Huey Teh , Marlene Staib , Alexandra Torresquintero , Jiameng Gao

A Hierarchical Neural Network for Sequence-to-Sequences Learning

In recent years, the sequence-to-sequence learning neural networks with attention mechanism have achieved great progress. However, there are still challenges, especially for Neural Machine Translation (NMT), such as lower translation…

Computation and Language · Computer Science 2018-11-26 Si Zuo , Zhimin Xu

Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational Bayes

Data augmentation methods for Natural Language Processing tasks are explored in recent years, however they are limited and it is hard to capture the diversity on sentence level. Besides, it is not always possible to perform data…

Computation and Language · Computer Science 2022-05-20 M. Şafak Bilici , Mehmet Fatih Amasyali

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of…

Computation and Language · Computer Science 2024-12-24 Zhuo Chen , Xinyu Wang , Yong Jiang , Pengjun Xie , Fei Huang , Kewei Tu

Neural Machine Translation with Extended Context

We investigate the use of extended context in attention-based neural machine translation. We base our experiments on translated movie subtitles and discuss the effect of increasing the segments beyond single translation units. We study the…

Computation and Language · Computer Science 2017-08-22 Jörg Tiedemann , Yves Scherrer

A Dual Encoder Sequence to Sequence Model for Open-Domain Dialogue Modeling

Ever since the successful application of sequence to sequence learning for neural machine translation systems, interest has surged in its applicability towards language generation in other problem domains. Recent work has investigated the…

Computation and Language · Computer Science 2017-10-31 Sharath T. S. , Shubhangi Tandon , Ryan Bauer

Compositional pre-training for neural semantic parsing

Semantic parsing is the process of translating natural language utterances into logical forms, which has many important applications such as question answering and instruction following. Sequence-to-sequence models have been very successful…

Computation and Language · Computer Science 2019-05-29 Amir Ziai