English
Related papers

Related papers: Improving Conditioning in Context-Aware Sequence t…

200 papers

Neural sequence-to-sequence models are well established for applications which can be cast as mapping a single input sequence into a single output sequence. In this work, we focus on one-to-many sequence transduction problems, such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-26 Jing Shi , Xuankai Chang , Pengcheng Guo , Shinji Watanabe , Yusuke Fujita , Jiaming Xu , Bo Xu , Lei Xie

A sequence-to-sequence model is a neural network module for mapping two sequences of different lengths. The sequence-to-sequence model has three core modules: encoder, decoder, and attention. Attention is the bridge that connects the…

Computation and Language · Computer Science 2018-07-24 Andros Tjandra , Sakriani Sakti , Satoshi Nakamura

This work proposes a novel approach based on sequence-to-sequence (seq2seq) models for context-aware conversational systems. Exist- ing seq2seq models have been shown to be good for generating natural responses in a data-driven…

Computation and Language · Computer Science 2018-05-23 Silje Christensen , Simen Johnsrud , Massimiliano Ruocco , Heri Ramampiaro

Neural conversational models tend to produce generic or safe responses in different contexts, e.g., reply \textit{"Of course"} to narrative statements or \textit{"I don't know"} to questions. In this paper, we propose an end-to-end approach…

Computation and Language · Computer Science 2016-07-21 Kun Xiong , Anqi Cui , Zefeng Zhang , Ming Li

Neural network based sequence-to-sequence models in an encoder-decoder framework have been successfully applied to solve Question Answering (QA) problems, predicting answers from statements and questions. However, almost all previous models…

Computation and Language · Computer Science 2017-09-05 Huayu Li , Martin Renqiang Min , Yong Ge , Asim Kadav

Past work has long recognized the important role of context in guiding how humans search their memory. While context-based memory models can explain many memory phenomena, it remains unclear why humans develop such architectures over…

Neurons and Cognition · Quantitative Biology 2025-06-24 Nikolaus Salvatore , Qiong Zhang

Modern sequential recommender systems commonly use transformer-based models for next-item prediction. While these models demonstrate a strong balance between efficiency and quality, integrating interleaving features - such as the query…

Information Retrieval · Computer Science 2025-08-13 Andrii Dzhoha , Alisa Mironenko , Evgeny Labzin , Vladimir Vlasov , Maarten Versteegh , Marjan Celikik

Sequence-to-sequence translation methods based on generation with a side-conditioned language model have recently shown promising results in several tasks. In machine translation, models conditioned on source side words have been used to…

Computation and Language · Computer Science 2015-08-21 Kaisheng Yao , Geoffrey Zweig

Auto-regressive sequence-to-sequence models with attention mechanism have achieved state-of-the-art performance in many tasks such as machine translation and speech synthesis. These models can be difficult to train. The standard approach,…

Machine Learning · Computer Science 2019-10-04 Qingyun Dou , Yiting Lu , Joshua Efiong , Mark J. F. Gales

We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pretrained document context signals and assess the impact on translation performance of (1) different pretraining…

Computation and Language · Computer Science 2021-08-02 Domenic Donato , Lei Yu , Chris Dyer

Automatic question generation can benefit many applications ranging from dialogue systems to reading comprehension. While questions are often asked with respect to long documents, there are many challenges with modeling such long documents.…

Computation and Language · Computer Science 2019-10-24 Luu Anh Tuan , Darsh J Shah , Regina Barzilay

Sequence-to-sequence learning with neural networks has become the de facto standard for sequence prediction tasks. This approach typically models the local distribution over the next word with a powerful neural network that can condition on…

Computation and Language · Computer Science 2021-11-17 Yoon Kim

We propose a simple data augmentation protocol aimed at providing a compositional inductive bias in conditional and unconditional sequence models. Under this protocol, synthetic training examples are constructed by taking real training…

Computation and Language · Computer Science 2020-05-20 Jacob Andreas

Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation.…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-16 Devang S Ram Mohan , Raphael Lenain , Lorenzo Foglianti , Tian Huey Teh , Marlene Staib , Alexandra Torresquintero , Jiameng Gao

In recent years, the sequence-to-sequence learning neural networks with attention mechanism have achieved great progress. However, there are still challenges, especially for Neural Machine Translation (NMT), such as lower translation…

Computation and Language · Computer Science 2018-11-26 Si Zuo , Zhimin Xu

Data augmentation methods for Natural Language Processing tasks are explored in recent years, however they are limited and it is hard to capture the diversity on sentence level. Besides, it is not always possible to perform data…

Computation and Language · Computer Science 2022-05-20 M. Şafak Bilici , Mehmet Fatih Amasyali

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of…

Computation and Language · Computer Science 2024-12-24 Zhuo Chen , Xinyu Wang , Yong Jiang , Pengjun Xie , Fei Huang , Kewei Tu

We investigate the use of extended context in attention-based neural machine translation. We base our experiments on translated movie subtitles and discuss the effect of increasing the segments beyond single translation units. We study the…

Computation and Language · Computer Science 2017-08-22 Jörg Tiedemann , Yves Scherrer

Ever since the successful application of sequence to sequence learning for neural machine translation systems, interest has surged in its applicability towards language generation in other problem domains. Recent work has investigated the…

Computation and Language · Computer Science 2017-10-31 Sharath T. S. , Shubhangi Tandon , Ryan Bauer

Semantic parsing is the process of translating natural language utterances into logical forms, which has many important applications such as question answering and instruction following. Sequence-to-sequence models have been very successful…

Computation and Language · Computer Science 2019-05-29 Amir Ziai
‹ Prev 1 2 3 10 Next ›