Related papers: Unsupervised Pretraining for Sequence to Sequence …

200 papers

This paper proposes a novel approach to pre-train an encoder-decoder sequence-to-sequence (seq2seq) model with unpaired speech and transcripts respectively. Our pre-training method is divided into two stages, named acoustic pre-training and…

Sound · Computer Science 2020-01-03 Zhiyun Fan, Shiyu Zhou, Bo Xu

In the literature, research on abstract meaning representation (AMR) parsing is much restricted by the size of the human-curated dataset, which is critical to building an AMR parser with good performance. To alleviate such data size…

Computation and Language · Computer Science 2020-10-06 Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang, Guodong Zhou

This work presents our ongoing research of unsupervised pretraining in neural machine translation (NMT). In our method, we initialize the weights of the encoder and decoder with two language models that are trained with monolingual data and…

Computation and Language · Computer Science 2020-10-20 Dušan Variš, Ondřej Bojar

In this paper, we present a substantial step in better understanding the SOTA sequence-to-sequence (Seq2Seq) pretraining for neural machine translation (NMT). We focus on studying the impact of the jointly pretrained decoder, which is the…

Computation and Language · Computer Science 2022-03-17 Wenxuan Wang, Wenxiang Jiao, Yongchang Hao, Xing Wang, Shuming Shi, Zhaopeng Tu, Michael Lyu

While machine translation has traditionally relied on large amounts of parallel corpora, a recent research line has managed to train both Neural Machine Translation (NMT) and Statistical Machine Translation (SMT) systems using monolingual…

Computation and Language · Computer Science 2021-12-28 Mikel Artetxe, Gorka Labaka, Eneko Agirre

We study sequence-to-sequence (seq2seq) pre-training with data augmentation for sentence rewriting. Instead of training a seq2seq model with gold training data and augmented data simultaneously, we separate them to train in different…

Computation and Language · Computer Science 2019-09-23 Yi Zhang, Tao Ge, Furu Wei, Ming Zhou, Xu Sun

This paper presents methods of making use of text supervision to improve the performance of sequence-to-sequence (seq2seq) voice conversion. Compared with conventional frame-to-frame voice conversion approaches, the seq2seq acoustic…

Sound · Computer Science 2020-01-14 Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai

Recent studies have demonstrated the efficiency of generative pretraining for English natural language understanding. In this work, we extend this approach to multiple languages and show the effectiveness of cross-lingual pretraining. We…

Computation and Language · Computer Science 2019-01-23 Guillaume Lample, Alexis Conneau

We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a conventional language model in natural language processing.…

Machine Learning · Computer Science 2015-11-05 Andrew M. Dai, Quoc V. Le

Sequence-to-sequence (seq2seq) learning is a popular fashion for large-scale pretraining language models. However, the prior seq2seq pretraining models generally focus on reconstructive objectives on the decoder side and neglect the effect…

Computation and Language · Computer Science 2024-01-10 Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

Pre-trained sequence-to-sequence models have significantly improved Neural Machine Translation (NMT). Different from prior works where pre-trained models usually adopt a unidirectional decoder, this paper demonstrates that pre-training a…

Computation and Language · Computer Science 2022-06-03 Pengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu

Successful methods for unsupervised neural machine translation (UNMT) employ crosslingual pretraining via self-supervision, often in the form of a masked language modeling or a sequence generation task, which requires the model to align the…

Computation and Language · Computer Science 2021-04-15 Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser

Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NMT), by drastically reducing the need for large parallel data. Most approaches adapt masked-language modeling (MLM) to sequence-to-sequence…

Computation and Language · Computer Science 2021-06-11 Christos Baziotis, Ivan Titov, Alexandra Birch, Barry Haddow

Pre-trained encoder-only and sequence-to-sequence (seq2seq) models each have advantages; however, training both model types from scratch is computationally expensive. We explore recipes to improve pre-training efficiency by initializing one…

Computation and Language · Computer Science 2023-06-16 Saleh Soltan, Andy Rosenbaum, Tobias Falke, Qin Lu, Anna Rumshisky, Wael Hamza

Recently, sequence-to-sequence models with attention have been successfully applied in Text-to-speech (TTS). These models can generate near-human speech with a large accurately-transcribed speech corpus. However, preparing such a large…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-12 Haitong Zhang, Yue Lin

Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. To date, most of its applications focused on only one task and not much work explored this framework for multiple tasks. This paper examines three…

Machine Learning · Computer Science 2016-03-02 Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser

We present an ensemble-driven self-training framework for unsupervised neural machine translation (UNMT). Starting from a primary language pair, we train multiple UNMT models that share the same translation task but differ in an auxiliary…

Computation and Language · Computer Science 2026-03-19 Ido Aharon, Jonathan Shaki, Sarit Kraus

Sequence-to-Sequence (seq2seq) modeling has rapidly become an important general-purpose NLP tool that has proven effective for many text-generation and sequence-labeling tasks. Seq2seq builds on deep neural language modeling and inherits…

Computation and Language · Computer Science 2016-11-11 Sam Wiseman, Alexander M. Rush

We describe an unsupervised method to create pseudo-parallel corpora for machine translation (MT) from unaligned text. We use multilingual BERT to create source and target sentence embeddings for nearest-neighbor search and adapt the model…

Computation and Language · Computer Science 2020-10-16 Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith

Neural text-to-speech (TTS) models can synthesize natural human speech when trained on large amounts of transcribed speech. However, collecting such large-scale transcribed data is expensive. This paper proposes an unsupervised pre-training…

Audio and Speech Processing · Electrical Eng. & Systems 2023-03-29 Seongyeon Park, Myungseo Song, Bohyung Kim, Tae-Hyun Oh