Related papers: Encoder-Decoder Shift-Reduce Syntactic Parsing

Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

Transformer-based NLP models are powerful but have high computational costs that limit deployment. Finetuned encoder-decoder models are popular in specialized domains and can outperform larger more generalized decoder-only models, such as…

Computation and Language · Computer Science 2024-11-19 Bo-Ru Lu , Nikita Haduong , Chien-Yu Lin , Hao Cheng , Noah A. Smith , Mari Ostendorf

On the Sub-Layer Functionalities of Transformer Decoder

There have been significant efforts to interpret the encoder of Transformer-based encoder-decoder architectures for neural machine translation (NMT); meanwhile, the decoder remains largely unexamined despite its critical role. During…

Computation and Language · Computer Science 2020-10-07 Yilin Yang , Longyue Wang , Shuming Shi , Prasad Tadepalli , Stefan Lee , Zhaopeng Tu

Transition-based Semantic Dependency Parsing with Pointer Networks

Transition-based parsers implemented with Pointer Networks have become the new state of the art in dependency parsing, excelling in producing labelled syntactic trees and outperforming graph-based models in this task. In order to further…

Computation and Language · Computer Science 2020-05-29 Daniel Fernández-González , Carlos Gómez-Rodríguez

Graph Convolutional Encoders for Syntax-aware Neural Machine Translation

We present a simple and effective approach to incorporating syntactic structure into neural attention-based encoder-decoder models for machine translation. We rely on graph-convolutional networks (GCNs), a recent class of neural networks…

Computation and Language · Computer Science 2020-06-22 Jasmijn Bastings , Ivan Titov , Wilker Aziz , Diego Marcheggiani , Khalil Sima'an

Hard but Robust, Easy but Sensitive: How Encoder and Decoder Perform in Neural Machine Translation

Neural machine translation (NMT) typically adopts the encoder-decoder framework. A good understanding of the characteristics and functionalities of the encoder and decoder can help to explain the pros and cons of the framework, and design…

Computation and Language · Computer Science 2019-08-20 Tianyu He , Xu Tan , Tao Qin

Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing

To achieve deep natural language understanding, syntactic constituent parsing plays a crucial role and is widely required by many artificial intelligence systems for processing both text and speech. A recent approach involves using standard…

Computation and Language · Computer Science 2026-05-14 Daniel Fernández-González , Cristina Outeiriño Cid

Refining Source Representations with Relation Networks for Neural Machine Translation

Although neural machine translation with the encoder-decoder framework has achieved great success recently, it still suffers drawbacks of forgetting distant information, which is an inherent disadvantage of recurrent neural network…

Computation and Language · Computer Science 2018-09-12 Wen Zhang , Jiawei Hu , Yang Feng , Qun Liu

Hyperdecoders: Instance-specific decoders for multi-task NLP

We investigate input-conditioned hypernetworks for multi-tasking in NLP, generating parameter-efficient adaptations for a decoder using a hypernetwork conditioned on the output of an encoder. This approach produces a unique decoder…

Computation and Language · Computer Science 2022-10-19 Hamish Ivison , Matthew E. Peters

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder

Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees.…

Computation and Language · Computer Science 2017-07-19 Huadong Chen , Shujian Huang , David Chiang , Jiajun Chen

Graph-based Neural Modules to Inspect Attention-based Architectures: A Position Paper

Encoder-decoder architectures are prominent building blocks of state-of-the-art solutions for tasks across multiple fields where deep learning (DL) or foundation models play a key role. Although there is a growing community working on the…

Machine Learning · Computer Science 2022-10-14 Breno W. Carvalho , Artur D'Avilla Garcez , Luis C. Lamb

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

The field of neural machine translation (NMT) has changed with the advent of large language models (LLMs). Much of the recent emphasis in natural language processing (NLP) has been on modeling machine translation and many other problems…

Computation and Language · Computer Science 2025-06-03 Yingfeng Luo , Tong Zheng , Yongyu Mu , Bei Li , Qinghong Zhang , Yongqi Gao , Ziqiang Xu , Peinan Feng , Xiaoqian Liu , Tong Xiao , Jingbo Zhu

Learning Generic Sentence Representations Using Convolutional Neural Networks

We propose a new encoder-decoder approach to learn distributed sentence representations that are applicable to multiple purposes. The model is learned by using a convolutional neural network as an encoder to map an input sentence into a…

Computation and Language · Computer Science 2017-07-28 Zhe Gan , Yunchen Pu , Ricardo Henao , Chunyuan Li , Xiaodong He , Lawrence Carin

Learning to Refine Source Representations for Neural Machine Translation

Neural machine translation (NMT) models generally adopt an encoder-decoder architecture for modeling the entire translation process. The encoder summarizes the representation of input sentence from scratch, which is potentially a problem if…

Computation and Language · Computer Science 2018-12-27 Xinwei Geng , Longyue Wang , Xing Wang , Bing Qin , Ting Liu , Zhaopeng Tu

Language-Independent Representor for Neural Machine Translation

Current Neural Machine Translation (NMT) employs a language-specific encoder to represent the source sentence and adopts a language-specific decoder to generate target translation. This language-dependent design leads to large-scale network…

Computation and Language · Computer Science 2018-11-02 Long Zhou , Yuchen Liu , Jiajun Zhang , Chengqing Zong , Guoping Huang

Language Models are Good Translators

Recent years have witnessed the rapid advance in neural machine translation (NMT), the core of which lies in the encoder-decoder architecture. Inspired by the recent progress of large-scale pre-trained language models on machine translation…

Computation and Language · Computer Science 2021-06-28 Shuo Wang , Zhaopeng Tu , Zhixing Tan , Wenxuan Wang , Maosong Sun , Yang Liu

Neural Network Coding

In this paper we introduce Neural Network Coding(NNC), a data-driven approach to joint source and network coding. In NNC, the encoders at each source and intermediate node, as well as the decoder at each destination node, are neural…

Information Theory · Computer Science 2021-01-12 Litian Liu , Amit Solomon , Salman Salamatian , Muriel Medard

Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding

Context plays an important role in human language understanding, thus it may also be useful for machines learning vector representations of language. In this paper, we explore an asymmetric encoder-decoder structure for unsupervised…

Neural and Evolutionary Computing · Computer Science 2018-06-04 Shuai Tang , Hailin Jin , Chen Fang , Zhaowen Wang , Virginia R. de Sa

Is Encoder-Decoder Redundant for Neural Machine Translation?

Encoder-decoder architecture is widely adopted for sequence-to-sequence modeling tasks. For machine translation, despite the evolution from long short-term memory networks to Transformer networks, plus the introduction and development of…

Computation and Language · Computer Science 2022-10-24 Yingbo Gao , Christian Herold , Zijian Yang , Hermann Ney

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

In encoder-decoder neural models, multiple encoders are in general used to represent the contextual information in addition to the individual sentence. In this paper, we investigate multi-encoder approaches in documentlevel neural machine…

Computation and Language · Computer Science 2020-05-19 Bei Li , Hui Liu , Ziyang Wang , Yufan Jiang , Tong Xiao , Jingbo Zhu , Tongran Liu , Changliang Li

Refining Source Representations with Relation Networks for Neural Machine Translation

Although neural machine translation (NMT) with the encoder-decoder framework has achieved great success in recent times, it still suffers from some drawbacks: RNNs tend to forget old information which is often useful and the encoder only…

Computation and Language · Computer Science 2018-05-28 Wen Zhang , Jiawei Hu , Yang Feng , Qun Liu