Related papers: Rethinking Encoder-Decoder Flow Through Shared Str…

Multi-Stream Transformers

Transformer-based encoder-decoder models produce a fused token-wise representation after every encoder layer. We investigate the effects of allowing the encoder to preserve and explore alternative hypotheses, combined at the end of the…

Computation and Language · Computer Science 2021-07-23 Mikhail Burtsev , Anna Rumshisky

Curbing Task Interference using Representation Similarity-Guided Multi-Task Feature Sharing

Multi-task learning of dense prediction tasks, by sharing both the encoder and decoder, as opposed to sharing only the encoder, provides an attractive front to increase both accuracy and computational efficiency. When the tasks are similar,…

Computer Vision and Pattern Recognition · Computer Science 2022-08-22 Naresh Kumar Gurulingan , Elahe Arani , Bahram Zonooz

DenseFuse: A Fusion Approach to Infrared and Visible Images

In this paper, we present a novel deep learning architecture for infrared and visible images fusion problem. In contrast to conventional convolutional networks, our encoding network is combined by convolutional layers, fusion layer and…

Computer Vision and Pattern Recognition · Computer Science 2019-01-23 Hui Li , Xiao-Jun Wu

Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic Image Segmentation

Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers. To address these limitations, we propose a new architecture based on a decoder…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Gabriel L. Oliveira , Senthil Yogamani , Wolfram Burgard , Thomas Brox

Cascade Decoder: A Universal Decoding Method for Biomedical Image Segmentation

The Encoder-Decoder architecture is a main stream deep learning model for biomedical image segmentation. The encoder fully compresses the input and generates encoded features, and the decoder then produces dense predictions using encoded…

Computer Vision and Pattern Recognition · Computer Science 2019-01-16 Peixian Liang , Jianxu Chen , Hao Zheng , Lin Yang , Yizhe Zhang , Danny Z. Chen

DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion

Deep network architectures struggle to continually learn new tasks without forgetting the previous tasks. A recent trend indicates that dynamic architectures based on an expansion of the parameters can reduce catastrophic forgetting…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Arthur Douillard , Alexandre Ramé , Guillaume Couairon , Matthieu Cord

Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Discrete diffusion models enable parallel token sampling for faster inference than autoregressive approaches. However, prior diffusion models use a decoder-only architecture, which requires sampling algorithms that invoke the full network…

Machine Learning · Computer Science 2025-10-28 Marianne Arriola , Yair Schiff , Hao Phung , Aaron Gokaslan , Volodymyr Kuleshov

Blending Learning and Inference in Structured Prediction

In this paper we derive an efficient algorithm to learn the parameters of structured predictors in general graphical models. This algorithm blends the learning and inference tasks, which results in a significant speedup over traditional…

Machine Learning · Computer Science 2013-09-02 Tamir Hazan , Alexander Schwing , David McAllester , Raquel Urtasun

Interpretable Transformations with Encoder-Decoder Networks

Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the…

Computer Vision and Pattern Recognition · Computer Science 2017-10-23 Daniel E. Worrall , Stephan J. Garbin , Daniyar Turmukhambetov , Gabriel J. Brostow

A Symmetric Encoder-Decoder with Residual Block for Infrared and Visible Image Fusion

In computer vision and image processing tasks, image fusion has evolved into an attractive research field. However, recent existing image fusion methods are mostly built on pixel-level operations, which may produce unacceptable artifacts…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Lihua Jian , Xiaomin Yang , Zheng Liu , Gwanggil Jeon , Mingliang Gao , David Chisholm

An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation

The traditional SegNet architecture commonly encounters significant information loss during the sampling process, which detrimentally affects its accuracy in image semantic segmentation tasks. To counter this challenge, we introduce an…

Image and Video Processing · Electrical Eng. & Systems 2024-06-05 Zijun Gao , Qi Wang , Taiyuan Mei , Xiaohan Cheng , Yun Zi , Haowei Yang

Understanding How Encoder-Decoder Architectures Attend

Encoder-decoder networks with attention have proven to be a powerful way to solve many sequence-to-sequence tasks. In these networks, attention aligns encoder and decoder states and is often used for visualizing network behavior. However,…

Machine Learning · Computer Science 2021-10-29 Kyle Aitken , Vinay V Ramasesh , Yuan Cao , Niru Maheswaranathan

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

We present a new versatile building block for deep point cloud processing architectures that is equally suited for diverse tasks. This building block combines the ideas of spatial transformers and multi-view convolutional networks with the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-05 Kirill Mazur , Victor Lempitsky

Recurrent multiple shared layers in Depth for Neural Machine Translation

Learning deeper models is usually a simple and effective approach to improve model performance, but deeper models have larger model parameters and are more difficult to train. To get a deeper model, simply stacking more layers of the model…

Computation and Language · Computer Science 2021-08-27 GuoLiang Li , Yiyang Li

On the Sub-Layer Functionalities of Transformer Decoder

There have been significant efforts to interpret the encoder of Transformer-based encoder-decoder architectures for neural machine translation (NMT); meanwhile, the decoder remains largely unexamined despite its critical role. During…

Computation and Language · Computer Science 2020-10-07 Yilin Yang , Longyue Wang , Shuming Shi , Prasad Tadepalli , Stefan Lee , Zhaopeng Tu

DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging

The transformer architecture by Vaswani et al. (2017) is now ubiquitous across application domains, from natural language processing to speech processing and image understanding. We propose DenseFormer, a simple modification to the standard…

Computation and Language · Computer Science 2024-03-22 Matteo Pagliardini , Amirkeivan Mohtashami , Francois Fleuret , Martin Jaggi

Separating Style and Content for Generalized Style Transfer

Neural style transfer has drawn broad attention in recent years. However, most existing methods aim to explicitly model the transformation between different styles, and the learned model is thus not generalizable to new styles. We here…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Yexun Zhang , Ya Zhang , Wenbin Cai , Jie Chang

Encoders and Ensembles for Task-Free Continual Learning

We present an architecture that is effective for continual learning in an especially demanding setting, where task boundaries do not exist or are unknown, and where classes have to be learned online (with each example presented only once).…

Machine Learning · Computer Science 2021-10-08 Murray Shanahan , Christos Kaplanis , Jovana Mitrović

Adapting Decoder-Based Language Models for Diverse Encoder Downstream Tasks

Decoder-based transformers, while revolutionizing language modeling and scaling to immense sizes, have not completely overtaken encoder-heavy architectures in natural language processing. Specifically, encoder-only models remain dominant in…

Computation and Language · Computer Science 2025-03-05 Paul Suganthan , Fedor Moiseev , Le Yan , Junru Wu , Jianmo Ni , Jay Han , Imed Zitouni , Enrique Alfonseca , Xuanhui Wang , Zhe Dong

Optimized Deep Encoder-Decoder Methods for Crack Segmentation

Surface crack segmentation poses a challenging computer vision task as background, shape, colour and size of cracks vary. In this work we propose optimized deep encoder-decoder methods consisting of a combination of techniques which yield…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Jacob König , Mark Jenkins , Mike Mannion , Peter Barrie , Gordon Morison