English
Related papers

Related papers: Rethinking Encoder-Decoder Flow Through Shared Str…

200 papers

Transformer-based encoder-decoder models produce a fused token-wise representation after every encoder layer. We investigate the effects of allowing the encoder to preserve and explore alternative hypotheses, combined at the end of the…

Computation and Language · Computer Science 2021-07-23 Mikhail Burtsev , Anna Rumshisky

Multi-task learning of dense prediction tasks, by sharing both the encoder and decoder, as opposed to sharing only the encoder, provides an attractive front to increase both accuracy and computational efficiency. When the tasks are similar,…

Computer Vision and Pattern Recognition · Computer Science 2022-08-22 Naresh Kumar Gurulingan , Elahe Arani , Bahram Zonooz

In this paper, we present a novel deep learning architecture for infrared and visible images fusion problem. In contrast to conventional convolutional networks, our encoding network is combined by convolutional layers, fusion layer and…

Computer Vision and Pattern Recognition · Computer Science 2019-01-23 Hui Li , Xiao-Jun Wu

Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers. To address these limitations, we propose a new architecture based on a decoder…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Gabriel L. Oliveira , Senthil Yogamani , Wolfram Burgard , Thomas Brox

The Encoder-Decoder architecture is a main stream deep learning model for biomedical image segmentation. The encoder fully compresses the input and generates encoded features, and the decoder then produces dense predictions using encoded…

Computer Vision and Pattern Recognition · Computer Science 2019-01-16 Peixian Liang , Jianxu Chen , Hao Zheng , Lin Yang , Yizhe Zhang , Danny Z. Chen

Deep network architectures struggle to continually learn new tasks without forgetting the previous tasks. A recent trend indicates that dynamic architectures based on an expansion of the parameters can reduce catastrophic forgetting…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Arthur Douillard , Alexandre Ramé , Guillaume Couairon , Matthieu Cord

Discrete diffusion models enable parallel token sampling for faster inference than autoregressive approaches. However, prior diffusion models use a decoder-only architecture, which requires sampling algorithms that invoke the full network…

Machine Learning · Computer Science 2025-10-28 Marianne Arriola , Yair Schiff , Hao Phung , Aaron Gokaslan , Volodymyr Kuleshov

In this paper we derive an efficient algorithm to learn the parameters of structured predictors in general graphical models. This algorithm blends the learning and inference tasks, which results in a significant speedup over traditional…

Machine Learning · Computer Science 2013-09-02 Tamir Hazan , Alexander Schwing , David McAllester , Raquel Urtasun

Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the…

Computer Vision and Pattern Recognition · Computer Science 2017-10-23 Daniel E. Worrall , Stephan J. Garbin , Daniyar Turmukhambetov , Gabriel J. Brostow

In computer vision and image processing tasks, image fusion has evolved into an attractive research field. However, recent existing image fusion methods are mostly built on pixel-level operations, which may produce unacceptable artifacts…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Lihua Jian , Xiaomin Yang , Zheng Liu , Gwanggil Jeon , Mingliang Gao , David Chisholm

The traditional SegNet architecture commonly encounters significant information loss during the sampling process, which detrimentally affects its accuracy in image semantic segmentation tasks. To counter this challenge, we introduce an…

Image and Video Processing · Electrical Eng. & Systems 2024-06-05 Zijun Gao , Qi Wang , Taiyuan Mei , Xiaohan Cheng , Yun Zi , Haowei Yang

Encoder-decoder networks with attention have proven to be a powerful way to solve many sequence-to-sequence tasks. In these networks, attention aligns encoder and decoder states and is often used for visualizing network behavior. However,…

Machine Learning · Computer Science 2021-10-29 Kyle Aitken , Vinay V Ramasesh , Yuan Cao , Niru Maheswaranathan

We present a new versatile building block for deep point cloud processing architectures that is equally suited for diverse tasks. This building block combines the ideas of spatial transformers and multi-view convolutional networks with the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-05 Kirill Mazur , Victor Lempitsky

Learning deeper models is usually a simple and effective approach to improve model performance, but deeper models have larger model parameters and are more difficult to train. To get a deeper model, simply stacking more layers of the model…

Computation and Language · Computer Science 2021-08-27 GuoLiang Li , Yiyang Li

There have been significant efforts to interpret the encoder of Transformer-based encoder-decoder architectures for neural machine translation (NMT); meanwhile, the decoder remains largely unexamined despite its critical role. During…

Computation and Language · Computer Science 2020-10-07 Yilin Yang , Longyue Wang , Shuming Shi , Prasad Tadepalli , Stefan Lee , Zhaopeng Tu

The transformer architecture by Vaswani et al. (2017) is now ubiquitous across application domains, from natural language processing to speech processing and image understanding. We propose DenseFormer, a simple modification to the standard…

Computation and Language · Computer Science 2024-03-22 Matteo Pagliardini , Amirkeivan Mohtashami , Francois Fleuret , Martin Jaggi

Neural style transfer has drawn broad attention in recent years. However, most existing methods aim to explicitly model the transformation between different styles, and the learned model is thus not generalizable to new styles. We here…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Yexun Zhang , Ya Zhang , Wenbin Cai , Jie Chang

We present an architecture that is effective for continual learning in an especially demanding setting, where task boundaries do not exist or are unknown, and where classes have to be learned online (with each example presented only once).…

Machine Learning · Computer Science 2021-10-08 Murray Shanahan , Christos Kaplanis , Jovana Mitrović

Decoder-based transformers, while revolutionizing language modeling and scaling to immense sizes, have not completely overtaken encoder-heavy architectures in natural language processing. Specifically, encoder-only models remain dominant in…

Computation and Language · Computer Science 2025-03-05 Paul Suganthan , Fedor Moiseev , Le Yan , Junru Wu , Jianmo Ni , Jay Han , Imed Zitouni , Enrique Alfonseca , Xuanhui Wang , Zhe Dong

Surface crack segmentation poses a challenging computer vision task as background, shape, colour and size of cracks vary. In this work we propose optimized deep encoder-decoder methods consisting of a combination of techniques which yield…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Jacob König , Mark Jenkins , Mike Mannion , Peter Barrie , Gordon Morison
‹ Prev 1 2 3 10 Next ›