English

Rethinking Encoder-Decoder Flow Through Shared Structures

Computer Vision and Pattern Recognition 2025-01-27 v1 Machine Learning

Abstract

Dense prediction tasks have enjoyed a growing complexity of encoder architectures, decoders, however, have remained largely the same. They rely on individual blocks decoding intermediate feature maps sequentially. We introduce banks, shared structures that are used by each decoding block to provide additional context in the decoding process. These structures, through applying them via resampling and feature fusion, improve performance on depth estimation for state-of-the-art transformer-based architectures on natural and synthetic images whilst training on large-scale datasets.

Keywords

Cite

@article{arxiv.2501.14535,
  title  = {Rethinking Encoder-Decoder Flow Through Shared Structures},
  author = {Frederik Laboyrie and Mehmet Kerim Yucel and Albert Saa-Garriga},
  journal= {arXiv preprint arXiv:2501.14535},
  year   = {2025}
}