Related papers: Sequential Cross Attention Based Multi-task Learni…

Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth

Multi-task learning (MTL) paradigm focuses on jointly learning two or more tasks, aiming for significant improvement w.r.t model's generalizability, performance, and training/inference memory footprint. The aforementioned benefits become…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Nitin Bansal , Pan Ji , Junsong Yuan , Yi Xu

DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning

Multi-task learning has recently emerged as a promising solution for a comprehensive understanding of complex scenes. In addition to being memory-efficient, multi-task models, when appropriately designed, can facilitate the exchange of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Ivan Lopes , Tuan-Hung Vu , Raoul de Charette

Cross-Task Attention Network: Improving Multi-Task Learning for Medical Imaging Applications

Multi-task learning (MTL) is a powerful approach in deep learning that leverages the information from multiple tasks during training to improve model performance. In medical imaging, MTL has shown great potential to solve various tasks.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Sangwook Kim , Thomas G. Purdie , Chris McIntosh

End-to-End Multi-Task Learning with Attention

We propose a novel multi-task learning architecture, which allows learning of task-specific feature-level attention. Our design, the Multi-Task Attention Network (MTAN), consists of a single shared network containing a global feature pool,…

Computer Vision and Pattern Recognition · Computer Science 2019-04-08 Shikun Liu , Edward Johns , Andrew J. Davison

3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding

This paper addresses the challenge of training a single network to jointly perform multiple dense prediction tasks, such as segmentation and depth estimation, i.e., multi-task learning (MTL). Current approaches mainly capture cross-task…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Xiaoye Wang , Chen Tang , Xiangyu Yue , Wei-Hong Li

Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection

Recent object detection methods have made remarkable progress by leveraging attention mechanisms to improve feature discriminability. However, most existing approaches are confined to refining single-layer or fusing dual-layer features,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-17 Dingzhou Xie , Rushi Lan , Cheng Pang , Enhao Ning , Jiahao Zeng , Wei Zheng

SGW-based Multi-Task Learning in Vision Tasks

Multi-task-learning(MTL) is a multi-target optimization task. Neural networks try to realize each target using a shared interpretative space within MTL. However, as the scale of datasets expands and the complexity of tasks increases,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Ruiyuan Zhang , Yuyao Chen , Yuchi Huo , Jiaxiang Liu , Dianbing Xi , Jie Liu , Chao Wu

Cross-Task Affinity Learning for Multitask Dense Scene Predictions

Multitask learning (MTL) has become prominent for its ability to predict multiple tasks jointly, achieving better per-task performance with fewer parameters than single-task learning. Recently, decoder-focused architectures have…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Dimitrios Sinodinos , Narges Armanfard

Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention

Convolutional neural networks (CNNs) and vision transformers (ViTs) have achieved remarkable success in various vision tasks. However, many architectures do not consider interactions between feature maps from different stages and scales,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Liang Shang , Yanli Liu , Zhengyang Lou , Shuxue Quan , Nagesh Adluru , Bochen Guan , William A. Sethares

Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models

Large Language Models (LLMs) possess remarkable generalization capabilities but struggle with multi-task adaptation, particularly in balancing knowledge retention with task-specific specialization. Conventional fine-tuning methods suffer…

Artificial Intelligence · Computer Science 2025-10-21 Dayan Pan , Zhaoyang Fu , Jingyuan Wang , Xiao Han , Yue Zhu , Xiangyu Zhao

RATM: Recurrent Attentive Tracking Model

We present an attention-based modular neural framework for computer vision. The framework uses a soft attention mechanism allowing models to be trained with gradient descent. It consists of three modules: a recurrent attention module…

Machine Learning · Computer Science 2016-04-29 Samira Ebrahimi Kahou , Vincent Michalski , Roland Memisevic

Multi-Task Learning via Co-Attentive Sharing for Pedestrian Attribute Recognition

Learning to predict multiple attributes of a pedestrian is a multi-task learning problem. To share feature representation between two individual task networks, conventional methods like Cross-Stitch and Sluice network learn a linear…

Computer Vision and Pattern Recognition · Computer Science 2020-04-08 Haitian Zeng , Haizhou Ai , Zijie Zhuang , Long Chen

Multi-layer Learnable Attention Mask for Multimodal Tasks

While the Self-Attention mechanism in the Transformer model has proven to be effective in many domains, we observe that it is less effective in more diverse settings (e.g. multimodality) due to the varying granularity of each token and the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-06 Wayner Barrios , SouYoung Jin

Cross-Modal Self-Attention Network for Referring Image Segmentation

We consider the problem of referring image segmentation. Given an input image and a natural language expression, the goal is to segment the object referred by the language expression in the image. Existing works in this area treat the…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Linwei Ye , Mrigank Rochan , Zhi Liu , Yang Wang

Sequential Attention Module for Natural Language Processing

Recently, large pre-trained neural language models have attained remarkable performance on many downstream natural language processing (NLP) applications via fine-tuning. In this paper, we target at how to further improve the token…

Artificial Intelligence · Computer Science 2021-09-08 Mengyuan Zhou , Jian Ma , Haiqin Yang , Lianxin Jiang , Yang Mo

Efficient Multi-Scale Attention Module with Cross-Spatial Learning

Remarkable effectiveness of the channel or spatial attention mechanisms for producing more discernible feature representation are illustrated in various computer vision tasks. However, modeling the cross-channel relationships with channel…

Computer Vision and Pattern Recognition · Computer Science 2023-06-07 Daliang Ouyang , Su He , Guozhong Zhang , Mingzhu Luo , Huaiyong Guo , Jian Zhan , Zhijie Huang

DRTAM: Dual Rank-1 Tensor Attention Module

Recently, attention mechanisms have been extensively investigated in computer vision, but few of them show excellent performance on both large and mobile networks. This paper proposes Dual Rank-1 Tensor Attention Module (DRTAM), a novel…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Hanxing Chi , Baihong Lin , Jun Hu , Liang Wang

SCAM! Transferring humans between images with Semantic Cross Attention Modulation

A large body of recent work targets semantically conditioned image generation. Most such methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer that consists in not only transferring the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Nicolas Dufour , David Picard , Vicky Kalogeiton

Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers

Recently, multiple architectures has been proposed to improve the efficiency of the Transformer Language Models through changing the design of the self-attention block to have a linear-cost inference (LCI). A notable approach in this realm…

Computation and Language · Computer Science 2024-04-04 Sehyun Choi

Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond

Multi-Task Learning (MTL) is a framework, where multiple related tasks are learned jointly and benefit from a shared representation space, or parameter transfer. To provide sufficient learning support, modern MTL uses annotated data with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Dimitrios Kollias , Viktoriia Sharmanska , Stefanos Zafeiriou