Related papers: Deep Multimodality Model for Multi-task Multi-view…

Learning deep representation of multityped objects and tasks

We introduce a deep multitask architecture to integrate multityped representations of multimodal objects. This multitype exposition is less abstract than the multimodal characterization, but more machine-friendly, and thus is more precise…

Machine Learning · Statistics 2016-03-07 Truyen Tran , Dinh Phung , Svetha Venkatesh

Multi-Task Learning for Visual Scene Understanding

Despite the recent progress in deep learning, most approaches still go for a silo-like solution, focusing on learning each task in isolation: training a separate neural network for each individual task. Many real-world problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Simon Vandenhende

DLM-VMTL:A Double Layer Mapper for heterogeneous data video Multi-task prompt learning

In recent years, the parameters of backbones of Video Understanding tasks continue to increase and even reach billion-level. Whether fine-tuning a specific task on the Video Foundation Model or pre-training the model designed for the…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Zeyi Bo , Wuxi Sun , Ye Jin

Distribution Matching for Heterogeneous Multi-Task Learning: a Large-scale Face Study

Multi-Task Learning has emerged as a methodology in which multiple tasks are jointly learned by a shared learning algorithm, such as a DNN. MTL is based on the assumption that the tasks under consideration are related; therefore it exploits…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Dimitrios Kollias , Viktoriia Sharmanska , Stefanos Zafeiriou

Deep Co-Attention Network for Multi-View Subspace Learning

Many real-world applications involve data from multiple modalities and thus exhibit the view heterogeneity. For example, user modeling on social media might leverage both the topology of the underlying social network and the content of the…

Machine Learning · Computer Science 2021-02-16 Lecheng Zheng , Yu Cheng , Hongxia Yang , Nan Cao , Jingrui He

Universal Representations: A Unified Look at Multiple Task and Domain Learning

We propose a unified look at jointly learning multiple vision tasks and visual domains through universal representations, a single deep neural network. Learning multiple problems simultaneously involves minimizing a weighted sum of multiple…

Computer Vision and Pattern Recognition · Computer Science 2022-08-31 Wei-Hong Li , Xialei Liu , Hakan Bilen

3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding

This paper addresses the challenge of training a single network to jointly perform multiple dense prediction tasks, such as segmentation and depth estimation, i.e., multi-task learning (MTL). Current approaches mainly capture cross-task…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Xiaoye Wang , Chen Tang , Xiangyu Yue , Wei-Hong Li

Multi-View representation learning in Multi-Task Scene

Over recent decades have witnessed considerable progress in whether multi-task learning or multi-view learning, but the situation that consider both learning scenes simultaneously has received not too much attention. How to utilize multiple…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Run-kun Lu , Jian-wei Liu , Si-ming Lian , Xin Zuo

Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications

Multimodality Representation Learning, as a technique of learning to embed information from different modalities and their correlations, has achieved remarkable success on a variety of applications, such as Visual Question Answering (VQA),…

Artificial Intelligence · Computer Science 2024-03-04 Muhammad Arslan Manzoor , Sarah Albarri , Ziting Xian , Zaiqiao Meng , Preslav Nakov , Shangsong Liang

Multi-target tracking for video surveillance using deep affinity network: a brief review

Deep learning models are known to function like the human brain. Due to their functional mechanism, they are frequently utilized to accomplish tasks that require human intelligence. Multi-target tracking (MTT) for video surveillance is one…

Computer Vision and Pattern Recognition · Computer Science 2021-11-01 Sanam Nisar Mangi

Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach

Face attribute estimation has many potential applications in video surveillance, face retrieval, and social media. While a number of methods have been proposed for face attribute estimation, most of them did not explicitly consider the…

Computer Vision and Pattern Recognition · Computer Science 2017-09-29 Hu Han , Anil K. Jain , Fang Wang , Shiguang Shan , Xilin Chen

Multi-Task Meta Learning: learn how to adapt to unseen tasks

This work proposes Multi-task Meta Learning (MTML), integrating two learning paradigms Multi-Task Learning (MTL) and meta learning, to bring together the best of both worlds. In particular, it focuses simultaneous learning of multiple…

Computer Vision and Pattern Recognition · Computer Science 2023-04-27 Richa Upadhyay , Prakash Chandra Chhipa , Ronald Phlypo , Rajkumar Saini , Marcus Liwicki

Multi-Task Learning for Dense Prediction Tasks: A Survey

With the advent of deep learning, many dense prediction tasks, i.e. tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Simon Vandenhende , Stamatios Georgoulis , Wouter Van Gansbeke , Marc Proesmans , Dengxin Dai , Luc Van Gool

Multi-Task Learning with Deep Neural Networks: A Survey

Multi-task learning (MTL) is a subfield of machine learning in which multiple tasks are simultaneously learned by a shared model. Such approaches offer advantages like improved data efficiency, reduced overfitting through shared…

Machine Learning · Computer Science 2020-09-22 Michael Crawshaw

Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization

Multi-Task Learning (MTL) involves the concurrent training of multiple tasks, offering notable advantages for dense prediction tasks in computer vision. MTL not only reduces training and inference time as opposed to having multiple…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Maxime Fontana , Michael Spratling , Miaojing Shi

Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions

Multimodal deep learning systems which employ multiple modalities like text, image, audio, video, etc., are showing better performance in comparison with individual modalities (i.e., unimodal) systems. Multimodal machine learning involves…

Machine Learning · Computer Science 2022-01-19 Anil Rahate , Rahee Walambe , Sheela Ramanna , Ketan Kotecha

HetMAML: Task-Heterogeneous Model-Agnostic Meta-Learning for Few-Shot Learning Across Modalities

Existing gradient-based meta-learning approaches to few-shot learning assume that all tasks have the same input feature space. However, in the real world scenarios, there are many cases that the input structures of tasks can be different,…

Artificial Intelligence · Computer Science 2021-09-29 Jiayi Chen , Aidong Zhang

Latent Heterogeneous Graph Network for Incomplete Multi-View Learning

Multi-view learning has progressed rapidly in recent years. Although many previous studies assume that each instance appears in all views, it is common in real-world applications for instances to be missing from some views, resulting in…

Machine Learning · Computer Science 2022-08-30 Pengfei Zhu , Xinjie Yao , Yu Wang , Meng Cao , Binyuan Hui , Shuai Zhao , Qinghua Hu

An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training

We present a model that can perform multiple vision tasks and can be adapted to other downstream tasks efficiently. Despite considerable progress in multi-task learning, most efforts focus on learning from multi-label data: a single image…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Zitian Chen , Mingyu Ding , Yikang Shen , Wei Zhan , Masayoshi Tomizuka , Erik Learned-Miller , Chuang Gan

Deep Multi-Modal Sets

Many vision-related tasks benefit from reasoning over multiple modalities to leverage complementary views of data in an attempt to learn robust embedding spaces. Most deep learning-based methods rely on a late fusion technique whereby…

Computer Vision and Pattern Recognition · Computer Science 2020-03-04 Austin Reiter , Menglin Jia , Pu Yang , Ser-Nam Lim