Multi-Task Learning for Visual Scene Understanding

Simon Vandenhende

Multi-Task Learning for Visual Scene Understanding

Computer Vision and Pattern Recognition 2022-03-29 v1 Artificial Intelligence Machine Learning

Authors: Simon Vandenhende

Abstract

Despite the recent progress in deep learning, most approaches still go for a silo-like solution, focusing on learning each task in isolation: training a separate neural network for each individual task. Many real-world problems, however, call for a multi-modal approach and, therefore, for multi-tasking models. Multi-task learning (MTL) aims to leverage useful information across tasks to improve the generalization capability of a model. This thesis is concerned with multi-task learning in the context of computer vision. First, we review existing approaches for MTL. Next, we propose several methods that tackle important aspects of multi-task learning. The proposed methods are evaluated on various benchmarks. The results show several advances in the state-of-the-art of multi-task learning. Finally, we discuss several possibilities for future work.

Keywords

multi-task learning multimodal learning unified modeling language

Cite

@article{arxiv.2203.14896,
  title  = {Multi-Task Learning for Visual Scene Understanding},
  author = {Simon Vandenhende},
  journal= {arXiv preprint arXiv:2203.14896},
  year   = {2022}
}

Comments

PhD Thesis

Multi-Task Learning for Visual Scene Understanding

Abstract

Keywords

Cite

Comments

Related papers