Related papers: Auxiliary Tasks in Multi-task Learning

MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification

We introduce MultiDepth, a novel training strategy and convolutional neural network (CNN) architecture that allows approaching single-image depth estimation (SIDE) as a multi-task problem. SIDE is an important part of road scene…

Computer Vision and Pattern Recognition · Computer Science 2019-07-26 Lukas Liebel , Marco Körner

NeurAll: Towards a Unified Visual Perception Model for Automated Driving

Convolutional Neural Networks (CNNs) are successfully used for the important automotive visual perception tasks including object recognition, motion and depth estimation, visual SLAM, etc. However, these tasks are typically independently…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Ganesh Sistu , Isabelle Leang , Sumanth Chennupati , Senthil Yogamani , Ciaran Hughes , Stefan Milz , Samir Rawashdeh

Context-Aware Multi-Task Learning for Traffic Scene Recognition in Autonomous Vehicles

Traffic scene recognition, which requires various visual classification tasks, is a critical ingredient in autonomous vehicles. However, most existing approaches treat each relevant task independently from one another, never considering the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-06 Younkwan Lee , Jihyo Jeon , Jongmin Yu , Moongu Jeon

AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

Decision making in automated driving is highly specific to the environment and thus semantic segmentation plays a key role in recognizing the objects in the environment around the car. Pixel level classification once considered a…

Computer Vision and Pattern Recognition · Computer Science 2019-03-25 Sumanth Chennupati , Ganesh Sistu , Senthil Yogamani , Samir Rawashdeh

Learning On-Road Visual Control for Self-Driving Vehicles with Auxiliary Tasks

A safe and robust on-road navigation system is a crucial component of achieving fully automated vehicles. NVIDIA recently proposed an End-to-End algorithm that can directly learn steering commands from raw pixels of a front camera by using…

Computer Vision and Pattern Recognition · Computer Science 2018-12-20 Yilun Chen , Praveen Palanisamy , Priyantha Mudalige , Katharina Muelling , John M. Dolan

YUVMultiNet: Real-time YUV multi-task CNN for autonomous driving

In this paper, we propose a multi-task convolutional neural network (CNN) architecture optimized for a low power automotive grade SoC. We introduce a network based on a unified architecture where the encoder is shared among the two tasks…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Thomas Boulay , Said El-Hachimi , Mani Kumar Surisetti , Pullarao Maddu , Saranya Kandan

Attentive Feature Reuse for Multi Task Meta learning

We develop new algorithms for simultaneous learning of multiple tasks (e.g., image classification, depth estimation), and for adapting to unseen task/domain distributions within those high-level tasks (e.g., different environments). First,…

Machine Learning · Computer Science 2020-06-16 Kiran Lekkala , Laurent Itti

Graph-Based Neural Network Models with Multiple Self-Supervised Auxiliary Tasks

Self-supervised learning is currently gaining a lot of attention, as it allows neural networks to learn robust representations from large quantities of unlabeled data. Additionally, multi-task learning can further improve representation…

Machine Learning · Computer Science 2020-12-07 Franco Manessi , Alessandro Rozza

Multi-task Dictionary Learning based Convolutional Neural Network for Computer aided Diagnosis with Longitudinal Images

Algorithmic image-based diagnosis and prognosis of neurodegenerative diseases on longitudinal data has drawn great interest from computer vision researchers. The current state-of-the-art models for many image classification tasks are based…

Computer Vision and Pattern Recognition · Computer Science 2017-09-04 Jie Zhang , Qingyang Li , Richard J. Caselli , Jieping Ye , Yalin Wang

Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks

Multi-scale deep CNNs have been used successfully for problems mapping each pixel to a label, such as depth estimation and semantic segmentation. It has also been shown that such architectures are reusable and can be used for multiple…

Computer Vision and Pattern Recognition · Computer Science 2016-09-21 Arsalan Mousavian , Hamed Pirsiavash , Jana Kosecka

Multi-task learning to improve natural language understanding

Recently advancements in sequence-to-sequence neural network architectures have led to an improved natural language understanding. When building a neural network-based Natural Language Understanding component, one main challenge is to…

Computation and Language · Computer Science 2019-02-18 Stefan Constantin , Jan Niehues , Alex Waibel

Learning Multi-Tasks with Inconsistent Labels by using Auxiliary Big Task

Multi-task learning is to improve the performance of the model by transferring and exploiting common knowledge among tasks. Existing MTL works mainly focus on the scenario where label sets among multiple tasks (MTs) are usually the same,…

Machine Learning · Computer Science 2022-01-10 Quan Feng , Songcan Chen

Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction

Convolutional neural networks (CNN) have made significant advances in detecting roads from satellite images. However, existing CNN approaches are generally repurposed semantic segmentation architectures and suffer from the poor delineation…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Tinghuai Wang , Guangming Wang , Kuan Eeik Tan

Deep CNN-based Multi-task Learning for Open-Set Recognition

We propose a novel deep convolutional neural network (CNN) based multi-task learning approach for open-set visual recognition. We combine a classifier network and a decoder network with a shared feature extractor network within a multi-task…

Computer Vision and Pattern Recognition · Computer Science 2019-03-11 Poojan Oza , Vishal M. Patel

End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with Visual Perception

Convolutional Neural Networks (CNN) have been successfully applied to autonomous driving tasks, many in an end-to-end manner. Previous end-to-end steering control methods take an image or an image sequence as the input and directly predict…

Computer Vision and Pattern Recognition · Computer Science 2018-02-06 Zhengyuan Yang , Yixuan Zhang , Jerry Yu , Junjie Cai , Jiebo Luo

Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks

This paper presents a comparative study of a custom convolutional neural network (CNN) architecture against widely used pretrained and transfer learning CNN models across five real-world image datasets. The datasets span binary…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Mahmudul Hasan , Mabsur Fatin Bin Hossain

Semantic-Aware Scene Recognition

Scene recognition is currently one of the top-challenging research fields in computer vision. This may be due to the ambiguity between classes: images of several scene classes may share similar objects, which causes confusion among them.…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Alejandro López-Cifuentes , Marcos Escudero-Viñolo , Jesús Bescós , Álvaro García-Martín

Self-supervised Auxiliary Learning for Graph Neural Networks via Meta-Learning

In recent years, graph neural networks (GNNs) have been widely adopted in the representation learning of graph-structured data and provided state-of-the-art performance in various applications such as link prediction, node classification,…

Machine Learning · Computer Science 2021-04-14 Dasol Hwang , Jinyoung Park , Sunyoung Kwon , Kyung-Min Kim , Jung-Woo Ha , Hyunwoo J. Kim

Training a Custom CNN on Five Heterogeneous Image Datasets

Deep learning has transformed visual data analysis, with Convolutional Neural Networks (CNNs) becoming highly effective in learning meaningful feature representations directly from images. Unlike traditional manual feature engineering…

Computer Vision and Pattern Recognition · Computer Science 2026-01-09 Anika Tabassum , Tasnuva Mahazabin Tuba , Nafisa Naznin

Improving RNN Transducer Based ASR with Auxiliary Tasks

End-to-end automatic speech recognition (ASR) models with a single neural network have recently demonstrated state-of-the-art results compared to conventional hybrid speech recognizers. Specifically, recurrent neural network transducer…

Computation and Language · Computer Science 2020-11-10 Chunxi Liu , Frank Zhang , Duc Le , Suyoun Kim , Yatharth Saraf , Geoffrey Zweig