Related papers: A Dynamic Feature Interaction Framework for Multi-…

M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving

The perception system for autonomous driving generally requires to handle multiple diverse sub-tasks. However, current algorithms typically tackle individual sub-tasks separately, which leads to low efficiency when aiming at obtaining…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Xuesong Chen , Shaoshuai Shi , Tao Ma , Jingqiu Zhou , Simon See , Ka Chun Cheung , Hongsheng Li

Driving Scene Perception Network: Real-time Joint Detection, Depth Estimation and Semantic Segmentation

As the demand for enabling high-level autonomous driving has increased in recent years and visual perception is one of the critical features to enable fully autonomous driving, in this paper, we introduce an efficient approach for…

Computer Vision and Pattern Recognition · Computer Science 2018-03-13 Liangfu Chen , Zeng Yang , Jianjun Ma , Zheng Luo

Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance

Scene understanding plays a critical role in enabling intelligence and autonomy in robotic systems. Traditional approaches often face challenges, including occlusions, ambiguous boundaries, and the inability to adapt attention based on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Guodong Sun , Junjie Liu , Gaoyang Zhang , Bo Wu , Yang Zhang

Context-Aware Multi-Task Learning for Traffic Scene Recognition in Autonomous Vehicles

Traffic scene recognition, which requires various visual classification tasks, is a critical ingredient in autonomous vehicles. However, most existing approaches treat each relevant task independently from one another, never considering the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-06 Younkwan Lee , Jihyo Jeon , Jongmin Yu , Moongu Jeon

3D Instance Segmentation via Multi-Task Metric Learning

We propose a novel method for instance label segmentation of dense 3D voxel grids. We target volumetric scene representations, which have been acquired with depth sensors or multi-view stereo methods and which have been processed with…

Computer Vision and Pattern Recognition · Computer Science 2019-11-04 Jean Lahoud , Bernard Ghanem , Marc Pollefeys , Martin R. Oswald

DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning

Multi-task learning has recently emerged as a promising solution for a comprehensive understanding of complex scenes. In addition to being memory-efficient, multi-task models, when appropriately designed, can facilitate the exchange of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Ivan Lopes , Tuan-Hung Vu , Raoul de Charette

Multi-view PointNet for 3D Scene Understanding

Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds. However, fusion is challenging because 2D and 3D data live in different spaces. In this work, we propose MVPNet…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Maximilian Jaritz , Jiayuan Gu , Hao Su

Multi-Task Consistency for Active Learning

Learning-based solutions for vision tasks require a large amount of labeled training data to ensure their performance and reliability. In single-task vision-based settings, inconsistency-based active learning has proven to be effective in…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Aral Hekimoglu , Philipp Friedrich , Walter Zimmer , Michael Schmidt , Alvaro Marcos-Ramiro , Alois C. Knoll

A Multi-Task Deep Learning Approach for Sensor-based Human Activity Recognition and Segmentation

Sensor-based human activity segmentation and recognition are two important and challenging problems in many real-world applications and they have drawn increasing attention from the deep learning community in recent years. Most of the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Furong Duan , Tao Zhu , Jinqiang Wang , Liming Chen , Huansheng Ning , Yaping Wan

Attentive Feature Reuse for Multi Task Meta learning

We develop new algorithms for simultaneous learning of multiple tasks (e.g., image classification, depth estimation), and for adapting to unseen task/domain distributions within those high-level tasks (e.g., different environments). First,…

Machine Learning · Computer Science 2020-06-16 Kiran Lekkala , Laurent Itti

Multi-task learning from fixed-wing UAV images for 2D/3D city modeling

Single-task learning in artificial neural networks will be able to learn the model very well, and the benefits brought by transferring knowledge thus become limited. In this regard, when the number of tasks increases (e.g., semantic…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Mohammad R. Bayanlou , Mehdi Khoshboresh-Masouleh

Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks

Deploying multiple machine learning models on resource-constrained robotic platforms for different perception tasks often results in redundant computations, large memory footprints, and complex integration challenges. In response, this work…

Robotics · Computer Science 2025-08-19 Jakub Łucki , Jonathan Becktor , Georgios Georgakis , Rob Royce , Shehryar Khattak

Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

In driving scenarios, automobile active safety systems are increasingly incorporating deep learning technology. These systems typically need to handle multiple tasks simultaneously, such as detecting fatigue driving and recognizing the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 Shulei Qu , Zhenguo Gao , Xiaowei Chen , Na Li , Yakai Wang , Xiaoxiao Wu

UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks

Scene understanding is crucial for autonomous systems which intend to operate in the real world. Single task vision networks extract information only based on some aspects of the scene. In multi-task learning (MTL), on the other hand, these…

Computer Vision and Pattern Recognition · Computer Science 2022-08-15 Naresh Kumar Gurulingan , Elahe Arani , Bahram Zonooz

HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving

Three-dimensional feature extraction is a critical component of autonomous driving systems, where perception tasks such as 3D object detection, bird's-eye-view (BEV) semantic segmentation, and occupancy prediction serve as important…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Zhongyu Xia , Zhiwei Lin , Yongtao Wang , Ming-Hsuan Yang

SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments

Accurate perception of dynamic traffic scenes is crucial for high-level autonomous driving systems, requiring robust object motion estimation and instance segmentation. However, traditional methods often treat them as separate tasks,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Yinqi Chen , Meiying Zhang , Qi Hao , Guang Zhou

LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception

LiDAR-based 3D object detection, semantic segmentation, and panoptic segmentation are usually implemented in specialized networks with distinctive architectures that are difficult to adapt to each other. This paper presents LidarMultiNet, a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Dongqiangzi Ye , Zixiang Zhou , Weijia Chen , Yufei Xie , Yu Wang , Panqu Wang , Hassan Foroosh

Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation

This report serves as a supplementary document for TaskPrompter, detailing its implementation on a new joint 2D-3D multi-task learning benchmark based on Cityscapes-3D. TaskPrompter presents an innovative multi-task prompting framework that…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Hanrong Ye , Dan Xu

JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields

Deep learning techniques have become the to-go models for most vision-related tasks on 2D images. However, their power has not been fully realised on several tasks in 3D space, e.g., 3D scene understanding. In this work, we jointly address…

Computer Vision and Pattern Recognition · Computer Science 2019-04-08 Quang-Hieu Pham , Duc Thanh Nguyen , Binh-Son Hua , Gemma Roig , Sai-Kit Yeung

BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions

Multi-task dense prediction aims at handling multiple pixel-wise prediction tasks within a unified network simultaneously for visual scene understanding. However, cross-task feature interactions of current methods are still suffering from…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Jingdong Zhang , Jiayuan Fan , Peng Ye , Bo Zhang , Hancheng Ye , Baopu Li , Yancheng Cai , Tao Chen