Related papers: Multimodal Continuous Emotion Recognition using De…

Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition

The choice of a loss function is a critical part of machine learning. This paper evaluated two different loss functions commonly used in regression-task dimensional speech emotion recognition, an error-based and a correlation-based loss…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-22 Bagus Tris Atmaja , Masato Akagi

Towards Speech Emotion Recognition "in the wild" using Aggregated Corpora and Deep Multi-Task Learning

One of the challenges in Speech Emotion Recognition (SER) "in the wild" is the large mismatch between training and test data (e.g. speakers and tasks). In order to improve the generalisation capabilities of the emotion models, we propose to…

Computation and Language · Computer Science 2017-08-15 Jaebok Kim , Gwenn Englebienne , Khiet P. Truong , Vanessa Evers

Lessons Learnt: Revisit Key Training Strategies for Effective Speech Emotion Recognition in the Wild

In this study, we revisit key training strategies in machine learning often overlooked in favor of deeper architectures. Specifically, we explore balancing strategies, activation functions, and fine-tuning techniques to enhance speech…

Audio and Speech Processing · Electrical Eng. & Systems 2025-09-26 Jing-Tong Tzeng , Bo-Hao Su , Ya-Tse Wu , Hsing-Hang Chou , Chi-Chun Lee

Enhancing Speech Emotion Recognition with Multi-Task Learning and Dynamic Feature Fusion

This study investigates fine-tuning self-supervised learn ing (SSL) models using multi-task learning (MTL) to enhance speech emotion recognition (SER). The framework simultane ously handles four related tasks: emotion recognition, gender…

Sound · Computer Science 2025-08-26 Honghong Wang , Jing Deng , Fanqin Meng , Rong Zheng

Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis

Surgical tool presence detection and surgical phase recognition are two fundamental yet challenging tasks in surgical video analysis and also very essential components in various applications in modern operating rooms. While these two…

Computer Vision and Pattern Recognition · Computer Science 2019-07-16 Yueming Jin , Huaxia Li , Qi Dou , Hao Chen , Jing Qin , Chi-Wing Fu , Pheng-Ann Heng

Accommodating Missing Modalities in Time-Continuous Multimodal Emotion Recognition

Decades of research indicate that emotion recognition is more effective when drawing information from multiple modalities. But what if some modalities are sometimes missing? To address this problem, we propose a novel Transformer-based…

Machine Learning · Computer Science 2023-11-20 Juan Vazquez-Rodriguez , Grégoire Lefebvre , Julien Cumin , James L. Crowley

Combining Deep Transfer Learning with Signal-image Encoding for Multi-Modal Mental Wellbeing Classification

The quantification of emotional states is an important step to understanding wellbeing. Time series data from multiple modalities such as physiological and motion sensor data have proven to be integral for measuring and quantifying…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Kieran Woodward , Eiman Kanjo , Athanasios Tsanas

Multi-Task Learning Framework for Emotion Recognition in-the-wild

This paper presents our system for the Multi-Task Learning (MTL) Challenge in the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. We explore the research problems of this challenge from three aspects: 1) For obtaining…

Computer Vision and Pattern Recognition · Computer Science 2022-08-31 Tenggan Zhang , Chuanhe Liu , Xiaolong Liu , Yuchen Liu , Liyu Meng , Lei Sun , Wenqiang Jiang , Fengyuan Zhang , Jinming Zhao , Qin Jin

MCN-CL: Multimodal Cross-Attention Network and Contrastive Learning for Multimodal Emotion Recognition

Multimodal emotion recognition plays a key role in many domains, including mental health monitoring, educational interaction, and human-computer interaction. However, existing methods often face three major challenges: unbalanced category…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Feng Li , Ke Wu , Yongwei Li

Semi-Supervised Speech Emotion Recognition with Ladder Networks

Speech emotion recognition (SER) systems find applications in various fields such as healthcare, education, and security and defense. A major drawback of these systems is their lack of generalization across different conditions. This…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-15 Srinivas Parthasarathy , Carlos Busso

Continuous Emotion Recognition via Deep Convolutional Autoencoder and Support Vector Regressor

Automatic facial expression recognition is an important research area in the emotion recognition and computer vision. Applications can be found in several domains such as medical treatment, driver fatigue surveillance, sociable robotics,…

Computer Vision and Pattern Recognition · Computer Science 2020-02-03 Sevegni Odilon Clement Allognon , Alessandro L. Koerich , Alceu de S. Britto

Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition

Due to its ability to accurately predict emotional state using multimodal features, audiovisual emotion recognition has recently gained more interest from researchers. This paper proposes two methods to predict emotional attributes from…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-22 Bagus Tris Atmaja , Masato Akagi

Multimodal Functional Maximum Correlation for Emotion Recognition

Emotional states manifest as coordinated yet heterogeneous physiological responses across central and autonomic systems, posing a fundamental challenge for multimodal representation learning in affective computing. Learning such joint…

Machine Learning · Computer Science 2026-05-26 Deyang Zheng , Tianyi Zhang , Wenming Zheng , Shujian Yu

Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond

Multi-Task Learning (MTL) is a framework, where multiple related tasks are learned jointly and benefit from a shared representation space, or parameter transfer. To provide sufficient learning support, modern MTL uses annotated data with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Dimitrios Kollias , Viktoriia Sharmanska , Stefanos Zafeiriou

Cross-Task Inconsistency Based Active Learning (CTIAL) for Emotion Recognition

Emotion recognition is a critical component of affective computing. Training accurate machine learning models for emotion recognition typically requires a large amount of labeled data. Due to the subtleness and complexity of emotions,…

Machine Learning · Computer Science 2024-12-03 Yifan Xu , Xue Jiang , Dongrui Wu

Multi-task, multi-label and multi-domain learning with residual convolutional networks for emotion recognition

Automated emotion recognition in the wild from facial images remains a challenging problem. Although recent advances in Deep Learning have supposed a significant breakthrough in this topic, strong changes in pose, orientation and point of…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Gerard Pons , David Masip

Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization

Multi-Task Learning (MTL) involves the concurrent training of multiple tasks, offering notable advantages for dense prediction tasks in computer vision. MTL not only reduces training and inference time as opposed to having multiple…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Maxime Fontana , Michael Spratling , Miaojing Shi

MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities

Although large language models (LLMs) perform well in general tasks, domain-specific applications suffer from hallucinations and accuracy limitations. Continual Pre-Training (CPT) approaches encounter two key issues: (1) domain-biased data…

Computation and Language · Computer Science 2025-05-21 Jingxue Chen , Qingkun Tang , Qianchun Lu , Siyuan Fang

End-to-End Multimodal Emotion Recognition using Deep Neural Networks

Automatic affect recognition is a challenging task due to the various modalities emotions can be expressed with. Applications can be found in many domains including multimedia retrieval and human computer interaction. In recent years, deep…

Computer Vision and Pattern Recognition · Computer Science 2018-02-14 Panagiotis Tzirakis , George Trigeorgis , Mihalis A. Nicolaou , Björn Schuller , Stefanos Zafeiriou

Multimodal Mixture of Low-Rank Experts for Sentiment Analysis and Emotion Recognition

Multi-task learning (MTL) enables the efficient transfer of extra knowledge acquired from other tasks. The high correlation between multimodal sentiment analysis (MSA) and multimodal emotion recognition (MER) supports their joint training.…

Artificial Intelligence · Computer Science 2025-05-21 Shuo Zhang , Jinsong Zhang , Zhejun Zhang , Lei Li