English
Related papers

Related papers: Multimodal Continuous Emotion Recognition using De…

200 papers

The choice of a loss function is a critical part of machine learning. This paper evaluated two different loss functions commonly used in regression-task dimensional speech emotion recognition, an error-based and a correlation-based loss…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-22 Bagus Tris Atmaja , Masato Akagi

One of the challenges in Speech Emotion Recognition (SER) "in the wild" is the large mismatch between training and test data (e.g. speakers and tasks). In order to improve the generalisation capabilities of the emotion models, we propose to…

Computation and Language · Computer Science 2017-08-15 Jaebok Kim , Gwenn Englebienne , Khiet P. Truong , Vanessa Evers

In this study, we revisit key training strategies in machine learning often overlooked in favor of deeper architectures. Specifically, we explore balancing strategies, activation functions, and fine-tuning techniques to enhance speech…

Audio and Speech Processing · Electrical Eng. & Systems 2025-09-26 Jing-Tong Tzeng , Bo-Hao Su , Ya-Tse Wu , Hsing-Hang Chou , Chi-Chun Lee

This study investigates fine-tuning self-supervised learn ing (SSL) models using multi-task learning (MTL) to enhance speech emotion recognition (SER). The framework simultane ously handles four related tasks: emotion recognition, gender…

Sound · Computer Science 2025-08-26 Honghong Wang , Jing Deng , Fanqin Meng , Rong Zheng

Surgical tool presence detection and surgical phase recognition are two fundamental yet challenging tasks in surgical video analysis and also very essential components in various applications in modern operating rooms. While these two…

Computer Vision and Pattern Recognition · Computer Science 2019-07-16 Yueming Jin , Huaxia Li , Qi Dou , Hao Chen , Jing Qin , Chi-Wing Fu , Pheng-Ann Heng

Decades of research indicate that emotion recognition is more effective when drawing information from multiple modalities. But what if some modalities are sometimes missing? To address this problem, we propose a novel Transformer-based…

Machine Learning · Computer Science 2023-11-20 Juan Vazquez-Rodriguez , Grégoire Lefebvre , Julien Cumin , James L. Crowley

The quantification of emotional states is an important step to understanding wellbeing. Time series data from multiple modalities such as physiological and motion sensor data have proven to be integral for measuring and quantifying…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Kieran Woodward , Eiman Kanjo , Athanasios Tsanas

This paper presents our system for the Multi-Task Learning (MTL) Challenge in the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. We explore the research problems of this challenge from three aspects: 1) For obtaining…

Computer Vision and Pattern Recognition · Computer Science 2022-08-31 Tenggan Zhang , Chuanhe Liu , Xiaolong Liu , Yuchen Liu , Liyu Meng , Lei Sun , Wenqiang Jiang , Fengyuan Zhang , Jinming Zhao , Qin Jin

Multimodal emotion recognition plays a key role in many domains, including mental health monitoring, educational interaction, and human-computer interaction. However, existing methods often face three major challenges: unbalanced category…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Feng Li , Ke Wu , Yongwei Li

Speech emotion recognition (SER) systems find applications in various fields such as healthcare, education, and security and defense. A major drawback of these systems is their lack of generalization across different conditions. This…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-15 Srinivas Parthasarathy , Carlos Busso

Automatic facial expression recognition is an important research area in the emotion recognition and computer vision. Applications can be found in several domains such as medical treatment, driver fatigue surveillance, sociable robotics,…

Computer Vision and Pattern Recognition · Computer Science 2020-02-03 Sevegni Odilon Clement Allognon , Alessandro L. Koerich , Alceu de S. Britto

Due to its ability to accurately predict emotional state using multimodal features, audiovisual emotion recognition has recently gained more interest from researchers. This paper proposes two methods to predict emotional attributes from…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-22 Bagus Tris Atmaja , Masato Akagi

Emotional states manifest as coordinated yet heterogeneous physiological responses across central and autonomic systems, posing a fundamental challenge for multimodal representation learning in affective computing. Learning such joint…

Machine Learning · Computer Science 2026-05-26 Deyang Zheng , Tianyi Zhang , Wenming Zheng , Shujian Yu

Multi-Task Learning (MTL) is a framework, where multiple related tasks are learned jointly and benefit from a shared representation space, or parameter transfer. To provide sufficient learning support, modern MTL uses annotated data with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Dimitrios Kollias , Viktoriia Sharmanska , Stefanos Zafeiriou

Emotion recognition is a critical component of affective computing. Training accurate machine learning models for emotion recognition typically requires a large amount of labeled data. Due to the subtleness and complexity of emotions,…

Machine Learning · Computer Science 2024-12-03 Yifan Xu , Xue Jiang , Dongrui Wu

Automated emotion recognition in the wild from facial images remains a challenging problem. Although recent advances in Deep Learning have supposed a significant breakthrough in this topic, strong changes in pose, orientation and point of…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Gerard Pons , David Masip

Multi-Task Learning (MTL) involves the concurrent training of multiple tasks, offering notable advantages for dense prediction tasks in computer vision. MTL not only reduces training and inference time as opposed to having multiple…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Maxime Fontana , Michael Spratling , Miaojing Shi

Although large language models (LLMs) perform well in general tasks, domain-specific applications suffer from hallucinations and accuracy limitations. Continual Pre-Training (CPT) approaches encounter two key issues: (1) domain-biased data…

Computation and Language · Computer Science 2025-05-21 Jingxue Chen , Qingkun Tang , Qianchun Lu , Siyuan Fang

Automatic affect recognition is a challenging task due to the various modalities emotions can be expressed with. Applications can be found in many domains including multimedia retrieval and human computer interaction. In recent years, deep…

Computer Vision and Pattern Recognition · Computer Science 2018-02-14 Panagiotis Tzirakis , George Trigeorgis , Mihalis A. Nicolaou , Björn Schuller , Stefanos Zafeiriou

Multi-task learning (MTL) enables the efficient transfer of extra knowledge acquired from other tasks. The high correlation between multimodal sentiment analysis (MSA) and multimodal emotion recognition (MER) supports their joint training.…

Artificial Intelligence · Computer Science 2025-05-21 Shuo Zhang , Jinsong Zhang , Zhejun Zhang , Lei Li
‹ Prev 1 2 3 10 Next ›