Related papers: Recognizing More Emotions with Less Data Using Sel…

200 papers

Emotion recognition datasets are relatively small, making the use of the more sophisticated deep learning approaches challenging. In this work, we propose a transfer learning method for speech emotion recognition where features extracted…

Sound · Computer Science 2021-04-09 Leonardo Pepino, Pablo Riera, Luciana Ferrer

This paper presents a transfer learning method in speech emotion recognition based on a Time-Delay Neural Network (TDNN) architecture. A major challenge in the current speech-based emotion detection research is data scarcity. The proposed…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-18 Sitong Zhou, Homayoon Beigi

Automatic emotion recognition plays a key role in computer-human interaction as it has the potential to enrich the next-generation artificial intelligence with emotional intelligence. It finds applications in customer and/or representative…

Sound · Computer Science 2022-02-21 Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Deep learning has been widely adopted in automatic emotion recognition and has led to significant progress in the field. However, due to insufficient annotated emotion datasets, pre-trained models are limited in their generalization…

Computer Vision and Pattern Recognition · Computer Science 2020-06-25 Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, David Dean, Clinton Fookes

Automatic speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction. One of the main challenges in SER is data scarcity, i.e., insufficient amounts of carefully labeled data to…

Sound · Computer Science 2021-08-17 Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Speech emotion recognition is a challenging task for three main reasons: 1) human emotion is abstract, which means it is hard to distinguish; 2) in general, human emotion can only be detected in some specific moments during a long…

Sound · Computer Science 2019-05-03 Yuanyuan Zhang, Jun Du, Zirui Wang, Jianshu Zhang

Speech emotion recognition is a challenging task and an important step towards more natural human-machine interaction. We show that pre-trained language models can be fine-tuned for text emotion recognition, achieving an accuracy of 69.5%…

Audio and Speech Processing · Electrical Eng. & Systems 2019-12-06 Verena Heusser, Niklas Freymuth, Stefan Constantin, Alex Waibel

Speech emotion recognition (SER) refers to the technique of inferring the emotional state of an individual from speech signals. SERs continue to garner interest due to their wide applicability. Although the domain is mainly founded on…

Audio and Speech Processing · Electrical Eng. & Systems 2022-03-29 Sneha Das, Nicklas Leander Lund, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

Emotion recognition is a topic of significant interest in assistive robotics due to the need to equip robots with the ability to comprehend human behavior, facilitating their effective interaction in our society. Consequently, efficient and…

Human-Computer Interaction · Computer Science 2023-12-05 Rutherford Agbeshi Patamia, Paulo E. Santos, Kingsley Nketia Acheampong, Favour Ekong, Kwabena Sarpong, She Kun

We propose emotion2vec, a universal speech emotion representation model. emotion2vec is pre-trained on open-source unlabeled emotion data through self-supervised online distillation, combining utterance-level loss and frame-level loss…

Computation and Language · Computer Science 2023-12-27 Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen

Best-performing speech models are trained on large amounts of data in the language they are meant to work for. However, most languages have sparse data, making training models challenging. This shortage of data is even more prevalent in…

Computation and Language · Computer Science 2024-10-08 David-Gabriel Ion, Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel

Automatic emotion recognition is one of the central concerns of the Human-Computer Interaction field as it can bridge the gap between humans and machines. Current works train deep learning models on low-level data representations to solve…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-22 Mariana Rodrigues Makiuchi, Kuniaki Uto, Koichi Shinoda

The use of deep learning techniques for automatic facial expression recognition has recently attracted great interest, but developed models are still unable to generalize well due to the lack of large emotion datasets for deep learning. To…

Computer Vision and Pattern Recognition · Computer Science 2018-05-28 Dung Nguyen, Kien Nguyen, Sridha Sridharan, Iman Abbasnejad, David Dean, Clinton Fookes

Speech Emotion Recognition (SER) presents a significant yet persistent challenge in human-computer interaction. While deep learning has advanced spoken language processing, achieving high performance on limited datasets remains a critical…

Audio and Speech Processing · Electrical Eng. & Systems 2025-09-03 Tai Vu

Recently, self-supervised pre-training has shown significant improvements in many areas of machine learning, including speech and NLP. We propose using large self-supervised pre-trained models for both audio and text modality with…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-24 Krishna D N

Speech is the most natural way of expressing ourselves as humans. Identifying emotion from speech is a nontrivial task due to the ambiguous definition of emotion itself. Speaker Emotion Recognition (SER) is essential for understanding human…

Sound · Computer Science 2024-11-07 Pourya Jafarzadeh, Amir Mohammad Rostami, Padideh Choobdar

This paper introduces Meta-PerSER, a novel meta-learning framework that personalizes Speech Emotion Recognition (SER) by adapting to each listener's unique way of interpreting emotion. Conventional SER systems rely on aggregated…

Audio and Speech Processing · Electrical Eng. & Systems 2025-05-23 Liang-Yeh Shen, Shi-Xin Fang, Yi-Cheng Lin, Huang-Cheng Chou, Hung-yi Lee

Acoustic emotion recognition aims to categorize the affective state of the speaker and is still a difficult task for machine learning models. The difficulties come from the scarcity of training data, general subjectivity in emotion…

Computation and Language · Computer Science 2018-04-02 Egor Lakomkin, Cornelius Weber, Sven Magg, Stefan Wermter

Emotion recognition is a challenging task due to limited availability of in-the-wild labeled datasets. Self-supervised learning has shown improvements on tasks with limited labeled datasets in domains like speech and natural language.…

Computation and Language · Computer Science 2021-04-08 Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

Speech Emotion Recognition (SER) plays a pivotal role in enhancing human-computer interaction by enabling a deeper understanding of emotional states across a wide range of applications, contributing to more empathetic and effective…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-25 Amirali Soltani Tehrani, Niloufar Faridani, Ramin Toosi