Multimodal Affect Recognition using Kinect

Human-Computer Interaction 2016-07-12 v1 Computer Vision and Pattern Recognition

Abstract

Affect (emotion) recognition has gained significant attention from researchers in the past decade. Emotion-aware computer systems and devices have many applications, ranging from interactive robots and intelligent online tutors to emotion-based navigation assistants. In this research, data from multiple modalities (face, head, hand, body, and speech) were used for affect recognition. A color and depth sensing device such as the Kinect was used to extract facial features and track human body joints. Temporal features computed across multiple frames were used for affect recognition. Event-driven decision-level fusion combined the results from the individual modalities, using majority voting to recognize the emotion. The study also implemented affect recognition by matching features against rule-based emotion templates for each modality. Experiments showed that multimodal recognition rates obtained with a combination of emotion templates and supervised learning were higher than those obtained with supervised learning alone. Recognition rates obtained with temporal features were higher than those obtained with position-based features only.
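
The decision-level fusion by majority voting mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fuse_by_majority`, the modality keys, the emotion labels, and the alphabetical tie-break are all assumptions made for illustration, and the abstract does not say how ties or silent modalities are handled.

```python
from collections import Counter
from typing import Mapping, Optional


def fuse_by_majority(predictions: Mapping[str, Optional[str]]) -> Optional[str]:
    """Combine per-modality emotion labels into one label by majority vote.

    `predictions` maps a modality name (e.g. "face", "speech") to the label
    its classifier produced, or None if that modality produced no decision.
    """
    # Count only the modalities that actually voted.
    votes = Counter(label for label in predictions.values() if label is not None)
    if not votes:
        return None  # no modality produced a decision

    top_count = max(votes.values())
    # Assumption: ties are broken alphabetically so the result is deterministic.
    winners = sorted(label for label, count in votes.items() if count == top_count)
    return winners[0]


# Hypothetical per-modality outputs for one event.
example = {
    "face": "happy",
    "head": "happy",
    "hand": "angry",
    "body": None,
    "speech": "happy",
}
fused = fuse_by_majority(example)
```

Here three of the four voting modalities agree, so the fused label is `"happy"`. Fusing at the decision level keeps each modality's classifier independent, so a modality that yields no decision for an event simply does not vote.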

Cite

@article{arxiv.1607.02652,
  title  = {Multimodal Affect Recognition using Kinect},
  author = {Amol Patwardhan and Gerald Knapp},
  journal= {arXiv preprint arXiv:1607.02652},
  year   = {2016}
}

Comments

9 pages, 2 tables, 1 figure, Peer reviewed in ACM TIST
