Topic Modeling Based Multi-modal Depression Detection

Yuan Gong; Christian Poellabauer

doi:10.1145/3133944.3133945

Topic Modeling Based Multi-modal Depression Detection

Computation and Language 2018-03-29 v1 Information Retrieval Machine Learning Sound Audio and Speech Processing

Authors: Yuan Gong , Christian Poellabauer

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

Major depressive disorder is a common mental disorder that affects almost 7% of the adult U.S. population. The 2017 Audio/Visual Emotion Challenge (AVEC) asks participants to build a model to predict depression levels based on the audio, video, and text of an interview ranging between 7-33 minutes. Since averaging features over the entire interview will lose most temporal information, how to discover, capture, and preserve useful temporal details for such a long interview are significant challenges. Therefore, we propose a novel topic modeling based approach to perform context-aware analysis of the recording. Our experiments show that the proposed approach outperforms context-unaware methods and the challenge baselines for all metrics.

Topic Modeling Based Multi-modal Depression Detection

Abstract

Keywords

Cite

Comments

Related papers