Abstract
Quantifying behavioural changes in depression using affective computing techniques is the first step in developing an objective diagnostic aid, with clinical utility, for clinical depression. As part of the AVEC 2013 Challenge, we present a multimodal approach for the Depression Sub-Challenge using a GMM-UBM system with three different kernels for the audio subsystem and Space Time Interest Points in a Bag-of-Words approach for the vision subsystem. These are then fused at the feature level to form the combined AV system. Key results include the strong performance of acoustic audio features and the bag-of-words visual features in predicting an individual’s level of depression using regression. Interestingly, in the context of the small amount of literature on the subject, is that our feature level multimodal fusion technique is able to outperform both the audio and visual challenge baselines.
Original language | English |
---|---|
Title of host publication | AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge |
Editors | Bjorn Schuller, Michel Valstar, Roddy Cowie, Maja Pantic, Jarek Krajewski |
Place of Publication | Barcelona, Spain |
Publisher | Association for Computing Machinery (ACM) |
Pages | 11-20 |
Number of pages | 10 |
ISBN (Print) | 9781450323956 |
DOIs | |
Publication status | Published - 2013 |
Event | ACM International Workshop on Audio/Visual Emotion Challenge - Barcelona, Barcelona, Spain Duration: 21 Oct 2013 → 25 Oct 2013 |
Conference
Conference | ACM International Workshop on Audio/Visual Emotion Challenge |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 21/10/13 → 25/10/13 |