A comparative study of different classifiers for detecting depression from spontaneous speech

Sharifa Alghowinem, Roland Goecke, Michael Wagner, Julien Epps, Tom Gedeon, Michael Breakspear, Gordon Parker

Research output: A Conference proceeding or a Chapter in Book › Conference contribution

33 Citations (Scopus)

Abstract

Accurate detection of depression from spontaneous speech could lead to an objective diagnostic aid to assist clinicians to better diagnose depression. Little thought has been given so far to which classifier performs best for this task. In this study, using a 60-subject real-world clinically validated dataset, we compare three popular classifiers from the affective computing literature – Gaussian Mixture Models (GMM), Support Vector Machines (SVM) and Multilayer Perceptron neural networks (MLP) – as well as the recently proposed Hierarchical Fuzzy Signature (HFS) classifier. Among these, a hybrid classifier using GMM models and SVM gave the best overall classification results. Comparing feature, score, and decision fusion, score fusion performed better for GMM, HFS and MLP, while decision fusion worked best for SVM (both for raw data and GMM models). Feature fusion performed worse than other fusion methods in this study. We found that loudness, root mean square, and intensity were the voice features that performed best to detect depression in this dataset.
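The difference between score fusion and decision fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact fusion rules used in the study, so the sketch assumes two common choices: averaging per-class scores for score fusion, and majority voting for decision fusion. The function names, class labels, and score values are hypothetical and are not taken from the paper. Feature fusion (concatenating feature vectors before training a single classifier) is not shown.

```python
from collections import Counter
from typing import Dict, List

# One classifier's output for one subject: class label -> score.
Scores = Dict[str, float]


def score_fusion(per_feature_scores: List[Scores]) -> str:
    """Average each class's score across classifiers, then pick the top class.

    Assumption: mean-rule fusion; the paper may use a different rule.
    """
    labels = per_feature_scores[0].keys()
    fused = {
        label: sum(s[label] for s in per_feature_scores) / len(per_feature_scores)
        for label in labels
    }
    return max(fused, key=fused.get)


def decision_fusion(per_feature_scores: List[Scores]) -> str:
    """Let each classifier vote for its own top class, then take the majority.

    Assumption: simple majority voting; the paper may use a different rule.
    """
    votes = [max(s, key=s.get) for s in per_feature_scores]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical scores from three classifiers, each trained on one voice feature.
example = [
    {"depressed": 0.90, "control": 0.10},  # e.g. loudness
    {"depressed": 0.45, "control": 0.55},  # e.g. root mean square
    {"depressed": 0.40, "control": 0.60},  # e.g. intensity
]

print(score_fusion(example))     # one confident classifier dominates the average
print(decision_fusion(example))  # two weak votes outnumber one strong vote
```

In this made-up example the two schemes disagree: the averaged scores favour "depressed" because one classifier is very confident, while the majority vote favours "control". This is one reason the choice of fusion level can affect results differently for different classifiers.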
Original language: English
Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Editors: Rabab Ward, Li Deng
Place of Publication: Vancouver, British Columbia, Canada
Publisher: IEEE, Institute of Electrical and Electronics Engineers
Pages: 8022-8026
Number of pages: 5
ISBN (Electronic): 9781479903566
DOIs: https://doi.org/10.1109/ICASSP.2013.6639227
Publication status: Published - 2013
Event: 38th International Conference on Acoustics, Speech and Signal Processing ICASSP2013 - Vancouver, Canada
Duration: 26 May 2013 to 31 May 2013

Conference

Conference: 38th International Conference on Acoustics, Speech and Signal Processing ICASSP2013
Abbreviated title: ICASSP2013
Country: Canada
City: Vancouver
Period: 26/05/13 to 31/05/13
Other: The ICASSP meeting is the world's largest and most comprehensive technical conference focused on signal processing and its applications. The conference will feature world-class speakers, tutorials, exhibits, and over 50 lecture and poster sessions.

Cite this

Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., & Parker, G. (2013). A comparative study of different classifiers for detecting depression from spontaneous speech. In R. Ward, & L. Deng (Eds.), ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 8022-8026). Vancouver, British Columbia, Canada: IEEE, Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ICASSP.2013.6639227