Depression is a serious psychiatric disorder that affects mood, thoughts, and the ability to function in everyday life. This paper investigates the characteristics of depressed speech for the purpose of automatic classification by analysing the effect of different speech features on the classification results. We analysed voiced, unvoiced and mixed speech in order to gain a better understanding of depressed speech and to bridge the gap between physiological and affective computing studies. This understanding may ultimately lead to an objective affective sensing system that supports clinicians in their diagnosis and monitoring of clinical depression. The characteristics of depressed speech were statistically analysed using ANOVA and linked to their classification results using GMM and SVM. Features were extracted and classified over speech utterances of 30 clinically depressed patients against 30 controls (both gender-matched) in a speaker-independent manner. Most feature classification results were consistent with their statistical characteristics, providing a link between physiological and affective computing studies. The classification results from low-level features were slightly better than the statistical functional features, which indicates a loss of information in the latter. We found that both mixed and unvoiced speech were as useful in detecting depression as voiced speech, if not better.
|Title of host publication||14th Annual Conference of the International Speech Communication Association Interspeech 2013|
|Editors||Frederic Bimbot, Cecile Fougeron, Francois Pellegrino|
|Place of Publication||Lyon, France|
|Publisher||International Speech Communication Association|
|Number of pages||5|
|Publication status||Published - 2013|
|Event||14th Annual Conference of the International Speech Communication Association Interspeech 2013 - Lyon, Lyon, France|
Duration: 25 Aug 2013 → 29 Aug 2013
|Conference||14th Annual Conference of the International Speech Communication Association Interspeech 2013|
|Abbreviated title||INTERSPEECH 2013|
|Period||25/08/13 → 29/08/13|
Alghowinem, S., GOECKE, R., WAGNER, M., Epps, J., Parker, G., & Breakspear, M. (2013). Characterising Depressed Speech for Classification. In F. Bimbot, C. Fougeron, & F. Pellegrino (Eds.), 14th Annual Conference of the International Speech Communication Association Interspeech 2013 (pp. 2534-2538). Lyon, France: International Speech Communication Association.