Abstract
Quantifying how the spectral content of speech relates to changes in mental state may be crucial in building an objective speech-based depression classification system with clinical utility. This paper investigates the hypothesis that important depression based information can be captured within the covariance structure of a Gaussian Mixture Model (GMM) of recorded speech. Significant negative correlations found between a speaker’s average weighted variance - a GMM-based indicator of speaker variability - and their level of depression support this hypothesis. Further evidence is provided by the comparison of classification accuracies from seven different GMM-UBM systems, each formed by varying different parameter combinations during MAP adaption. This analysis shows that variance-only adaptation either outperforms or matches the de facto standard mean-only adaptation when classifying both the presence and severity of depression. This result is perhaps the first of its kind seen in GMM-UBM speech classification.
Original language | English |
---|---|
Title of host publication | 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies |
Editors | Frederic Bimbot, Cecile Fougeron, Francois Pellegrino |
Place of Publication | Lyon, France |
Publisher | International Speech Communication Association |
Pages | 857-861 |
Number of pages | 5 |
Volume | 2 |
ISBN (Print) | 9781629934433 |
Publication status | Published - 2013 |
Event | 14th Annual Conference of the International Speech Communication Association Interspeech 2013 - Lyon, Lyon, France Duration: 25 Aug 2013 → 29 Aug 2013 |
Conference
Conference | 14th Annual Conference of the International Speech Communication Association Interspeech 2013 |
---|---|
Abbreviated title | INTERSPEECH 2013 |
Country/Territory | France |
City | Lyon |
Period | 25/08/13 → 29/08/13 |