Abstract
Depression is a mood disorder that has serious consequences for both individuals and society. The current diagnosis of depression relies on questionnaires and clinical interviews, which are complex and subjective, as they depend on multiple factors such as the patient’s comorbidities, cognitive ability, honesty in describing symptoms, and the experience and motivation of the clinician. An automated way of estimating depression severity objectively would, therefore, be of great assistance to clinicians. Over the last decade, various affective computing systems have been proposed that use machine learning, speech analysis, and computer vision techniques to extract unimodal or multimodal cues and estimate the severity of depression. Temporal information plays a crucial role in learning spatiotemporal patterns in depression data. When inferring depression severity from face videos, analysing long sequences at different temporal scales is potentially more informative than analysing short ones. Therefore, a new approach based on long sequence structured global convolution and various temporal scales on diverse kernels in an end-to-end ensemble model has been developed and evaluated for estimating depression severity from face video data. The application of this long-range dependency technique is novel in automated depression analysis. The proposed ensemble model explores the role of temporal scales in assessing depression severity from facial movement information and outperforms common deep learning models and single structured global convolution models. The results confirm that analysing facial movements at different temporal scales is an important component towards effective diagnostic aids in
depression analysis.
depression analysis.
Original language | English |
---|---|
Title of host publication | 2023 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2023) |
Subtitle of host publication | Techniques and Applications, DICTA 2023 |
Editors | Anwaar UIhaq, Phillip Torr, Manaranjan Paul, Toby Walsh, Subrata Chakraborty, Shams Islam, Imran Razzak |
Place of Publication | Los Alamitos (CA), USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 296-303 |
Number of pages | 8 |
ISBN (Electronic) | 9798350382204 |
ISBN (Print) | 9798350382211 |
DOIs | |
Publication status | Published - 29 Jan 2024 |
Event | DICTA 2023 - Australia, Port Macquarie, Australia Duration: 28 Nov 2023 → 1 Dec 2023 https://www.dictaconference.org/ |
Publication series
Name | 2023 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2023 |
---|
Conference
Conference | DICTA 2023 |
---|---|
Country/Territory | Australia |
City | Port Macquarie |
Period | 28/11/23 → 1/12/23 |
Internet address |