TY - GEN
T1 - Acute Pain Recognition from Facial Expression Videos using Vision Transformers
AU - Bargshady, Ghazal
AU - Joseph, Calvin
AU - Hirachan, Niraj
AU - Goecke, Roland
AU - Rojas, Raul Fernandez
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/7/19
Y1 - 2024/7/19
N2 - Pain assessment is significant for patients and clinicians in diagnosis and treatment injuries and disease. It could facilitate a patient’s treatment process by monitoring patients' pain levels in an accurate and regular manner. Automated detection of pain from facial expressions is a useful technique to assess pain of patients with communication disabilities. In this study, video vision transformers (ViViT) enhanced for pain recognition tasks are presented to capture spatio-temporal, facial information relevant to estimating the binary classification of pain and, thus, to provide valuable insights for automated estimation. The developed model has been trained and evaluated on two acute pain datasets, including 51 subjects using a newly collected pain intensity dataset designated as the AI4PAIN Challenge dataset, and 87 subjects from the BioVid Pain dataset. As an ablation study we used two baseline models, ResNet50 and a hybrid deep learning model based on the pretrained ResNet50+3DCNN. The results demonstrated that the proposed ViViT outperform the other models in pain detection by achieving accuracy = 66.96% for AI4PAIN dataset and accuracy = 79.95% for BioVid dataset.
AB - Pain assessment is significant for patients and clinicians in diagnosis and treatment injuries and disease. It could facilitate a patient’s treatment process by monitoring patients' pain levels in an accurate and regular manner. Automated detection of pain from facial expressions is a useful technique to assess pain of patients with communication disabilities. In this study, video vision transformers (ViViT) enhanced for pain recognition tasks are presented to capture spatio-temporal, facial information relevant to estimating the binary classification of pain and, thus, to provide valuable insights for automated estimation. The developed model has been trained and evaluated on two acute pain datasets, including 51 subjects using a newly collected pain intensity dataset designated as the AI4PAIN Challenge dataset, and 87 subjects from the BioVid Pain dataset. As an ablation study we used two baseline models, ResNet50 and a hybrid deep learning model based on the pretrained ResNet50+3DCNN. The results demonstrated that the proposed ViViT outperform the other models in pain detection by achieving accuracy = 66.96% for AI4PAIN dataset and accuracy = 79.95% for BioVid dataset.
KW - Computer vision
KW - Accuracy
KW - Pain
KW - Biological system modeling
KW - Face recognition
KW - Video sequences
KW - Transformers
KW - Functional near-infrared spectroscopy
KW - Monitoring
KW - Residual neural networks
UR - https://ieeexplore.ieee.org/document/10781616/
UR - https://embc.embs.org/2024/
UR - https://ieeexplore.ieee.org/xpl/conhome/10781475/proceeding
UR - http://www.scopus.com/inward/record.url?scp=85214989983&partnerID=8YFLogxK
U2 - 10.1109/EMBC53108.2024.10781616
DO - 10.1109/EMBC53108.2024.10781616
M3 - Conference contribution
SN - 9798350371505
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 1
EP - 4
BT - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings
A2 - Jung, Ranu
A2 - Wheeler, Bruce
A2 - Otto, Kelvin
A2 - Fernanda Cabrera-Umpiérrez, María
A2 - Mitsis , Georgios
A2 - Wang, May
A2 - Chan, Rosa
PB - IEEE, Institute of Electrical and Electronics Engineers
T2 - 2024 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
Y2 - 15 July 2024 through 19 July 2024
ER -