Abstract
This study investigates the utility of Long Short-Term Memory (LSTM) networks for modelling spatial-temporal patterns for micro-expression recognition (MER). Micro-expressions are involuntary, short facial expressions, often of low intensity. RNNs have attracted a lot of attention in recent years for modelling temporal sequences. The RNN-LSTM combination to be highly effective results in many application areas. The proposed method combines the recent VGGFace2 model, basically a ResNet-50 CNN trained on the VGGFace2 dataset, with uni-directional and bi-directional LSTM to explore different ways modelling spatial-temporal facial patterns for MER. The Grad-CAM heat map visualisation is used in the training stages to determine the most appropriate layer of the VGGFace2 model for retraining. Experiments are conducted with pure VGGFace2, VGGFace2 + uni-directional LSTM, and VGGFace2 + Bi-directional LSTM on the SMIC database using 5-fold cross-validation.
| Original language | English |
|---|---|
| Title of host publication | ICMI '20 Companion |
| Subtitle of host publication | Companion Publication of the 2020 International Conference on Multimodal Interaction |
| Editors | Khiet Truong, Dirk Heylen, Mary Czerwinski |
| Place of Publication | Netherlands |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 7-11 |
| Number of pages | 5 |
| ISBN (Print) | 9781450380027 |
| DOIs | |
| Publication status | Published - 25 Oct 2020 |
| Event | 22nd ACM International Conference on Multimodal Interaction - Utrecht, Netherlands Duration: 25 Oct 2020 → 29 Oct 2020 http://icmi.acm.org/2020/ |
Conference
| Conference | 22nd ACM International Conference on Multimodal Interaction |
|---|---|
| Abbreviated title | ICMI'20 |
| Country/Territory | Netherlands |
| City | Utrecht |
| Period | 25/10/20 → 29/10/20 |
| Internet address |