TY - JOUR
T1 - An automated lexical stress classification tool for assessing dysprosody in childhood apraxia of speech
AU - McKechnie, Jacqueline
AU - Shahin, Mostafa
AU - Ahmed, Beena
AU - McCabe, Patricia
AU - Arciuli, Joanne
AU - Ballard, Kirrie J.
N1 - Funding Information:
Funding: This research was made possible by NPRP Grant # 8-293-2-124 (Ahmed, Gutierrez-Osuna, and Ballard) from the Qatar National Research Fund (a member of the Qatar Foundation) and an Australian Postgraduate Award (McKechnie). The statements made herein are solely the responsibility of the authors.
Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
PY - 2021/11
Y1 - 2021/11
N2 - Childhood apraxia of speech (CAS) commonly affects the production of lexical stress contrast in polysyllabic words. Automated classification tools have the potential to increase reliability and efficiency in measuring lexical stress. Here, factors affecting the accuracy of a custom-built deep neural network (DNN)-based classification tool are evaluated. Sixteen children with typical development (TD) and 26 with CAS produced 50 polysyllabic words. Words with strong–weak (SW, e.g., dinosaur) or WS (e.g., banana) stress were fed to the classification tool, and the accuracy measured (a) against expert judgment, (b) for speaker group, and (c) with/without prior knowledge of phonemic errors in the sample. The influence of segmental features and participant factors on tool accuracy was analysed. Linear mixed modelling showed significant interaction between group and stress type, surviving adjustment for age and CAS severity. For TD, agreement for SW and WS words was >80%, but CAS speech was higher for SW (>80%) than WS (~60%). Prior knowledge of segmental errors conferred no clear advantage. Automatic lexical stress classification shows promise for identifying errors in children’s speech at diagnosis or with treatment-related change, but accuracy for WS words in apraxic speech needs improvement. Further training of algorithms using larger sets of labelled data containing impaired speech and WS words may increase accuracy.
AB - Childhood apraxia of speech (CAS) commonly affects the production of lexical stress contrast in polysyllabic words. Automated classification tools have the potential to increase reliability and efficiency in measuring lexical stress. Here, factors affecting the accuracy of a custom-built deep neural network (DNN)-based classification tool are evaluated. Sixteen children with typical development (TD) and 26 with CAS produced 50 polysyllabic words. Words with strong–weak (SW, e.g., dinosaur) or WS (e.g., banana) stress were fed to the classification tool, and the accuracy measured (a) against expert judgment, (b) for speaker group, and (c) with/without prior knowledge of phonemic errors in the sample. The influence of segmental features and participant factors on tool accuracy was analysed. Linear mixed modelling showed significant interaction between group and stress type, surviving adjustment for age and CAS severity. For TD, agreement for SW and WS words was >80%, but CAS speech was higher for SW (>80%) than WS (~60%). Prior knowledge of segmental errors conferred no clear advantage. Automatic lexical stress classification shows promise for identifying errors in children’s speech at diagnosis or with treatment-related change, but accuracy for WS words in apraxic speech needs improvement. Further training of algorithms using larger sets of labelled data containing impaired speech and WS words may increase accuracy.
KW - Automatic speech recognition
KW - Childhood apraxia of speech
KW - Diagnosis
KW - Lexical stress
KW - Motor speech disorder
KW - Prosody
UR - http://www.scopus.com/inward/record.url?scp=85118263820&partnerID=8YFLogxK
U2 - 10.3390/brainsci11111408
DO - 10.3390/brainsci11111408
M3 - Article
AN - SCOPUS:85118263820
SN - 2076-3425
VL - 11
SP - 1
EP - 22
JO - Brain Sciences
JF - Brain Sciences
IS - 11
M1 - 1408
ER -