Comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech

Mostafa Shahin, Beena Ahmed, Jacqueline McKechnie, Kirrie Ballard, Ricardo Gutierrez-Osuna

Research output: A Conference proceeding or a Chapter in BookConference contributionpeer-review

26 Citations (Scopus)

Abstract

This paper introduces a pronunciation verification method to be used in an automatic assessment therapy tool of child disordered speech. The proposed method creates a phonebased search lattice that is flexible enough to cover all probable mispronunciations. This allows us to verify the correctness of the pronunciation and detect the incorrect phonemes produced by the child. We compare between two different acoustic models, the conventional GMM-HMM and the hybrid DNN-HMM. Results show that the hybrid DNNHMM outperforms the conventional GMM-HMM for all experiments on both normal and disordered speech. The total correctness accuracy of the system at the phoneme level is above 85% when used with disordered speech.

Original languageEnglish
Title of host publication15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014)
Subtitle of host publicationCelebrating the Diversity of Spoken Languages
EditorsH. Li, P. Ching
Place of PublicationBaixas, France
PublisherInternational Speech Communication Association
Pages1583-1587
Number of pages5
Volume1
ISBN (Print)9781634394352
Publication statusPublished - 2014
Externally publishedYes
Event15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 - Singapore, Singapore
Duration: 14 Sept 201418 Sept 2014

Conference

Conference15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014
Country/TerritorySingapore
CitySingapore
Period14/09/1418/09/14

Fingerprint

Dive into the research topics of 'Comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech'. Together they form a unique fingerprint.

Cite this