Abstract
The hidden Markov model (HMM) is a double stochastic process. The observable process produces a sequence of observations and the hidden process is a Markov process. The HMM assumes that the occurrence of one observation is statistically independent of the occurrence of the others. To avoid this limitation, a temporal HMM is proposed. The hidden process in the temporal HMM is the same, but the observable process is now a Markov process. Each observation in the training sequence is assumed to be statistically dependent on its predecessor, and codewords or Gaussian components are used as states in the observable Markov process. Speaker identification experiments performed on 138 Gaussian mixture speaker models in the YOHO database shows a better performance for the temporal HMM compared to the standard HMM
Original language | English |
---|---|
Title of host publication | Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing |
Editors | J Kwok, LM Po |
Place of Publication | Hong Kong |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 137-140 |
Number of pages | 4 |
ISBN (Print) | 0-7803-8688-4 |
Publication status | Published - 2004 |
Event | 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing - , Hong Kong Duration: 19 Oct 2004 → 21 Oct 2004 |
Conference
Conference | 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing |
---|---|
Country/Territory | Hong Kong |
Period | 19/10/04 → 21/10/04 |