Speaker identification experiments are performed with novel features representative of the glottal source waveform. These are derived from closed-phase analysis and inverse filtering. Source waveforms are segmented into two consecutive periods and normalised in prosody, forming so called source-frame feature vectors. Support-vector-machines are used to construct speaker discriminative hyperplanes and identification rates are reported. Groups of male speakers of size 5 to 20 are examined from the YOHO corpus and 65% correct identification rates are achieved on a per source-frame basis. Finally the source-frames phonetic independence is confirmed with the TI 46-Word corpus.
|Title of host publication||Proceedings of the 14th Australasian International Conference on Speech Science and Technology|
|Editors||Felicity Cox, Katherine Demuth, Susan Lin, Kelly Miles, Sallyanne Palethrope, Jason Shaw, Ivan Yuen|
|Place of Publication||Sydney, Australia|
|Publisher||Australasian Speech Science and Technology Association (ASSTA)|
|Number of pages||4|
|Publication status||Published - 2012|
|Event||14th Australasian International Conference on Speech Science and Technology - Sydney, Sydney, Australia|
Duration: 3 Dec 2012 → 6 Dec 2012
|Publisher||Australasian Speech Science and Technology Association|
|Conference||14th Australasian International Conference on Speech Science and Technology|
|Period||3/12/12 → 6/12/12|
Vandyke, D., Wagner, M., Chetty, G., & Goecke, R. (2012). Speaker Identification Using Glottal-Source Waveforms and Support-Vector-Machine Modelling. In F. Cox, K. Demuth, S. Lin, K. Miles, S. Palethrope, J. Shaw, & I. Yuen (Eds.), Proceedings of the 14th Australasian International Conference on Speech Science and Technology (pp. 49-52). (ASSTA 2013). Sydney, Australia: Australasian Speech Science and Technology Association (ASSTA).