Speaker identification experiments are performed with novel features representative of the glottal source waveform. These are derived from closed-phase analysis and inverse filtering. Source waveforms are segmented into two consecutive periods and normalised in prosody, forming so called source-frame feature vectors. Support-vector-machines are used to construct speaker discriminative hyperplanes and identification rates are reported. Groups of male speakers of size 5 to 20 are examined from the YOHO corpus and 65% correct identification rates are achieved on a per source-frame basis. Finally the source-frames phonetic independence is confirmed with the TI 46-Word corpus.
|Publisher||Australasian Speech Science and Technology Association|
|Conference||14th Australasian International Conference on Speech Science and Technology|
|Period||3/12/12 → 6/12/12|