Multimodal speaker verification based on easy-to-obtain biometric traits such as face and voice is rapidly gaining acceptance as the preferred technology for many applications. In many such practical applications, other characteristics of the speaker such as gender or age are known and may be exploited for enhanced verification accuracy. In this paper we present a parallel approach determining gender as an ancillary speaker characteristic, which is incorporated in the decision of a face-voice speaker verification system. Preliminary experiments with the DaFEx multimodal audio-video database show that fusing the results of gender recognition and identity verification improves the performance of multimodal speaker verification.
|Title of host publication||10th Annual Conference of the International Speech Communication Association (Interspeech 2009)|
|Place of Publication||Brighton, UK|
|Publisher||International Speech Communication Association|
|Number of pages||4|
|Publication status||Published - 2009|
|Event||Interspeech-2009 - Brighton, United Kingdom|
Duration: 6 Sep 2009 → 9 Sep 2009
|Period||6/09/09 → 9/09/09|
Chetty, G., & Wagner, M. (2009). Multimodal Speaker Verification Using Ancillary Known Speaker Characteristics Such as Gender or Age. In M. Uther (Ed.), 10th Annual Conference of the International Speech Communication Association (Interspeech 2009) (pp. 1167-1170). Brighton, UK: International Speech Communication Association.