Audio Visual Speaker Verification based on Hybrid Fusion of Cross Modal Features

Girija Chetty, Michael Wagner

Research output: A Conference proceeding or a Chapter in Book, Conference contribution

6 Citations (Scopus)

Abstract

In this paper, we propose a hybrid fusion of audio and explicit correlation features for speaker identity verification applications. Experiments were performed with GMM-based speaker models and a hybrid fusion technique involving late fusion of explicit cross-modal fusion features with implicit eigenlip and audio MFCC features. An evaluation of system performance on different gender-specific datasets from the controlled VidTIMIT database and the opportunistic UCBN database shows a significant performance improvement.
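The late-fusion step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the diagonal-covariance GMM, the background model used for the log-likelihood ratio, the weighted-sum fusion rule, and every function name and parameter below are assumptions chosen only to show how per-stream GMM scores might be combined at the score level.

```python
import math

def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    comp = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    m = max(comp)
    return m + math.log(sum(math.exp(c - m) for c in comp))

def llr_score(frames, speaker_gmm, background_gmm):
    """Average per-frame log-likelihood ratio (speaker vs. background).

    The background model is an assumption of this sketch; the abstract
    only states that GMM-based speaker models were used.
    """
    total = 0.0
    for x in frames:
        total += diag_gmm_loglik(x, *speaker_gmm) - diag_gmm_loglik(x, *background_gmm)
    return total / len(frames)

def late_fusion(scores, weights):
    """Score-level (late) fusion as a weighted sum of per-stream scores."""
    if len(scores) != len(weights):
        raise ValueError("one weight per stream score is required")
    return sum(w * s for w, s in zip(weights, scores))

def verify(fused_score, threshold=0.0):
    """Accept the identity claim when the fused score reaches the threshold."""
    return fused_score >= threshold
```

In this sketch, each feature stream (for example audio MFCCs, eigenlip projections, or explicit cross-modal correlation features) would be scored separately with `llr_score`, and the resulting scores passed to `late_fusion` with hypothetical stream weights before thresholding in `verify`.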
Original language: English
Title of host publication: Pattern Recognition and Machine Intelligence Proceedings
Editors: A. Ghosh, R. De
Place of Publication: Germany
Publisher: Springer
Pages: 469-478
Number of pages: 10
ISBN (Electronic): 9783540770466
ISBN (Print): 9783540770459
DOIs: https://doi.org/10.1007/978-3-540-77046-6_58
Publication status: Published - 2007
Event: Pattern Recognition and Machine Intelligence (PReMI), India
Duration: 18 Dec 2007 to 22 Dec 2007

Conference

Conference: Pattern Recognition and Machine Intelligence (PReMI)
Country: India
Period: 18/12/07 to 22/12/07

Cite this

Chetty, G., & Wagner, M. (2007). Audio Visual Speaker Verification based on Hybrid Fusion of Cross Modal Features. In A. Ghosh, & R. De (Eds.), Pattern Recognition and Machine Intelligence Proceedings (pp. 469-478). Springer. https://doi.org/10.1007/978-3-540-77046-6_58