Audio Visual Speaker Verification based on Hybrid Fusion of Cross Modal Features

Girija Chetty, Michael Wagner

Research output: A Conference proceeding or a Chapter in Book > Conference contribution > peer-review

7 Citations (Scopus)


In this paper, we propose a hybrid fusion of audio and explicit correlation features for speaker identity verification. Experiments were performed with GMM-based speaker models using a hybrid fusion technique that involves late fusion of explicit cross-modal fusion features with implicit eigenlip and audio MFCC features. An evaluation of system performance on different gender-specific datasets from the controlled VidTIMIT database and the opportunistic UCBN database shows a significant performance improvement.
Original language: English
Title of host publication: Pattern Recognition and Machine Intelligence Proceedings
Editors: A. Ghosh, R. De
Place of Publication: Germany
Number of pages: 10
ISBN (Electronic): 9783540770466
ISBN (Print): 9783540770459
Publication status: Published - 2007
Event: Pattern Recognition and Machine Intelligence (PReMI), India
Duration: 18 Dec 2007 to 22 Dec 2007


Conference: Pattern Recognition and Machine Intelligence (PReMI)

