Abstract
In this paper we propose multimodal fusion of super resolved texture (SRT) features and 3D shape features
with acoustic features for 3D audio-video person authentication systems with liveness checks. The proposed SRT
features allow information related to non-rigid variations on speaking faces, such as expression lines, gestures,
and wrinkles, enhancing the performance of the system against impostor and spoof attacks. Experiments with
multimodal fusion of acoustic and super-resolved texture and 3D shape features for two different speaking face
data corpus, VidTIMIT, and AVOZES, allowed equal error rates (EERs) of less than 0.5 % for imposter and
type-1 replay attacks (still photo and pre-recorded audio) and less than 3% for more complex type-2 replay
attacks (pre-recorded video or fake CG animated video).
Original language | English |
---|---|
Title of host publication | Proceedings of the Conference on Image and Vision Computing - New Zealand |
Editors | Brendan McCane |
Place of Publication | New Zealand |
Publisher | University of Otago, Dunedin, New Zealand. |
Pages | 132-137 |
Number of pages | 6 |
ISBN (Print) | 00473105233 |
Publication status | Published - 2005 |
Event | IVCNZ-2005 - Dunedin, New Zealand Duration: 28 Nov 2005 → 29 Nov 2005 |
Conference
Conference | IVCNZ-2005 |
---|---|
Country/Territory | New Zealand |
City | Dunedin |
Period | 28/11/05 → 29/11/05 |