How Vulnerable are Prosodic Features to Professional Imitators?

Mireia Farrus, Michael Wagner, Jan Anguita, Javier Hernando

Research output: A Conference proceeding or a Chapter in BookConference contribution

Abstract

Voice imitation is one of the potential threats to security systems that use automatic speaker recognition. Since prosodic features have been considered for state-of-the-art recognition systems in recent years, the question arises as to how vulnerable these features are to voice mimicking. In this study, two experiments are conducted for twelve individual features in order to determine how a prosodic speaker identification system would perform against professionally imitated voices. By analysing prosodic parameters, the results show that the identification error rate increases for most of the features, except for the range of the fundamental frequency, which seems to be relatively robust against voice mimicking. When all twelve features are fused, the identification error rate increases from 5% between the target voices and the imitators’ natural voices to 22% between the target voices and the imitators’ impersonations
Original languageEnglish
Title of host publicationProceedings of Odyssey 2008
Subtitle of host publicationThe Speaker and Language Recognition Workshop
EditorsNico Brummer, Johann de Preez
Place of PublicationSouth Africa
PublisherInternational Speech Communication Association
Pages1-6
Number of pages6
Publication statusPublished - 2008
EventOdyssey 2008, The Speaker and Language Recognition Workshop - Stellenbosch, Stellenbosch, South Africa
Duration: 21 Jan 200824 Jan 2008

Publication series

NameOdyssey: The Speaker and Language Recognition Workshop
PublisherInternational Speech Communication Association

Conference

ConferenceOdyssey 2008, The Speaker and Language Recognition Workshop
CountrySouth Africa
CityStellenbosch
Period21/01/0824/01/08

Fingerprint Dive into the research topics of 'How Vulnerable are Prosodic Features to Professional Imitators?'. Together they form a unique fingerprint.

  • Cite this

    Farrus, M., Wagner, M., Anguita, J., & Hernando, J. (2008). How Vulnerable are Prosodic Features to Professional Imitators? In N. Brummer, & J. D. Preez (Eds.), Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop (pp. 1-6). (Odyssey: The Speaker and Language Recognition Workshop). International Speech Communication Association. https://www.isca-speech.org/archive_open/archive_papers/odyssey_2008/papers/od08_002.pdf