Robustness of Prosodic Features to Voice Imitation

Mireia Farrus, Michael Wagner, Jan Anguita, Javier Hernando

Research output: A Conference proceeding or a Chapter in BookConference contributionpeer-review

9 Citations (Scopus)
35 Downloads (Pure)

Abstract

Prosody plays an important role in the human recognition process; therefore, prosodic elements are normally used by impersonators aiming to resemble someone else. Since such voice imitation is one of the potential threats to security systems relying on automatic speaker recognition, and prosodic features have been considered for state-of-the-art recognition systems in recent years, the question arises as to what extent a mimicker is able to get close the prosodic characteristics of a target speaker. To this end, two experiments are conducted for twelve individual features in order to determine how a prosodic speaker identification system would perform against professionally imitated voices. The results show that the identification error rate increases for all the features except F0 range when the impersonators' modified voices are used instead of the impersonators natural voices. Moreover, it seems easier to copy prosody on the basis of a whole sentence than for a specific word.
Original languageEnglish
Title of host publicationProceedings of Interspeech 2008
Subtitle of host publicationincorporating SST 2008, 22-26 September 2008, Brisbane, Australia
Editors Fletcher, Goecke, Burnham, Wagner
Place of PublicationAustralia
PublisherInternational Speech Communication Association
Pages1-6
Number of pages6
ISBN (Print)9781615673780
Publication statusPublished - 2008
EventInterspeech 2008 - Brisbane, Australia
Duration: 22 Sept 200826 Sept 2008

Conference

ConferenceInterspeech 2008
Country/TerritoryAustralia
CityBrisbane
Period22/09/0826/09/08

Fingerprint

Dive into the research topics of 'Robustness of Prosodic Features to Voice Imitation'. Together they form a unique fingerprint.

Cite this