Beyond the long-term Mean: Exploring the Potential of F0 Distribution Parameters in Forensic Speaker Recognition

Yuko Kinoshita, Shunichi Ishihara, Phil Rose

Research output: A Conference proceeding or a Chapter in BookConference contributionpeer-review

Abstract

Despite its many prima facie attractive properties for Forensic Speaker Recognition, F0 is regarded as having limited forensic value due to its large within-speaker variability. However, its forensic use to date has been limited mostly to its long-term mean and standard deviation. This paper examines the discriminatory potential, within a Likelihood Ratio-based approach, of additional parametric features from the distribution of long-term F0: its skew, kurtosis, modal F0 and modal density. Motivated by the observation that the overall long-term F0 distribution shows less within-speaker occasion-to-occasion difference, we report a forensic discrimination experiment with noncontemporaneous speech samples from 201 male Japanese speakers. Using a multivariate LR as discriminant distance with the six LTF0 distribution parameters, an EER of 10.7% is obtained from 201 target and 80400 non-target trials. We also investigate how the EER degrades as a function of amount of voiced speech.
Original languageEnglish
Title of host publicationOdyssey 2008: The Speaker and Language Recognition Workshop
EditorsNiko Brummer
Place of PublicationStellenbosch, South Africa
PublisherInternational Speech Communication Association
Pages1-8
Number of pages8
Volume1
ISBN (Print)9780620403313
Publication statusPublished - 2008
EventOdyssey 2008, The Speaker and Language Recognition Workshop - Stellenbosch, Stellenbosch, South Africa
Duration: 21 Jan 200824 Jan 2008

Conference

ConferenceOdyssey 2008, The Speaker and Language Recognition Workshop
Country/TerritorySouth Africa
CityStellenbosch
Period21/01/0824/01/08

Fingerprint

Dive into the research topics of 'Beyond the long-term Mean: Exploring the Potential of F0 Distribution Parameters in Forensic Speaker Recognition'. Together they form a unique fingerprint.

Cite this