Abstract
Despite its many prima facie attractive properties for Forensic Speaker Recognition, F0 is regarded as having limited forensic value because of its large within-speaker variability. However, its forensic use to date has been limited mostly to its long-term mean and standard deviation. This paper examines the discriminatory potential, within a Likelihood Ratio (LR)-based approach, of four additional parametric features of the long-term F0 (LTF0) distribution: its skew, kurtosis, modal F0 and modal density. Motivated by the observation that the overall long-term F0 distribution shows less within-speaker occasion-to-occasion difference, we report a forensic discrimination experiment with non-contemporaneous speech samples from 201 male Japanese speakers. Using a multivariate LR as a discriminant distance with the six LTF0 distribution parameters, an equal error rate (EER) of 10.7% is obtained from 201 target and 80400 non-target trials. We also investigate how the EER degrades as a function of the amount of voiced speech.
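As an illustration of the six LTF0 distribution parameters named in the abstract, the following is a minimal sketch of how they might be computed from a track of frame-level F0 values. It is not the authors' procedure: the function name `ltf0_parameters`, the convention that unvoiced frames are coded as 0, and the histogram-based estimate of the mode (with an arbitrary bin count) are all assumptions made for the example.

```python
import numpy as np


def ltf0_parameters(f0_hz, bins=50):
    """Return six summary parameters of a long-term F0 distribution.

    Assumptions (not taken from the paper): unvoiced frames are coded
    as 0 and dropped; the mode is estimated from a fixed-bin histogram.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    f0 = f0[f0 > 0]  # keep voiced frames only

    mean = f0.mean()
    sd = f0.std(ddof=1)  # sample standard deviation

    # Moment-based skew and excess kurtosis on standardized values.
    z = (f0 - mean) / f0.std(ddof=0)
    skew = np.mean(z ** 3)
    kurtosis = np.mean(z ** 4) - 3.0

    # Histogram normalized to a probability density; the tallest bin
    # gives the modal F0 (bin centre) and the modal density (bin height).
    density, edges = np.histogram(f0, bins=bins, density=True)
    k = int(np.argmax(density))
    modal_f0 = 0.5 * (edges[k] + edges[k + 1])
    modal_density = float(density[k])

    return {
        "mean": float(mean),
        "sd": float(sd),
        "skew": float(skew),
        "kurtosis": float(kurtosis),
        "modal_f0": float(modal_f0),
        "modal_density": modal_density,
    }


# Usage with synthetic data (hypothetical values, not from the study):
rng = np.random.default_rng(0)
voiced = rng.normal(loc=120.0, scale=15.0, size=5000)  # F0 in Hz
track = np.concatenate([voiced, np.zeros(2000)])       # zeros = unvoiced
params = ltf0_parameters(track)
```

A kernel density estimate could replace the histogram if a smoother estimate of the mode is wanted; the histogram is used here only to keep the sketch dependency-free beyond NumPy.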
| Original language | English |
|---|---|
| Title of host publication | Odyssey 2008: The Speaker and Language Recognition Workshop |
| Editors | Niko Brummer |
| Place of Publication | Stellenbosch, South Africa |
| Publisher | International Speech Communication Association |
| Pages | 1-8 |
| Number of pages | 8 |
| Volume | 1 |
| ISBN (Print) | 9780620403313 |
| Publication status | Published - 2008 |
| Event | Odyssey 2008, The Speaker and Language Recognition Workshop - Stellenbosch, South Africa. Duration: 21 Jan 2008 → 24 Jan 2008 |
Conference
| Conference | Odyssey 2008, The Speaker and Language Recognition Workshop |
|---|---|
| Country/Territory | South Africa |
| City | Stellenbosch |
| Period | 21/01/08 → 24/01/08 |