F0 can tell us more: speaker verification using the long term distribution

Yuko Kinoshita, Shunichi Ishihara, Diederich BAKKER

Research output: A Conference proceeding or a Chapter in BookConference contribution

Abstract

This study explores some options for improving the performance of F0-based speaker verification. We tested different parameterisation techniques that enable us to capture non-unimodal distribution. We also tested the use of dynamic features (delta F0) and different scales (log10). As a result, we discovered that combinations of these techniques could significantly improve both the performance of speaker verification, and the reliability of the likelihood ratios.
Original languageEnglish
Title of host publicationProceedings of the 13th Australasian International Conference on Speech Science and Technology
Place of PublicationMelbourne, Australia
PublisherAustralian Speech Science and Technology Association (ASSTA)
Pages50-53
Number of pages4
ISBN (Print)9780958194631
Publication statusPublished - 2010
EventSST 2010: Thirteenth Australasian International Conference on Speech Science and Technology 2010 - Melbourne, Australia
Duration: 14 Dec 201016 Dec 2010

Conference

ConferenceSST 2010: Thirteenth Australasian International Conference on Speech Science and Technology 2010
CountryAustralia
CityMelbourne
Period14/12/1016/12/10

Fingerprint

Parameterization

Cite this

Kinoshita, Y., Ishihara, S., & BAKKER, D. (2010). F0 can tell us more: speaker verification using the long term distribution. In Proceedings of the 13th Australasian International Conference on Speech Science and Technology (pp. 50-53). Melbourne, Australia: Australian Speech Science and Technology Association (ASSTA).
Kinoshita, Yuko ; Ishihara, Shunichi ; BAKKER, Diederich. / F0 can tell us more: speaker verification using the long term distribution. Proceedings of the 13th Australasian International Conference on Speech Science and Technology. Melbourne, Australia : Australian Speech Science and Technology Association (ASSTA), 2010. pp. 50-53
@inproceedings{3cc1aa4e536f4b179dac1cb33d8fbde8,
title = "F0 can tell us more: speaker verification using the long term distribution",
abstract = "This study explores some options for improving the performance of F0-based speaker verification. We tested different parameterisation techniques that enable us to capture non-unimodal distribution. We also tested the use of dynamic features (delta F0) and different scales (log10). As a result, we discovered that combinations of these techniques could significantly improve both the performance of speaker verification, and the reliability of the likelihood ratios.",
author = "Yuko Kinoshita and Shunichi Ishihara and Diederich BAKKER",
year = "2010",
language = "English",
isbn = "9780958194631",
pages = "50--53",
booktitle = "Proceedings of the 13th Australasian International Conference on Speech Science and Technology",
publisher = "Australian Speech Science and Technology Association (ASSTA)",

}

Kinoshita, Y, Ishihara, S & BAKKER, D 2010, F0 can tell us more: speaker verification using the long term distribution. in Proceedings of the 13th Australasian International Conference on Speech Science and Technology. Australian Speech Science and Technology Association (ASSTA), Melbourne, Australia, pp. 50-53, SST 2010: Thirteenth Australasian International Conference on Speech Science and Technology 2010, Melbourne, Australia, 14/12/10.

F0 can tell us more: speaker verification using the long term distribution. / Kinoshita, Yuko; Ishihara, Shunichi; BAKKER, Diederich.

Proceedings of the 13th Australasian International Conference on Speech Science and Technology. Melbourne, Australia : Australian Speech Science and Technology Association (ASSTA), 2010. p. 50-53.

Research output: A Conference proceeding or a Chapter in BookConference contribution

TY - GEN

T1 - F0 can tell us more: speaker verification using the long term distribution

AU - Kinoshita, Yuko

AU - Ishihara, Shunichi

AU - BAKKER, Diederich

PY - 2010

Y1 - 2010

N2 - This study explores some options for improving the performance of F0-based speaker verification. We tested different parameterisation techniques that enable us to capture non-unimodal distribution. We also tested the use of dynamic features (delta F0) and different scales (log10). As a result, we discovered that combinations of these techniques could significantly improve both the performance of speaker verification, and the reliability of the likelihood ratios.

AB - This study explores some options for improving the performance of F0-based speaker verification. We tested different parameterisation techniques that enable us to capture non-unimodal distribution. We also tested the use of dynamic features (delta F0) and different scales (log10). As a result, we discovered that combinations of these techniques could significantly improve both the performance of speaker verification, and the reliability of the likelihood ratios.

M3 - Conference contribution

SN - 9780958194631

SP - 50

EP - 53

BT - Proceedings of the 13th Australasian International Conference on Speech Science and Technology

PB - Australian Speech Science and Technology Association (ASSTA)

CY - Melbourne, Australia

ER -

Kinoshita Y, Ishihara S, BAKKER D. F0 can tell us more: speaker verification using the long term distribution. In Proceedings of the 13th Australasian International Conference on Speech Science and Technology. Melbourne, Australia: Australian Speech Science and Technology Association (ASSTA). 2010. p. 50-53