Estimation of Prior Probabilities in Speaker Recognition

    Research output: A Conference proceeding or a Chapter in BookConference contribution

    4 Citations (Scopus)

    Abstract

    According to Bayesian decision theory, the maximum a posteriori (MAP) decision rule is used to minimize the speaker recognition error rate. The a posteriori probability is determined if the a priori probability and the likelihood function are known. However, there has been no method to determine the a priori probability, therefore the maximum likelihood (ML) decision rule is used instead. The paper proposes a method to estimate the a priori probability for speakers based on a training data set and speaker models. Speaker identification experiments performed on 138 Gaussian mixture speaker models in the YOHO database using the MAP rule showed lower error rates than using the ML rule.
    Original languageEnglish
    Title of host publicationProceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing
    EditorsJ Kwok, LM Po
    Place of PublicationHong Kong
    PublisherIEEE, Institute of Electrical and Electronics Engineers
    Pages141-144
    Number of pages4
    ISBN (Print)0-7803-8688-4
    DOIs
    Publication statusPublished - 2004
    Event2004 International Symposium on Intelligent Multimedia, Video and Speech Processing - , Hong Kong
    Duration: 19 Oct 200421 Oct 2004

    Conference

    Conference2004 International Symposium on Intelligent Multimedia, Video and Speech Processing
    CountryHong Kong
    Period19/10/0421/10/04

    Fingerprint

    Maximum likelihood
    Decision theory
    Identification (control systems)
    Experiments

    Cite this

    Tran, D. (2004). Estimation of Prior Probabilities in Speaker Recognition. In J. Kwok, & LM. Po (Eds.), Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing (pp. 141-144). Hong Kong: IEEE, Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ISIMP.2004.1434020
    Tran, Dat. / Estimation of Prior Probabilities in Speaker Recognition. Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing. editor / J Kwok ; LM Po. Hong Kong : IEEE, Institute of Electrical and Electronics Engineers, 2004. pp. 141-144
    @inproceedings{b21ef1c7f5604fb585f511c8b32a6170,
    title = "Estimation of Prior Probabilities in Speaker Recognition",
    abstract = "According to Bayesian decision theory, the maximum a posteriori (MAP) decision rule is used to minimize the speaker recognition error rate. The a posteriori probability is determined if the a priori probability and the likelihood function are known. However, there has been no method to determine the a priori probability, therefore the maximum likelihood (ML) decision rule is used instead. The paper proposes a method to estimate the a priori probability for speakers based on a training data set and speaker models. Speaker identification experiments performed on 138 Gaussian mixture speaker models in the YOHO database using the MAP rule showed lower error rates than using the ML rule.",
    author = "Dat Tran",
    year = "2004",
    doi = "10.1109/ISIMP.2004.1434020",
    language = "English",
    isbn = "0-7803-8688-4",
    pages = "141--144",
    editor = "J Kwok and LM Po",
    booktitle = "Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing",
    publisher = "IEEE, Institute of Electrical and Electronics Engineers",
    address = "United States",

    }

    Tran, D 2004, Estimation of Prior Probabilities in Speaker Recognition. in J Kwok & LM Po (eds), Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing. IEEE, Institute of Electrical and Electronics Engineers, Hong Kong, pp. 141-144, 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, Hong Kong, 19/10/04. https://doi.org/10.1109/ISIMP.2004.1434020

    Estimation of Prior Probabilities in Speaker Recognition. / Tran, Dat.

    Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing. ed. / J Kwok; LM Po. Hong Kong : IEEE, Institute of Electrical and Electronics Engineers, 2004. p. 141-144.

    Research output: A Conference proceeding or a Chapter in BookConference contribution

    TY - GEN

    T1 - Estimation of Prior Probabilities in Speaker Recognition

    AU - Tran, Dat

    PY - 2004

    Y1 - 2004

    N2 - According to Bayesian decision theory, the maximum a posteriori (MAP) decision rule is used to minimize the speaker recognition error rate. The a posteriori probability is determined if the a priori probability and the likelihood function are known. However, there has been no method to determine the a priori probability, therefore the maximum likelihood (ML) decision rule is used instead. The paper proposes a method to estimate the a priori probability for speakers based on a training data set and speaker models. Speaker identification experiments performed on 138 Gaussian mixture speaker models in the YOHO database using the MAP rule showed lower error rates than using the ML rule.

    AB - According to Bayesian decision theory, the maximum a posteriori (MAP) decision rule is used to minimize the speaker recognition error rate. The a posteriori probability is determined if the a priori probability and the likelihood function are known. However, there has been no method to determine the a priori probability, therefore the maximum likelihood (ML) decision rule is used instead. The paper proposes a method to estimate the a priori probability for speakers based on a training data set and speaker models. Speaker identification experiments performed on 138 Gaussian mixture speaker models in the YOHO database using the MAP rule showed lower error rates than using the ML rule.

    U2 - 10.1109/ISIMP.2004.1434020

    DO - 10.1109/ISIMP.2004.1434020

    M3 - Conference contribution

    SN - 0-7803-8688-4

    SP - 141

    EP - 144

    BT - Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing

    A2 - Kwok, J

    A2 - Po, LM

    PB - IEEE, Institute of Electrical and Electronics Engineers

    CY - Hong Kong

    ER -

    Tran D. Estimation of Prior Probabilities in Speaker Recognition. In Kwok J, Po LM, editors, Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing. Hong Kong: IEEE, Institute of Electrical and Electronics Engineers. 2004. p. 141-144 https://doi.org/10.1109/ISIMP.2004.1434020