Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing

Roland GOECKE, QN Tran, J. Bruce Millar, Alexander Zelinsky, Jordi Robert-Ribes

    Research output: A Conference proceeding or a Chapter in Book > Conference contribution

    Abstract

    We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.
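The comparison described in the abstract (automatic versus manual measurements of mouth width and height) can be illustrated with a minimal sketch. It assumes that "average error" means the mean absolute difference between paired per-frame measurements, which the abstract does not specify. The function name `mean_absolute_error` and all sample values are hypothetical and are not taken from the paper.

```python
from statistics import mean


def mean_absolute_error(automatic_mm, manual_mm):
    """Average absolute difference between paired measurements, in mm.

    Assumption: one automatic and one manual measurement per video frame.
    """
    if len(automatic_mm) != len(manual_mm):
        raise ValueError("measurement lists must have the same length")
    if not automatic_mm:
        raise ValueError("at least one frame is required")
    return mean(abs(a - m) for a, m in zip(automatic_mm, manual_mm))


# Hypothetical per-frame internal mouth widths in millimetres.
# These numbers are invented for illustration only.
auto_width = [42.0, 44.5, 39.0, 47.5]
manual_width = [41.0, 46.0, 40.5, 46.5]

width_error = mean_absolute_error(auto_width, manual_width)
print(f"mean absolute width error: {width_error:.2f} mm")
```

Because the stereo vision system reports real-world distances, an error computed this way is directly in millimetres and needs no pixel-to-distance conversion.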
    Original language: English
    Title of host publication: Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000
    Editors: Michael Barlow, Philip Rose
    Place of Publication: Canberra
    Publisher: Australian Speech Science and Technology Association (ASSTA)
    Pages: 92-97
    Number of pages: 6
    ISBN (Print): 0958857989
    Publication status: Published - 4 Dec 2000
    Event: 8th Australian International Conference on Speech Science and Technology SST-2000 - Canberra, Australia
    Duration: 4 Dec 2000 to 7 Dec 2000

    Conference

    Conference: 8th Australian International Conference on Speech Science and Technology SST-2000
    Abbreviated title: SST-2000
    Country: Australia
    City: Canberra
    Period: 4/12/00 to 7/12/00

    Fingerprint

    Speech processing
    Feature extraction
    Stereo vision
    Experiments

    Cite this

    GOECKE, R., Tran, QN., Millar, J. B., Zelinsky, A., & Robert-Ribes, J. (2000). Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing. In M. Barlow, & P. Rose (Eds.), Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000 (pp. 92-97). Canberra: Australian Speech Science and Technology Association (ASSTA).
    GOECKE, Roland ; Tran, QN ; Millar, J. Bruce ; Zelinsky, Alexander ; Robert-Ribes, Jordi. / Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing. Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000. editor / Michael Barlow ; Philip Rose. Canberra : Australian Speech Science and Technology Association (ASSTA), 2000. pp. 92-97
    @inproceedings{f30c8062cd964a87b9d71ee47aab34c1,
    title = "Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing",
    abstract = "We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.",
    keywords = "Audio-Video Speech Data Corpus, Lip tracking",
    author = "Roland GOECKE and QN Tran and Millar, {J. Bruce} and Alexander Zelinsky and Jordi Robert-Ribes",
    year = "2000",
    month = "12",
    day = "4",
    language = "English",
    isbn = "0958857989",
    pages = "92--97",
    editor = "Michael Barlow and Philip Rose",
    booktitle = "Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000",
    publisher = "Australian Speech Science and Technology Association (ASSTA)",

    }

    GOECKE, R, Tran, QN, Millar, JB, Zelinsky, A & Robert-Ribes, J 2000, Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing. in M Barlow & P Rose (eds), Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000. Australian Speech Science and Technology Association (ASSTA), Canberra, pp. 92-97, 8th Australian International Conference on Speech Science and Technology SST-2000, Canberra, Australia, 4/12/00.

    Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing. / GOECKE, Roland; Tran, QN; Millar, J. Bruce; Zelinsky, Alexander; Robert-Ribes, Jordi.

    Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000. ed. / Michael Barlow; Philip Rose. Canberra : Australian Speech Science and Technology Association (ASSTA), 2000. p. 92-97.

    TY - GEN
    T1 - Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing
    AU - GOECKE, Roland
    AU - Tran, QN
    AU - Millar, J. Bruce
    AU - Zelinsky, Alexander
    AU - Robert-Ribes, Jordi
    PY - 2000/12/4
    Y1 - 2000/12/4
    N2 - We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.
    AB - We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.
    KW - Audio-Video Speech Data Corpus
    KW - Lip tracking
    UR - http://www.assta.org/sst/Abstracts-SST-2000.html
    UR - http://www.mendeley.com/research/validation-automatic-liptracking-algorithm-design-database-audiovideo-speech-processing
    M3 - Conference contribution
    SN - 0958857989
    SP - 92
    EP - 97
    BT - Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000
    A2 - Barlow, Michael
    A2 - Rose, Philip
    PB - Australian Speech Science and Technology Association (ASSTA)
    CY - Canberra
    ER -

    GOECKE R, Tran QN, Millar JB, Zelinsky A, Robert-Ribes J. Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing. In Barlow M, Rose P, editors, Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000. Canberra: Australian Speech Science and Technology Association (ASSTA). 2000. p. 92-97