The Audio-Video Australian English Speech Data Corpus AVOZES

Roland Goecke, J Millar

    Research output: A Conference proceeding or a Chapter in BookConference contribution

    31 Citations (Scopus)

    Abstract

    This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.
    Original languageEnglish
    Title of host publicationINTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing
    EditorsS.H Kim, D.H Youn
    Place of PublicationCanada
    PublisherISCA
    Pages2525-2528
    Number of pages4
    Publication statusPublished - 2004
    EventINTERSPEECH 2004 - ICSLP 8th International Conference on Spoken Language Processing - Jeju, Korea, Republic of
    Duration: 3 Oct 20047 Oct 2004

    Conference

    ConferenceINTERSPEECH 2004 - ICSLP 8th International Conference on Spoken Language Processing
    CountryKorea, Republic of
    CityJeju
    Period3/10/047/10/04

    Fingerprint

    Video recording
    Speech recognition
    Cameras

    Cite this

    Goecke, R., & Millar, J. (2004). The Audio-Video Australian English Speech Data Corpus AVOZES. In S. H. Kim, & D. H. Youn (Eds.), INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing (pp. 2525-2528). Canada: ISCA.
    Goecke, Roland ; Millar, J. / The Audio-Video Australian English Speech Data Corpus AVOZES. INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing. editor / S.H Kim ; D.H Youn. Canada : ISCA, 2004. pp. 2525-2528
    @inproceedings{9c44b57937924b2cb39ee0b9a5697c22,
    title = "The Audio-Video Australian English Speech Data Corpus AVOZES",
    abstract = "This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.",
    author = "Roland Goecke and J Millar",
    year = "2004",
    language = "English",
    pages = "2525--2528",
    editor = "S.H Kim and D.H Youn",
    booktitle = "INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing",
    publisher = "ISCA",

    }

    Goecke, R & Millar, J 2004, The Audio-Video Australian English Speech Data Corpus AVOZES. in SH Kim & DH Youn (eds), INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing. ISCA, Canada, pp. 2525-2528, INTERSPEECH 2004 - ICSLP 8th International Conference on Spoken Language Processing, Jeju, Korea, Republic of, 3/10/04.

    The Audio-Video Australian English Speech Data Corpus AVOZES. / Goecke, Roland; Millar, J.

    INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing. ed. / S.H Kim; D.H Youn. Canada : ISCA, 2004. p. 2525-2528.

    Research output: A Conference proceeding or a Chapter in BookConference contribution

    TY - GEN

    T1 - The Audio-Video Australian English Speech Data Corpus AVOZES

    AU - Goecke, Roland

    AU - Millar, J

    PY - 2004

    Y1 - 2004

    N2 - This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.

    AB - This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.

    M3 - Conference contribution

    SP - 2525

    EP - 2528

    BT - INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing

    A2 - Kim, S.H

    A2 - Youn, D.H

    PB - ISCA

    CY - Canada

    ER -

    Goecke R, Millar J. The Audio-Video Australian English Speech Data Corpus AVOZES. In Kim SH, Youn DH, editors, INTERSPEECH 2004 - ICSLP: 8th International Conference on Spoken Language Processing. Canada: ISCA. 2004. p. 2525-2528