AusTalk, The Australian Speech Database: Design Framework, Recording Experience and Localisation

Sharifa Alghowinem, Michael WAGNER, Roland GOECKE

    Research output: A Conference proceeding or a Chapter in Book › Conference contribution

    Abstract

    Aiming to create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists contributing their disciplinary expertise. Three standardised one-hour audio-visual sessions were recorded for each of 1000 speakers around Australia, with diverse components suitable for different research areas. The design of this database provides a good framework for any speech corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as the problems faced and lessons learned. We also discuss localisation of this protocol and its potential customisation to other countries' specifications. Collecting such speech databases, including accent groups, is encouraged to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.
    Original language: English
    Title of host publication: 8th International Conference on IT in Asia 2013 (CITA'13)
    Editors: Jane Labadin, Jacey-Lynn Minoi, Dayang NurFatimah Awang Iskandar, Azman Bujang Masli
    Place of Publication: Malaysia
    Publisher: IEEE, Institute of Electrical and Electronics Engineers
    Pages: 1-7
    Number of pages: 7
    ISBN (Print): 9781479910915
    DOI: https://doi.org/10.1109/CITA.2013.6637567
    Publication status: Published - 2013
    Event: 8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle, Kuching, Malaysia
    Duration: 1 Jul 2013 to 4 Jul 2013

    Conference

    Conference: 8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle
    Country: Malaysia
    City: Kuching
    Period: 1/07/13 to 4/07/13

    Fingerprint

    Speech processing
    Linguistics
    Specifications

    Cite this

    Alghowinem, S., WAGNER, M., & GOECKE, R. (2013). AusTalk, The Australian Speech Database: Design Framework, Recording Experience and Localisation. In J. Labadin, J-L. Minoi, D. N. A. Iskandar, & A. B. Masli (Eds.), 8th International Conference on IT in Asia 2013 (CITA'13) (pp. 1-7). Malaysia: IEEE, Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/CITA.2013.6637567
    @inproceedings{92f668b7124741bda98e3a94d6a3336d,
    title = "AusTalk, The Australian Speech Database: Design Framework, Recording Experience and Localisation",
    abstract = "Aiming to create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists contributing their disciplinary expertise. Standardised three one-hour audio-visual sessions for each of 1000 speakers around Australia were recorded having diverse components suitable for different research areas. The design of this database provides a good framework for any speech data corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as problems faced and lessons learned. Localisation of this protocol and the potential customisation based on other countries' specifications are discussed. Collecting such speech databases including accent groups is encouraged to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.",
    keywords = "audio databases, audio recording, protocols",
    author = "Sharifa Alghowinem and Michael WAGNER and Roland GOECKE",
    year = "2013",
    doi = "10.1109/CITA.2013.6637567",
    language = "English",
    isbn = "9781479910915",
    pages = "1--7",
    editor = "Jane Labadin and Jacey-Lynn Minoi and Iskandar, {Dayang NurFatimah Awang} and Masli, {Azman Bujang}",
    booktitle = "8th International Conference on IT in Asia 2013 (CITA'13)",
    publisher = "IEEE, Institute of Electrical and Electronics Engineers",
    address = "United States",

    }

