Abstract
Aiming to create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists contributing their disciplinary expertise. Standardised three one-hour audio-visual sessions for each of 1000 speakers around Australia were recorded having diverse components suitable for different research areas. The design of this database provides a good framework for any speech data corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as problems faced and lessons learned. Localisation of this protocol and the potential customisation based on other countries' specifications are discussed. Collecting such speech databases including accent groups is encouraged to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.
Original language | English |
---|---|
Title of host publication | 8th International Conference on IT in Asia 2013 (CITA'13) |
Editors | Jane Labadin, Jacey-Lynn Minoi, Dayang NurFatimah Awang Iskandar, Azman Bujang Masli |
Place of Publication | Malaysia |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 1-7 |
Number of pages | 7 |
ISBN (Print) | 9781479910915 |
DOIs | |
Publication status | Published - 2013 |
Event | 8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle - Kuching, Kuching, Malaysia Duration: 1 Jul 2013 → 4 Jul 2013 |
Conference
Conference | 8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle |
---|---|
Country/Territory | Malaysia |
City | Kuching |
Period | 1/07/13 → 4/07/13 |