A Blueprint for a comprehensive Australian English auditory-visual speech corpus

Denis Burnham, Eliathamby Ambikairajah, Joanne Arciuli, Mohammed Bennamoun, Catherine T. Best, Steven Bird, Andrew R. Butcher, Steve Cassidy, Girija Chetty, Felicity M. Cox, Anne Cutler, Robert Dale, Julien R. Epps, Janet M. Fletcher, Roland Goecke, David B. Grayden, John T. Hajek, John C. Ingram, Shunichi Ishihara, Nenagh Kemp & 10 others Yuko Kinoshita, Takaaki Kuratate, Trent W. Lewis, Debbie E. Loakes, Mark Onslow, David M. Powers, Philip Rose, Roberto Togneri, Dat Tran, Michael Wagner

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

Abstract

Large auditory-visual (AV) speech corpora are the grist of modern research in speech science, but no such corpus exists for Australian English. This is unfortunate, for speech science is the brains behind speech technology and applications such as text-to-speech (TTS) synthesis, automatic speech recognition (ASR), speaker recognition and forensic identification, talking heads, and hearing prostheses. Advances in these research areas in Australia require a large corpus of Australian English. Here the authors describe a blueprint for building the Big Australian Speech Corpus (the Big ASC), a corpus of over 1,100 speakers from urban and rural Australia, including speakers of non-indigenous, indigenous, ethnocultural, and disordered forms of Australian English, each of whom would be sampled on three occasions in a range of speech tasks designed by the researchers who would be using the corpus.
Original languageEnglish
Title of host publicationSelected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus
Subtitle of host publicationmustering languages
EditorsMichael Haugh, Kate Burridge, Jean Mulder, Pam Peters
Place of PublicationSomerville, MA
PublisherCascadilla Proceedings Project
Pages96-107
Number of pages12
ISBN (Print)9781574734355
Publication statusPublished - 2009
EventHCSNet Workshop on Designing the Australian National Corpus - Sydney
Duration: 4 Dec 20095 Dec 2009

Workshop

WorkshopHCSNet Workshop on Designing the Australian National Corpus
CitySydney
Period4/12/095/12/09

    Fingerprint

Cite this

Burnham, D., Ambikairajah, E., Arciuli, J., Bennamoun, M., Best, C. T., Bird, S., ... Wagner, M. (2009). A Blueprint for a comprehensive Australian English auditory-visual speech corpus. In M. Haugh, K. Burridge, J. Mulder, & P. Peters (Eds.), Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages (pp. 96-107). Somerville, MA: Cascadilla Proceedings Project.