A Blueprint for a comprehensive Australian English auditory-visual speech corpus

Denis Burnham, Eliathamby Ambikairajah, Joanne Arciuli, Mohammed Bennamoun, Catherine T. Best, Steven Bird, Andrew R. Butcher, Steve Cassidy, Girija Chetty, Felicity M. Cox, Anne Cutler, Robert Dale, Julien R. Epps, Janet M. Fletcher, Roland Goecke, David B. Grayden, John T. Hajek, John C. Ingram, Shunichi Ishihara, Nenagh Kemp, Yuko Kinoshita, Takaaki Kuratate, Trent W. Lewis, Debbie E. Loakes, Mark Onslow, David M. Powers, Philip Rose, Roberto Togneri, Dat Tran, Michael Wagner

Research output: Chapter in Book/Report/Conference proceeding; Conference proceeding contribution; Research; peer-review

Abstract

Large auditory-visual (AV) speech corpora are the grist of modern research in speech science, but no such corpus exists for Australian English. This is unfortunate, for speech science is the brains behind speech technology and applications such as text-to-speech (TTS) synthesis, automatic speech recognition (ASR), speaker recognition and forensic identification, talking heads, and hearing prostheses. Advances in these research areas in Australia require a large corpus of Australian English. Here the authors describe a blueprint for building the Big Australian Speech Corpus (the Big ASC), a corpus of over 1,100 speakers from urban and rural Australia, including speakers of non-indigenous, indigenous, ethnocultural, and disordered forms of Australian English, each of whom would be sampled on three occasions in a range of speech tasks designed by the researchers who would be using the corpus.
Language: English
Title of host publication: Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus
Subtitle of host publication: mustering languages
Editors: Michael Haugh, Kate Burridge, Jean Mulder, Pam Peters
Place of Publication: Somerville, MA
Publisher: Cascadilla Proceedings Project
Pages: 96-107
Number of pages: 12
ISBN (Print): 9781574734355
Publication status: Published - 2009
Event: HCSNet Workshop on Designing the Australian National Corpus - Sydney
Duration: 4 Dec 2009 to 5 Dec 2009

Workshop

Workshop: HCSNet Workshop on Designing the Australian National Corpus
City: Sydney
Period: 4/12/09 to 5/12/09

Fingerprint

Blueprints
Speech synthesis
Audition
Prosthetics
Speech recognition
Brain

Cite this

Burnham, D., Ambikairajah, E., Arciuli, J., Bennamoun, M., Best, C. T., Bird, S., ... Wagner, M. (2009). A Blueprint for a comprehensive Australian English auditory-visual speech corpus. In M. Haugh, K. Burridge, J. Mulder, & P. Peters (Eds.), Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages (pp. 96-107). Somerville, MA: Cascadilla Proceedings Project.
Burnham, Denis ; Ambikairajah, Eliathamby ; Arciuli, Joanne ; Bennamoun, Mohammed ; Best, Catherine T. ; Bird, Steven ; Butcher, Andrew R. ; Cassidy, Steve ; Chetty, Girija ; Cox, Felicity M. ; Cutler, Anne ; Dale, Robert ; Epps, Julien R. ; Fletcher, Janet M. ; Goecke, Roland ; Grayden, David B. ; Hajek, John T. ; Ingram, John C. ; Ishihara, Shunichi ; Kemp, Nenagh ; Kinoshita, Yuko ; Kuratate, Takaaki ; Lewis, Trent W. ; Loakes, Debbie E. ; Onslow, Mark ; Powers, David M. ; Rose, Philip ; Togneri, Roberto ; Tran, Dat ; Wagner, Michael. / A Blueprint for a comprehensive Australian English auditory-visual speech corpus. Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages. editor / Michael Haugh ; Kate Burridge ; Jean Mulder ; Pam Peters. Somerville, MA : Cascadilla Proceedings Project, 2009. pp. 96-107
@inproceedings{6d0e9ef83d1e47d191e3c51324f47f27,
title = "A Blueprint for a comprehensive Australian English auditory-visual speech corpus",
abstract = "Large auditory-visual (AV) speech corpora are the grist of modern research in speech science, but no such corpus exists for Australian English. This is unfortunate, for speech science is the brains behind speech technology and applications such as text-to-speech (TTS) synthesis, automatic speech recognition (ASR), speaker recognition and forensic identification, talking heads, and hearing prostheses. Advances in these research areas in Australia require a large corpus of Australian English. Here the authors describe a blueprint for building the Big Australian Speech Corpus (the Big ASC), a corpus of over 1,100 speakers from urban and rural Australia, including speakers of non-indigenous, indigenous, ethnocultural, and disordered forms of Australian English, each of whom would be sampled on three occasions in a range of speech tasks designed by the researchers who would be using the corpus.",
author = "Denis Burnham and Eliathamby Ambikairajah and Joanne Arciuli and Mohammed Bennamoun and Best, {Catherine T.} and Steven Bird and Butcher, {Andrew R.} and Steve Cassidy and Girija Chetty and Cox, {Felicity M.} and Anne Cutler and Robert Dale and Epps, {Julien R.} and Fletcher, {Janet M.} and Roland Goecke and Grayden, {David B.} and Hajek, {John T.} and Ingram, {John C.} and Shunichi Ishihara and Nenagh Kemp and Yuko Kinoshita and Takaaki Kuratate and Lewis, {Trent W.} and Loakes, {Debbie E.} and Mark Onslow and Powers, {David M.} and Philip Rose and Roberto Togneri and Dat Tran and Michael Wagner",
year = "2009",
language = "English",
isbn = "9781574734355",
pages = "96--107",
editor = "Michael Haugh and Kate Burridge and Jean Mulder and Pam Peters",
booktitle = "Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus",
publisher = "Cascadilla Proceedings Project",

}

Burnham, D, Ambikairajah, E, Arciuli, J, Bennamoun, M, Best, CT, Bird, S, Butcher, AR, Cassidy, S, Chetty, G, Cox, FM, Cutler, A, Dale, R, Epps, JR, Fletcher, JM, Goecke, R, Grayden, DB, Hajek, JT, Ingram, JC, Ishihara, S, Kemp, N, Kinoshita, Y, Kuratate, T, Lewis, TW, Loakes, DE, Onslow, M, Powers, DM, Rose, P, Togneri, R, Tran, D & Wagner, M 2009, A Blueprint for a comprehensive Australian English auditory-visual speech corpus. in M Haugh, K Burridge, J Mulder & P Peters (eds), Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages. Cascadilla Proceedings Project, Somerville, MA, pp. 96-107, HCSNet Workshop on Designing the Australian National Corpus, Sydney, 4/12/09.

A Blueprint for a comprehensive Australian English auditory-visual speech corpus. / Burnham, Denis; Ambikairajah, Eliathamby; Arciuli, Joanne; Bennamoun, Mohammed; Best, Catherine T.; Bird, Steven; Butcher, Andrew R.; Cassidy, Steve; Chetty, Girija; Cox, Felicity M.; Cutler, Anne; Dale, Robert; Epps, Julien R.; Fletcher, Janet M.; Goecke, Roland; Grayden, David B.; Hajek, John T.; Ingram, John C.; Ishihara, Shunichi; Kemp, Nenagh; Kinoshita, Yuko; Kuratate, Takaaki; Lewis, Trent W.; Loakes, Debbie E.; Onslow, Mark; Powers, David M.; Rose, Philip; Togneri, Roberto; Tran, Dat; Wagner, Michael.

Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages. ed. / Michael Haugh; Kate Burridge; Jean Mulder; Pam Peters. Somerville, MA : Cascadilla Proceedings Project, 2009. p. 96-107.


TY - GEN
T1 - A Blueprint for a comprehensive Australian English auditory-visual speech corpus
AU - Burnham, Denis
AU - Ambikairajah, Eliathamby
AU - Arciuli, Joanne
AU - Bennamoun, Mohammed
AU - Best, Catherine T.
AU - Bird, Steven
AU - Butcher, Andrew R.
AU - Cassidy, Steve
AU - Chetty, Girija
AU - Cox, Felicity M.
AU - Cutler, Anne
AU - Dale, Robert
AU - Epps, Julien R.
AU - Fletcher, Janet M.
AU - Goecke, Roland
AU - Grayden, David B.
AU - Hajek, John T.
AU - Ingram, John C.
AU - Ishihara, Shunichi
AU - Kemp, Nenagh
AU - Kinoshita, Yuko
AU - Kuratate, Takaaki
AU - Lewis, Trent W.
AU - Loakes, Debbie E.
AU - Onslow, Mark
AU - Powers, David M.
AU - Rose, Philip
AU - Togneri, Roberto
AU - Tran, Dat
AU - Wagner, Michael
PY - 2009
Y1 - 2009
AB - Large auditory-visual (AV) speech corpora are the grist of modern research in speech science, but no such corpus exists for Australian English. This is unfortunate, for speech science is the brains behind speech technology and applications such as text-to-speech (TTS) synthesis, automatic speech recognition (ASR), speaker recognition and forensic identification, talking heads, and hearing prostheses. Advances in these research areas in Australia require a large corpus of Australian English. Here the authors describe a blueprint for building the Big Australian Speech Corpus (the Big ASC), a corpus of over 1,100 speakers from urban and rural Australia, including speakers of non-indigenous, indigenous, ethnocultural, and disordered forms of Australian English, each of whom would be sampled on three occasions in a range of speech tasks designed by the researchers who would be using the corpus.
M3 - Conference proceeding contribution
SN - 9781574734355
SP - 96
EP - 107
BT - Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus
A2 - Haugh, Michael
A2 - Burridge, Kate
A2 - Mulder, Jean
A2 - Peters, Pam
PB - Cascadilla Proceedings Project
CY - Somerville, MA
ER -

Burnham D, Ambikairajah E, Arciuli J, Bennamoun M, Best CT, Bird S et al. A Blueprint for a comprehensive Australian English auditory-visual speech corpus. In Haugh M, Burridge K, Mulder J, Peters P, editors, Selected proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: mustering languages. Somerville, MA: Cascadilla Proceedings Project. 2009. p. 96-107